Hi,
I'm currently working on a text file which was saved as UTF-8. A few days ago, it was suddenly saved as ANSI and therefore had many characters replaced by weird stuff like ä
I managed to get it saved as UTF-8 again, but that didn't change the characters back to what they were before.
What happened there? Do I have to manually replace all destroyed characters with the correct ones? And could that happen again? It basically ruined the work of many days, and I can swear that I didn't do anything different the day before it happened; I just saved the file as usual.
UTF-8 file suddenly saved as ANSI
May be due to the MS-known UTF-8 conversion problems
Xover wrote:
I'm currently working on a text file which was saved as UTF-8. A few days ago, it was suddenly saved as ANSI and therefore had many characters replaced by weird stuff like ä
..........
And could that happen again?

I am afraid it could.
If you are on Windows and have OE (Outlook Express), it would be useful to first try the test I posted, with detailed images, on Sun 21 Jan 2007 16:39:10 GMT in "Please post successful test of source-editing UTF-8 European HTML".
This problem usually comes from an MS bug in OE, which may also be found elsewhere, since IMO it is apparently due to the complexity inherent in UTF-8 (which exceeds the average programmer's ability). So, if you can do the test, we can discuss further.
PS 19:12:40. MS itself admits problems when using UTF-8; see "Why change to Unicode 5.0": "...Inadequate algorithmic support for operations such as UTF-8 conversions".
Versailles, Sat 17 Feb 2007 19:05:20 +0100