UTF-8 file suddenly saved as ANSI

General questions about using TextPad

Moderators: AmigoJack, bbadmin, helios, Bob Hansen, MudGuard

Post Reply
Xover
Posts: 7
Joined: Fri Apr 23, 2004 3:33 pm

UTF-8 file suddenly saved as ANSI

Post by Xover »

Hi,

I'm currently working on a text file which was saved as UTF-8. A few days ago, it was suddenly saved as ANSI and therefore had many characters replaced by weird stuff like ä
I managed to get it saved as UTF-8 again, but that didn't change the characters back to what they were before.
What happened there? Do I have to manually replace all destroyed characters with the correct ones? And could that happen again? It basically ruined the work of many days, and I can swear that I didn't do anything different the day before it happened, I just saved the file as usual.
Michel Merlin
Posts: 1
Joined: Sat Feb 17, 2007 5:47 pm
Location: Versailles (France)

May be due to the MS-known UTF-8 conversion problems

Post by Michel Merlin »

Xover wrote:I'm currently working on a text file which was saved as UTF-8. A few days ago, it was suddenly saved as ANSI and therefore had many characters replaced by weird stuff like ä
..........
And could that happen again?
I am afraid it could.

If you are in Windows and have OE (Outlook Express), it would be useful to try first the test I posted with detailed images on Sun 21 Jan 2007 16:39:10 GMT in Please post successful test of source-editing UTF-8 European HTML.

This problem usually comes from an MS bug in OE, that may also be found elsewhere since apparently due IMO to the complication inherent to UTF-8 (that outpasses the average ability in programmers' crowds). So, if you can do the test, we can discuss further.

PS 19:12:40. MS itself admits problems when using UTF-8; see in Why change to Unicode 5.0: "...Inadequate algorithmic support for operations such as UTF-8 conversions".

Versailles, Sat 17 Feb 2007 19:05:20 +0100
Post Reply