UTF-8 encoding
Posted: Mon Oct 25, 2004 3:25 pm
When you click Save As and choose the UTF-8 option in the code section, TextPad does encode special characters in UTF-8, BUT it doesn't prefix the UTF-8 stream with the character U+FEFF (ZERO WIDTH NO-BREAK SPACE), also known as the Byte-Order Mark (BOM).
Some programs like Flash (and maybe others...) then won't read the file as a UTF-8 stream but as a standard ASCII file.
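To make the difference concrete, here is a small Python sketch (not TextPad's code, just an illustration) that shows the three bytes EF BB BF which U+FEFF becomes in UTF-8, and how to check whether a saved file starts with them. The helper name has_utf8_bom and the sample text are made up for the example.

```python
import codecs


def has_utf8_bom(data: bytes) -> bool:
    """Return True if the byte string starts with the UTF-8 encoded BOM."""
    # codecs.BOM_UTF8 is the three bytes EF BB BF (U+FEFF encoded in UTF-8).
    return data.startswith(codecs.BOM_UTF8)


sample = "caf\u00e9"  # hypothetical text containing one special character

# Plain UTF-8, as the editor currently writes it: no BOM at the start.
without_bom = sample.encode("utf-8")

# Python's "utf-8-sig" codec writes the BOM first, then the UTF-8 text.
with_bom = sample.encode("utf-8-sig")

print(has_utf8_bom(without_bom))
print(has_utf8_bom(with_bom))
```

A program that relies on the BOM to detect the encoding would only recognize the second form as UTF-8.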
I read this page as a reference before posting this suggestion:
http://www.cl.cam.ac.uk/~mgk25/unicode.html#ucsutf
It mentions, and I quote:
A good encoding converter will also offer options for adding or removing the BOM:
* Unconditionally prefix the output text with U+FEFF.
* Prefix the output text with U+FEFF unless it is already there.
* Remove the first character if it is U+FEFF.
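For what it's worth, the three options quoted above could look roughly like this in Python. This is only a sketch working on already-decoded text; the function names are my own and have nothing to do with TextPad's internals.

```python
BOM = "\ufeff"  # U+FEFF, ZERO WIDTH NO-BREAK SPACE / Byte-Order Mark


def add_bom_always(text: str) -> str:
    """Unconditionally prefix the output text with U+FEFF."""
    return BOM + text


def add_bom_if_missing(text: str) -> str:
    """Prefix the output text with U+FEFF unless it is already there."""
    return text if text.startswith(BOM) else BOM + text


def strip_bom(text: str) -> str:
    """Remove the first character if it is U+FEFF."""
    return text[1:] if text.startswith(BOM) else text


# Encoding the result as UTF-8 turns the leading U+FEFF into bytes EF BB BF.
data = add_bom_if_missing("hello").encode("utf-8")
```

A "Save As" dialog would only need a checkbox to choose between the second and third behavior.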
I hope this helps, and congratulations on the editor, it is really a good one, my favorite actually... so I hope this bug can be fixed!
Thanks.