Hello,
I get this strange message when I open some XML files. This specific file is the "footnote" xml file for a Word document. Perhaps it has something to do with Unicode characters??
"Warning. "footnotes.xml" contains characters that do no exist in code page 1252 (ANSI - Latin 1). They will be converted to the system default character, if you click OK."
Weird message when I open an XML file.
Moderators: AmigoJack, bbadmin, helios, Bob Hansen, MudGuard
-
- Posts: 2461
- Joined: Sun Mar 02, 2003 9:22 pm
Your file is presumably a Unicode file. Unicode is a 21-bit character set containing more than 100 000 characters; CP 1252 (which, whatever the message says, is not the same as Latin 1) is an 8-bit character set containing about 224 printable charcters. There are characters in your file that are in Unicode but not in CP 1252.
Internally, TextPad stores all the text of each document in a single 8-bit character set (a "code page" or "script"). It supports a number of code pages, but only one at a time: it displays all the text of each document using a single code page.
The range of code pages available depends on the font you are using. You can select the code page at
Configure | Preferences | Document Classes | <Class> | Font | Script
or
View | Document Properties | Font | Script.
Here is a correspondence between some script names and code pages:
If the characters in your document are not all in a single available code page, TextPad will not be able to display or save it correctly.
Internally, TextPad stores all the text of each document in a single 8-bit character set (a "code page" or "script"). It supports a number of code pages, but only one at a time: it displays all the text of each document using a single code page.
The range of code pages available depends on the font you are using. You can select the code page at
Configure | Preferences | Document Classes | <Class> | Font | Script
or
View | Document Properties | Font | Script.
Here is a correspondence between some script names and code pages:
Code: Select all
Western 1252
Greek 1253
Turkish 1254
Central European 1250
Cyrillic 1251