Weird message when I open an XML file.

General questions about using TextPad

Moderators: AmigoJack, bbadmin, helios, Bob Hansen, MudGuard

Post Reply
pb4072
Posts: 14
Joined: Tue Nov 10, 2009 7:24 pm
Location: Washington, DC area

Weird message when I open an XML file.

Post by pb4072 »

Hello,
I get this strange message when I open some XML files. This specific file is the "footnote" xml file for a Word document. Perhaps it has something to do with Unicode characters??

"Warning. "footnotes.xml" contains characters that do no exist in code page 1252 (ANSI - Latin 1). They will be converted to the system default character, if you click OK."
ben_josephs
Posts: 2461
Joined: Sun Mar 02, 2003 9:22 pm

Post by ben_josephs »

Your file is presumably a Unicode file. Unicode is a 21-bit character set containing more than 100 000 characters; CP 1252 (which, whatever the message says, is not the same as Latin 1) is an 8-bit character set containing about 224 printable charcters. There are characters in your file that are in Unicode but not in CP 1252.

Internally, TextPad stores all the text of each document in a single 8-bit character set (a "code page" or "script"). It supports a number of code pages, but only one at a time: it displays all the text of each document using a single code page.

The range of code pages available depends on the font you are using. You can select the code page at
Configure | Preferences | Document Classes | <Class> | Font | Script
or
View | Document Properties | Font | Script.

Here is a correspondence between some script names and code pages:

Code: Select all

Western            1252
Greek              1253
Turkish            1254
Central European   1250
Cyrillic           1251
If the characters in your document are not all in a single available code page, TextPad will not be able to display or save it correctly.
Post Reply