Why does it say "ANSI characters"?
It's ASCII up to 128... then what does it use? Some ISO standard? Or some extended ASCII one? "ANSI ANSI"?
"ANSI" characters?
Moderators: AmigoJack, bbadmin, helios, Bob Hansen, MudGuard
-
- Posts: 2461
- Joined: Sun Mar 02, 2003 9:22 pm
A good question. Even Microsoft acknowledges that the term "ANSI character" is wrong (http://www.microsoft.com/globaldev/refe ... ssary.mspx):
Perhaps TextPad tries to use the code page set in Windows' "Regional and Language Option". In English-speaking countries by default it's windows-1252.
You can set the code page with
Windows-1252 (CP1252) differs from iso-8859-1 in code points 0x80..0x9F. In iso-8859-1 these are control characters; in windows-1252 most of them are printable characters.ANSI: Acronym for the American National Standards Institute. The term “ANSI� as used to signify Windows code pages is a historical reference, but is nowadays a misnomer that continues to persist in the Windows community. The source of this comes from the fact that the Windows code page 1252 was originally based on an ANSI draft�which became International Organization for Standardization (ISO) Standard 8859-1. “ANSI applications� are usually a reference to non-Unicode or code page–based applications.
Perhaps TextPad tries to use the code page set in Windows' "Regional and Language Option". In English-speaking countries by default it's windows-1252.
You can set the code page with
orConfigure | Preferences | Document Classes | <Class> | Font | Script
The correspondence isView | Document Properties | Font | Script
Code: Select all
1250 Central European
1251 Cyrillic
1252 Western; Latin 1
1253 Greek
1254 Turkish
The term ANSI is used, because the characters defined for code points between 128 and 255 depend on the selected font script. For example, Western characters are different from Central European, and two bytes are needed for many Japanese characters. These characters are defined by ANSI standards - although Windows may not adhere to those standards.
Keith MacDonald
Helios Software Solutions
Keith MacDonald
Helios Software Solutions
-
- Posts: 2461
- Joined: Sun Mar 02, 2003 9:22 pm
They are used. They're control characters. If you putMudGuard wrote:The ANSI character set is a superset of ISO-8859-1 - in ISO-8859-1 the code positions 128 to 159 are not used, in ANSI they are used - e.g. ANSI 128 is the EURO currency symbol.
Code: Select all
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1" />
-
- Posts: 2461
- Joined: Sun Mar 02, 2003 9:22 pm
Hehe... at least it seems less than obvious.
It has bugged me for a while. I knew, of course, what ASCII is. I knew that ANSI is a standards body.
I had read something about "extended" ASCII, to be called simply "ANSI", which would make it "ANSI ANSI" in my eyes...
If it's ISO-8859-1 or similar, a great burden of confused would be taken away from the "back" of my mind.
It has bugged me for a while. I knew, of course, what ASCII is. I knew that ANSI is a standards body.
I had read something about "extended" ASCII, to be called simply "ANSI", which would make it "ANSI ANSI" in my eyes...
If it's ISO-8859-1 or similar, a great burden of confused would be taken away from the "back" of my mind.