Converting Website construction to UNICODE
Posted: Thu Feb 15, 2018 4:34 pm
Greetings,
I am having a devil of a time figuring out how to convert my site to Unicode.
By Unicode I mean being able to see diacriticals in the font used in the textpad window, (versus & # 000; number ids), being able to input the diacriticals by typing in something like ALT000 or as in IAST Unicode & # 000; and having them appear correctly in the textpad window; being able to view the diacriticals in my web browser using 'view in web browser' (by way of an internal apache web server); and finally to view the correct diacriticals in the browser window from the installed web page.
I am using TP 8.1.2 64bit on a Windows 7 ultimate PC.
Currently I use a custom font (specifying charset="windows-1252") and call out the special characters with "<span class="mozp"></span>"
the files are then converted to IAST Unicode prior to uploading to the site:
http://obo.genaud.net
The advantage of having the diacriticals appear in the source htm files is obvious (I need to read the copy as I go along!), but so far I have not figured out how to manage it.
I have taken sample files and have tried converting the charset to "UTF-8" and leaving it as "Windows 1252", and have tried 'save as' UTF-8 and the default which is ANSI.
1. If I set up as charset="utf-8"; and 'save as' "utf-8"; and copy a selection of text with diacriticals from the browser window from my site:
a. I see the correct diacriticals in the textpad window
b. I am unable to input diacriticals in any way I have tried
c. view in web browser I get jibberish: Ekaɱ samayaɱ Bhagav� S�vatthiyaɱ
d. uploading to on-line site, ditto jibberish
2. Same situation using charset="windows 1252"
3. If I set up as charset="utf-8"; and 'save as' "utf-8"; and copy a selection of text with diacriticals from the source code page from my site:
a. I see only the character number id in the textpad window: Eka& # 625; samaya& # 625; Bhagav& # 257; S& # 257;vatthiya& # 625; viharati<br />
FOR
Ekaɱ samayaɱ Bhagavā Sāvatthiyaɱ viharati
b. I am able to input diacriticals only by copying and pasting the entire number id.
c. view in web browser comes up with the diacriticals properly displayed
d. uploading to on-line site, ditto diacriticals properly displayed
I have a macro keypad to input diacriticals. If I were able to see the diacriticals correctly in Textpad, and have them display properly in a web browser, I would not be concerned about how I went about inputting the characters.
I apologize for the way I present this problem. I am completely confused by this issue. Please feel free to ask for more information.
Any help you can provide with this issue will be appreciated.
I am having a devil of a time figuring out how to convert my site to Unicode.
By Unicode I mean being able to see diacriticals in the font used in the textpad window, (versus & # 000; number ids), being able to input the diacriticals by typing in something like ALT000 or as in IAST Unicode & # 000; and having them appear correctly in the textpad window; being able to view the diacriticals in my web browser using 'view in web browser' (by way of an internal apache web server); and finally to view the correct diacriticals in the browser window from the installed web page.
I am using TP 8.1.2 64bit on a Windows 7 ultimate PC.
Currently I use a custom font (specifying charset="windows-1252") and call out the special characters with "<span class="mozp"></span>"
the files are then converted to IAST Unicode prior to uploading to the site:
http://obo.genaud.net
The advantage of having the diacriticals appear in the source htm files is obvious (I need to read the copy as I go along!), but so far I have not figured out how to manage it.
I have taken sample files and have tried converting the charset to "UTF-8" and leaving it as "Windows 1252", and have tried 'save as' UTF-8 and the default which is ANSI.
1. If I set up as charset="utf-8"; and 'save as' "utf-8"; and copy a selection of text with diacriticals from the browser window from my site:
a. I see the correct diacriticals in the textpad window
b. I am unable to input diacriticals in any way I have tried
c. view in web browser I get jibberish: Ekaɱ samayaɱ Bhagav� S�vatthiyaɱ
d. uploading to on-line site, ditto jibberish
2. Same situation using charset="windows 1252"
3. If I set up as charset="utf-8"; and 'save as' "utf-8"; and copy a selection of text with diacriticals from the source code page from my site:
a. I see only the character number id in the textpad window: Eka& # 625; samaya& # 625; Bhagav& # 257; S& # 257;vatthiya& # 625; viharati<br />
FOR
Ekaɱ samayaɱ Bhagavā Sāvatthiyaɱ viharati
b. I am able to input diacriticals only by copying and pasting the entire number id.
c. view in web browser comes up with the diacriticals properly displayed
d. uploading to on-line site, ditto diacriticals properly displayed
I have a macro keypad to input diacriticals. If I were able to see the diacriticals correctly in Textpad, and have them display properly in a web browser, I would not be concerned about how I went about inputting the characters.
I apologize for the way I present this problem. I am completely confused by this issue. Please feel free to ask for more information.
Any help you can provide with this issue will be appreciated.