Hi all,
there seems to be a bug in search in files.
If searching for an word which has German Umlauts, Textpad does not find the word when the charcter set of the file is set to utf-8. There is no difference whether the file is saved with or without BOM.
If the files charcter set is to ANSI TP does find the words.
If searching another word, nearby the umlaut-word in the utf-8 file, TP finds the the other word and displays the umlaut-word in a wrong encoding.
If somebody want to see my search results, i can send screenshot.
Win7 x64 TP 7.0.9 German
Regards
Horst
Search in files - wrong encoding for German Umlauts
Moderators: AmigoJack, bbadmin, helios, Bob Hansen, MudGuard
the same problem with syntax definition
Hi,
keywords in syntax definition with Umlaute are not highlighted in UTF8 files when they contain german Umlaute (like üäöß).
keywords in syntax definition with Umlaute are not highlighted in UTF8 files when they contain german Umlaute (like üäöß).
New tests...
The function "search in files" do NOT work in any file type, which contains the umlauts but for ANSI files.
It is somehow confusing, but i try to explain what works and what doesn't.
I have tested a folder containing 7 files with all availabe file types. Every files has just one line, which explains the file.
ANSI
utf-8 without BOM
utf-8 with BOM
Unicode
Unicode/Big Endian
Unicode/Big Endian without containing umlauts
Unicode without containing umlauts
If searching for the word "test", next to the umlaut word in the file, TP finds the the other word and displays the whole line. The umlaut word is shown in a wrong encoding:
http://haeberlen.org/privat/tp/textpad_ ... auts_e.PNG
Searching for the word "test" should find 7 files, but it does find only 4 occurrences. Unicode or Unicode/Big Endian or files which are containing umlauts are not in the list. But if the Unicode file do not contain umlauts it will be found by TP.
Confusing - Test it yourself:
http://haeberlen.org/privat/tp/testfiles.zip
Horst
The function "search in files" do NOT work in any file type, which contains the umlauts but for ANSI files.
It is somehow confusing, but i try to explain what works and what doesn't.
I have tested a folder containing 7 files with all availabe file types. Every files has just one line, which explains the file.
ANSI
utf-8 without BOM
utf-8 with BOM
Unicode
Unicode/Big Endian
Unicode/Big Endian without containing umlauts
Unicode without containing umlauts
If searching for the word "test", next to the umlaut word in the file, TP finds the the other word and displays the whole line. The umlaut word is shown in a wrong encoding:
http://haeberlen.org/privat/tp/textpad_ ... auts_e.PNG
Searching for the word "test" should find 7 files, but it does find only 4 occurrences. Unicode or Unicode/Big Endian or files which are containing umlauts are not in the list. But if the Unicode file do not contain umlauts it will be found by TP.
Confusing - Test it yourself:
http://haeberlen.org/privat/tp/testfiles.zip
Horst