Looking for non-English sources to test encoding detection
Jenna:
I'm currently experimenting with Mozilla's charset detection for C::B (see this thread: http://forums.codeblocks.org/index.php/topic,10159.msg70493.html#msg70493)
I'm looking for files that use encodings that are not correctly recognized by C::B's encoding detection.
I mean any files that can only be opened after conversion to UTF-8, by forcing a special fallback encoding, or by bypassing C::B's autodetection.
Especially files that contain Chinese, Japanese, Cyrillic, Eastern European or Hebrew characters.
It would be nice to have a native and a UTF-8 version to see if the characters are detected/displayed correctly.
Please don't attach such files to your posts, but send them via mail to "chardet at jenslody dot de".
That way we avoid unnecessary server load.
I will put them on my server so that others can test with them if they want.
They will be available at http://chardet.jenslody.de/ (empty at the moment).
If you don't want the files to be published, please put a short note inside the mail.
I'm interested in single files and of course also in complete (short example) projects/workspaces.
ollydbg:
OK, I can report some files that are located in the Code::Blocks source folder:
src/plugins/codecompletion/parser/tokenizer.cpp
src/sdk/wxscintilla/src/scintilla/src/LexMatlab.cxx
src/sdk/wxscintilla/src/scintilla/src/LexErlang.cxx
src/sdk/wxscintilla/src/scintilla/src/Editor.cxx
src/sdk/resources/lexers/lexer_css.xml
src/plugins/compilergcc/compilergcc.cpp
Thank you!
Jenna:
The last two files are identified correctly both in pure trunk and with the Mozilla detection (one as UTF-8 with BOM and the other as UTF-8 without BOM).
The others work only with the system fallback encoding on trunk and are detected as CP1252 (Windows-1252) by the Mozilla detector.
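To illustrate the three outcomes above (UTF-8 with BOM, UTF-8 without BOM, fallback to a single-byte codepage), here is a minimal sketch in Python. It is neither C::B's actual detection code nor Mozilla's detector, which relies on statistical analysis; the function name `guess_encoding` and the CP1252 default are assumptions made purely for illustration.

```python
import codecs


def guess_encoding(data: bytes, fallback: str = "cp1252") -> str:
    """Hypothetical helper: return a codec name for the raw bytes of a file.

    Order of checks: UTF-8 BOM, then strict UTF-8 validation, then fallback.
    """
    # A UTF-8 byte order mark (EF BB BF) at the start is unambiguous.
    if data.startswith(codecs.BOM_UTF8):
        return "utf-8-sig"
    # Without a BOM, accept UTF-8 only if every byte sequence is valid.
    try:
        data.decode("utf-8")
        return "utf-8"
    except UnicodeDecodeError:
        # Not valid UTF-8: assume the (hypothetical) fallback codepage.
        return fallback


if __name__ == "__main__":
    samples = {
        "with BOM": codecs.BOM_UTF8 + "Größe".encode("utf-8"),
        "without BOM": "Größe".encode("utf-8"),
        "legacy codepage": "Größe".encode("cp1252"),
    }
    for label, raw in samples.items():
        print(label, "->", guess_encoding(raw))
```

Note that a pure ASCII file is reported as UTF-8 by this sketch, and that the fallback is only a guess: bytes that are invalid UTF-8 could just as well come from any other legacy encoding, which is where a statistical detector becomes useful.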
ollydbg:
ok :D
These files come from this bug report, posted one week ago:
http://forums.codeblocks.org/index.php/topic,10130.msg70316.html#msg70316
nanyu:
I've sent one.