Re: [fpc-devel] Unicode support (again)

Michael Schnell Mon, 10 Nov 2008 08:11:27 -0800

I found that the current FPC does have Unicode support, but there are some problems.

- WideStrings work fine with Unicode UCS-2, but they (of course) have similar issues to UTF8 strings when surrogate codes are used (which is rarely necessary in Europe and America).

- FPC does not have a dedicated type "UTF8String": the type defined as "UTF8String" is just the same as ANSIString, so the compiler can't decide which one the programmer means and can't create the appropriate code when it's necessary to distinguish between them (e.g. when it should automatically convert between locale-coded ANSIString, UTF8String and WideString).

- by design (for speed's sake), UTF8String (and WideString when surrogate codes are used) count in subcodes (code units: bytes for UTF-8, 16-bit words for UTF-16) and not in Unicode characters, so the behavior is "unexpected" when doing things like s[i], pos(s), copy(), delete(), ... There are no _slow_ functions that do the "expected" versions of s[i], pos(s), copy(), delete(), ... (I've yet to find out how I can print just the first character of a UTF8String :)

- there is no decent "character" type for UTF8 or UTF16 coded characters (WideChar (UCS-2 code) works if no surrogate codes are used).

- there are different options for how the compiler expects the source file to be coded. Seemingly, if it detects the file to be UTF8 coded and a certain (otherwise correct) option is set, even "s := 'hallo äöü';" does not work as expected if s is a WideString. (Lazarus with default settings suffers from this problem.)
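
To make the s[i] point concrete, here is a minimal sketch of how the first character of a UTF8 coded string could be extracted by hand. It assumes the string holds valid UTF-8, and it returns the first code point only (not a full grapheme with combining marks). Utf8SeqLen and FirstUtf8Char are hypothetical helper names made up for this example, not RTL functions.

```pascal
program FirstUtf8CharDemo;

{$mode objfpc}{$H+}

{ Hypothetical helper: number of bytes in the UTF-8 sequence that
  starts with lead byte b. Invalid lead bytes (including stray
  continuation bytes) are counted as a single byte. }
function Utf8SeqLen(b: Byte): Integer;
begin
  if b < $80 then
    Result := 1                 { 0xxxxxxx: plain ASCII }
  else if (b and $E0) = $C0 then
    Result := 2                 { 110xxxxx }
  else if (b and $F0) = $E0 then
    Result := 3                 { 1110xxxx }
  else if (b and $F8) = $F0 then
    Result := 4                 { 11110xxx }
  else
    Result := 1;
end;

{ Hypothetical helper: returns the bytes of the first code point of
  a UTF-8 coded string, or an empty string if s is empty. }
function FirstUtf8Char(const s: AnsiString): AnsiString;
begin
  if s = '' then
    Result := ''
  else
    Result := Copy(s, 1, Utf8SeqLen(Ord(s[1])));
end;

var
  s: AnsiString;
begin
  { "a-umlaut" followed by "bc", written as explicit UTF-8 bytes so the
    example does not depend on how the compiler decodes the source file. }
  s := #$C3#$A4 + 'bc';
  WriteLn('bytes in s: ', Length(s));
  WriteLn('bytes in first character: ', Length(FirstUtf8Char(s)));
end.
```

Here s[1] alone would give only the byte $C3, which is half of the first character. Length(s) reports 4 although the string has three characters, and FirstUtf8Char(s) returns the two bytes $C3 $A4.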


-Michael
_______________________________________________
fpc-devel maillist  -  fpc-devel@lists.freepascal.org
http://lists.freepascal.org/mailman/listinfo/fpc-devel
