Re: [fpc-devel] Unit for handling UTF-8 strings

Michael Schnell Tue, 09 Apr 2013 00:49:42 -0700

On 04/09/2013 08:49 AM, Mattias Gaertner wrote:

But how do you examine the characters?

Even defining what a character is is extremely problematic with any use of Unicode. A "printable character" can be assembled from multiple Unicode code points (the code space holds about 1.1 million of them, U+0000 to U+10FFFF), and a single code point is represented by 1, 2, 3, or 4 bytes in UTF-8, or by 2 or 4 bytes in UTF-16. With UTF-16, the order of those bytes additionally depends on the CPU architecture and/or on the file the string is imported from and the way it is imported. This of course is not a problem introduced by fpc, but the perfectly normal complexity of Unicode.
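
To make that concrete, here is a small sketch of the effects described above. It is Python rather than fpc code, purely for illustration, and the names (`encoded_sizes`, `samples`, `euro_le`, `euro_be`) are made up for this example. It assumes nothing beyond the Python standard library.

```python
import unicodedata

# "e" followed by COMBINING ACUTE ACCENT: one printable character,
# but two Unicode code points.
decomposed = "e\u0301"
# NFC normalization yields the precomposed form, a single code point (U+00E9).
composed = unicodedata.normalize("NFC", decomposed)


def encoded_sizes(ch):
    """Return (bytes in UTF-8, bytes in UTF-16) for a single code point."""
    return len(ch.encode("utf-8")), len(ch.encode("utf-16-le"))


# One code point each from the ASCII, Latin-1, remaining BMP,
# and supplementary-plane ranges.
samples = ["A", "\u00e9", "\u20ac", "\U0001F600"]
sizes = [encoded_sizes(ch) for ch in samples]

# Byte order matters for the 16-bit code units of UTF-16, not for UTF-8.
euro_le = "\u20ac".encode("utf-16-le")
euro_be = "\u20ac".encode("utf-16-be")

if __name__ == "__main__":
    print(len(decomposed), len(composed))
    for ch, (u8, u16) in zip(samples, sizes):
        print(f"U+{ord(ch):04X}: {u8} byte(s) in UTF-8, {u16} in UTF-16")
    print(euro_le.hex(), euro_be.hex())
```

Indexing such a string by byte, by code unit, or by code point gives three different answers, and none of them is guaranteed to match what a user sees as one character.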

If I understand Michael right, there will be some "implicit functions" for that. I wonder how they work.

This is what Delphi compatibility dictated. (You might read the Delphi XE docs on how to code Unicode-enabled Delphi source.)

I do hope fpc avoids some of the quirks Delphi introduces and offers some useful additional features, e.g. dedicated string types such as unencoded (raw, never auto-converted) Byte, Word and DWord strings, and a "flexibly encoded" string type that inherits the encoding scheme from the source string when doing an assignment or when used as a function parameter, doing auto-conversion whenever dynamically necessary.


-Michael
_______________________________________________
fpc-devel maillist  -  fpc-devel@lists.freepascal.org
http://lists.freepascal.org/mailman/listinfo/fpc-devel
