Re: [Lazarus] Unicode on Windows

Hans-Peter Diettrich Mon, 09 Apr 2012 15:59:12 -0700

Mattias Gaertner wrote:

Yes. For Unicode encoding we would need new functions to distinguish between the number of bytes and the number of (visible) glyphs:
LengthInBytes()
LengthInGlyphs()

It should be mentioned that Unicode allows different encodings of composed and decomposed characters. E.g. 'é' can be stored as a single precomposed codepoint (U+00E9) or as two decomposed codepoints, 'e' followed by a combining acute accent (U+0065 U+0301). Even though both encodings look the same on screen, Pos (or UTF8Pos) will only find the encoding given in the search string. It also has to be specified what LengthInGlyphs really should return: the number of actually visible glyphs? And what should it return for ligatures etc.?
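
To illustrate the search problem, here is a minimal sketch in Python (used only for illustration, not Pascal). The helper name normalized_find is hypothetical. It assumes that normalizing both strings to the same form (NFC here) before searching is acceptable for the application:

```python
import unicodedata

composed = "\u00e9"      # 'é' as one precomposed code point
decomposed = "e\u0301"   # 'e' followed by COMBINING ACUTE ACCENT

text = "caf" + decomposed

# A plain substring search compares code points, so the composed
# search string does not match the decomposed text.
plain_result = text.find(composed)

def normalized_find(haystack, needle, form="NFC"):
    """Hypothetical helper: search after normalizing both strings.

    The returned index refers to the normalized haystack, which may
    differ from the position in the original string.
    """
    h = unicodedata.normalize(form, haystack)
    n = unicodedata.normalize(form, needle)
    return h.find(n)

print(plain_result)                      # -1
print(normalized_find(text, composed))   # 3
```

Note that the position found is an index into the normalized text, so it cannot be used blindly on the original string.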


Every user has to know which kind of "length" he really wants to get:
- number of bytes for storage in a fixed-size variable or streaming
- number of glyphs for length-restricted user input
- number of pixels for GUI layout (TextWidth)
...
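
The first two kinds of "length" in the list above can be sketched as follows, again in Python for illustration. The function names are hypothetical and only echo LengthInBytes/LengthInGlyphs from the proposal. The glyph count is a rough approximation that merely skips combining marks; it is not full grapheme cluster segmentation. Pixel width is omitted because it depends on the font and the widgetset.

```python
import unicodedata

def length_in_bytes(s, encoding="utf-8"):
    # Storage size: depends on the chosen encoding.
    return len(s.encode(encoding))

def length_in_codepoints(s):
    # Python strings are sequences of code points.
    return len(s)

def length_in_glyphs_approx(s):
    # Approximation (an assumption of this sketch): count every code
    # point that is not a combining mark. Ligatures, ZWJ sequences and
    # similar cases are NOT handled.
    return sum(1 for ch in s if unicodedata.combining(ch) == 0)

for label, s in [("composed e-acute", "\u00e9"),
                 ("decomposed e-acute", "e\u0301"),
                 ("fi ligature", "\ufb01")]:
    print(label, length_in_bytes(s), length_in_codepoints(s),
          length_in_glyphs_approx(s))
```

The ligature U+FB01 counts as one glyph here although a user may read it as two letters, which is exactly the ambiguity that has to be specified.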

DoDi


--
_______________________________________________
Lazarus mailing list
[email protected]
http://lists.lazarus.freepascal.org/mailman/listinfo/lazarus
