Re: [Lazarus] How to use strings properly with fixes_1_6 and FPC 3.0.0?

Martin Frb via Lazarus Fri, 21 Oct 2016 18:13:59 -0700

On 21/10/2016 22:16, Juha Manninen via Lazarus wrote:

UTF-16. It does not support all the complex rules of combining
CodePoints, but it apparently works well for accented characters in
western languages.


Which ones does it not support?

When I added it to SynEdit it was complete. It had all the combining codepoints that the Unicode standard had back then (at least all that I could find in the documentation).

Of course, if a new combining range is added, it will not contain it. If that is needed, one needs an external (OS or otherwise) library that can/will be updated on those occasions.
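As an illustration of the "external library" route, here is a minimal Python sketch (not how SynEdit does it) that asks a maintained Unicode database instead of a hard-coded range table. The helper name is_combining_mark is made up for this example, and treating every codepoint in general category "M" as combining is a simplifying assumption.

```python
import unicodedata

def is_combining_mark(ch):
    """Hypothetical helper: True if the Unicode database puts ch in a
    Mark category (Mn, Mc or Me). This is a simplification, used here
    only to show the lookup."""
    return unicodedata.category(ch).startswith("M")

# The answer depends on the Unicode version bundled with the library,
# which changes when the library is updated.
print(unicodedata.unidata_version)

print(is_combining_mark("\u0302"))  # COMBINING CIRCUMFLEX ACCENT -> True
print(is_combining_mark("a"))       # plain letter -> False
```

The point is only that the table lives outside the application, so newly added combining ranges arrive with a library update rather than a code change.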

Mind, "combining codepoints" have nothing to do with how many codepoints will be represented by one glyph.

"â" is one character. But it can be a single codepoint (in utf16 one code unit, i.e. one word; in utf8 several code units, i.e. bytes), or 2 codepoints ("a" + combining "^").
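The two representations of "â" can be counted in a small Python sketch. The function name code_units is made up for this example; the codepoints used are U+00E2 (precomposed) and "a" followed by U+0302 (combining circumflex).

```python
import unicodedata

precomposed = "\u00e2"   # "â" as a single codepoint
decomposed = "a\u0302"   # "a" + COMBINING CIRCUMFLEX ACCENT

def code_units(s):
    """Return (codepoints, UTF-16 code units, UTF-8 code units) for s."""
    utf16_units = len(s.encode("utf-16-le")) // 2  # 2 bytes per code unit
    utf8_units = len(s.encode("utf-8"))            # 1 byte per code unit
    return (len(s), utf16_units, utf8_units)

print(code_units(precomposed))  # (1, 1, 2)
print(code_units(decomposed))   # (2, 2, 3)

# Both spell the same character: NFC normalization composes the pair.
print(unicodedata.normalize("NFC", decomposed) == precomposed)  # True
```

Both strings are one character to the user, yet every count differs, so neither the codepoint count nor the code-unit count says how many characters a string contains.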

"fi" are 2 chars. But they may be displayed as 2 glyphs or as 1 glyph (a ligature).

It is my understanding (but I do not know for sure) that in some languages (such as Arabic) certain letter combinations form a single glyph (afaik/google see https://en.wikipedia.org/wiki/Hamzah combined with a letter). Though maybe it is considered 2 glyphs? I do not know Arabic at all.

Also, in some scripts glyphs are displayed in an order different from their occurrence in the text.

All of this, however, has nothing to do with combining codepoints, or with what counts as a character.


