RE: Newbie questions: 1) Surrogates in WinXP? 2) Unicode in PostScript?

At 10:49 PM 4/7/2004, Peter Constable wrote:

>, and the length it reports
> is the number of code units, not the number of characters or graphemes
in
> the string.

True; that is documented.


However, that's very common; many APIs relating to UTF-8 would report
the number of bytes, not the number of characters.

While it's interesting to have a method that can derive grapheme
boundaries, e.g. for UI support, it's far less useful to get a grapheme count.

A./

Reply via email to