>, and the length it reports > is the number of code units, not the number of characters or graphemes in > the string.
True; that is documented.
However, that's very common; many APIs relating to UTF-8 would report the number of bytes, not the number of characters.
While it's interesting to have a method that can derive grapheme boundaries, e.g. for UI support, it's far less useful to get a grapheme count.
A./

