> On Dec 17, 2025, at 22:17, Jacob Moody <[email protected]> wrote:
> 
> I've been poking at some of the utf* functions lately and utfutf is a bit 
> puzzling.
> At face value, strstr() should be sufficient for handling utf8 encoded 
> strings just as strcmp() is.

Maybe normalization could be the reason: the same character can have multiple
representations.  For example, ü might be a single code point (Unicode: U+00FC,
UTF-8: C3 BC), or it might be u followed by a combining umlaut (U+0308).  I
would assume converting to runes would turn out the same either way, so you
could compare them even if the haystack is represented one way in UTF-8 and the
needle the other way.  (Disclaimer: I’m not a Unicode expert, even less so on
Plan 9.)
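
To make the two representations concrete, here is a minimal sketch in portable
C (not Plan 9 C, and not calling utfutf itself).  The names u_precomposed,
u_decomposed, bytes_contain and codepoint_count are made up for illustration.
It only shows how strstr() treats the two byte sequences and how many code
points each one holds; it says nothing about what utfutf actually does.

```c
#include <stddef.h>
#include <string.h>

/* The two spellings of "ü" from the example above (hypothetical names). */
const char u_precomposed[] = "\xC3\xBC";   /* U+00FC as UTF-8 */
const char u_decomposed[]  = "u\xCC\x88";  /* U+0075 U+0308 as UTF-8 */

/* Bytewise substring test: returns 1 if needle occurs in haystack. */
int
bytes_contain(const char *haystack, const char *needle)
{
	return strstr(haystack, needle) != NULL;
}

/*
 * Count code points in a well-formed UTF-8 string by counting every
 * byte that is not a continuation byte (10xxxxxx).
 */
size_t
codepoint_count(const char *s)
{
	size_t n = 0;

	for (; *s != '\0'; s++)
		if (((unsigned char)*s & 0xC0) != 0x80)
			n++;
	return n;
}
```

With a haystack of "für" spelled with C3 BC, bytes_contain() finds
u_precomposed but not u_decomposed, and the reverse holds for a haystack
spelled with u plus CC 88.  codepoint_count() gives 1 for the first form and 2
for the second, so the two also differ as code point sequences.  Matching
across forms would presumably need an explicit normalization step, which this
sketch does not attempt.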


------------------------------------------
9fans: 9fans
Permalink: 
https://9fans.topicbox.com/groups/9fans/T8831073f8b8bb351-Mcf1aad549b2989d69b4d6347