> On Dec 17, 2025, at 22:17, Jacob Moody <[email protected]> wrote: > > I've been poking at some of the utf* functions lately and utfutf is a bit > puzzling. > At face value, strstr() should be sufficient for handling utf8 encoded > strings just as strcmp() is.
Maybe normalization could be the reason: there can be multiple representations, for example, ü might be one code point (Unicode: U+00FC, UTF-8: C3 BC), or might be u with a combining umlaut. I would assume converting to a rune would turn out the same either way: then you can compare them even if the haystack is represented one way in utf8 and the needle is the other way. (Disclaimer: I’m not a unicode expert, even less so on 9) ------------------------------------------ 9fans: 9fans Permalink: https://9fans.topicbox.com/groups/9fans/T8831073f8b8bb351-Mcf1aad549b2989d69b4d6347 Delivery options: https://9fans.topicbox.com/groups/9fans/subscription
