On Monday, 18 May 2020 at 14:28:33 UTC, Steven Schveighoffer wrote:
On 5/18/20 9:44 AM, Martin Tschierschke wrote:
[...]

Using == on strings compares the exact bits for equality. In Unicode, the same grapheme can be encoded in different ways. For example, ö can be a single code point, the o with a diaeresis (U+00F6). But you could also encode it with 2 code points: a standard o followed by a combining diaeresis (U+006F, U+0308).

What you need is to normalize the data for comparison: https://dlang.org/phobos/std_uni.html#normalize
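
A minimal sketch of what that could look like, assuming both strings are valid UTF-8 and that NFC is an acceptable form for the comparison. The helper name sameText is made up for illustration and is not part of Phobos:

```d
import std.uni : normalize, NFC;

// Hypothetical helper: compares two strings after bringing both
// to the same normalization form (NFC here).
bool sameText(string a, string b)
{
    return normalize!NFC(a) == normalize!NFC(b);
}

void main()
{
    string precomposed = "\u00F6";   // ö as one code point
    string decomposed  = "o\u0308";  // o plus combining diaeresis

    // Plain == sees different code units, so these are not equal.
    assert(precomposed != decomposed);

    // After normalization both sides have the same encoding.
    assert(sameText(precomposed, decomposed));
}
```

NFD would work just as well, as long as both sides use the same form. If the data is compared often, it is probably cheaper to normalize once when it is read in rather than on every comparison.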
Thank you, I will check that.
