On Saturday, 18 April 2015 at 17:04:54 UTC, Tobias Pankrath wrote:
Isn't this solved commonly with a normalization pass? We
should have a normalizeUTF() that can be inserted in a
pipeline.
Yes.
Then the rest of Phobos doesn't need to mind these combining
characters. -- Andrei
I don't think so. The thing is, even after normalization we
have to deal with combining characters because in all
normalization forms there will be combining characters left
after normalization.
Yes, again and again I encountered length related bugs with
Unicode characters. Normalization is not 100% reliable. I don't
know anyone who works with non English characters who doesn't
have problems with Unicode related issues sometimes.