Re: The Case Against Autodecode

tsbockman via Digitalmars-d Thu, 02 Jun 2016 13:42:52 -0700

On Thursday, 2 June 2016 at 20:13:14 UTC, Andrei Alexandrescuwrote:

On 06/02/2016 03:34 PM, tsbockman wrote:
Your 'ö' examples will NOT work reliably with auto-decodedcode points,and for nearly the same reason that they won't work with codeunits; you
would have to use byGrapheme.
They do work per spec: find this code point. It would besurprising if 'ö' were found but the string were positioned ata different code point.

Your examples will pass or fail depending on how (and whether)the 'ö' grapheme is normalized. They only ever succeeds because'ö' happens to be one of the privileged graphemes that *can* be(but often isn't!) represented as a single code point. Many othergraphemes have no such representation.

Working directly with code points is sometimes useful anyway -but then, working with code units can be, also. Neither will leadto inherently "correct" Unicode processing, and in the absence ofa compelling context, your examples fall completely flat as anargument for the inherent superiority of processing at the codeunit level.

The fact that you still don't get that, even after a dozenplus attemptsby the community to explain the difference, makes you unfit todirect
Phobos' Unicode support.
Well there's gotta be a reason why my basic comprehension isunder constant scrutiny whereas yours is safe.

Who said mine is safe? I *know* that I'm not qualified to be incharge of this.

Your comprehension is under greater scrutiny because you areproposing to overrule nearly all other active contributorscombined.

Please, either go study Unicode until you
really understand it, or delegate this issue to someone else.


Would be happy to. To whom would I delegate?

If you're serious, I would suggest Dmitry Olshansky. He seems tobe our top Unicode expert, based on his contributions to`std.uni` and `std.regex`. But, if he is unwilling/unsuitable forsome reason there are other candidates participating in thisthread (not me).

Re: The Case Against Autodecode

Reply via email to