Re: Of possible interest: fast UTF8 validation

Joakim via Digitalmars-d Wed, 16 May 2018 10:21:23 -0700

On Wednesday, 16 May 2018 at 16:48:28 UTC, Dmitry Olshansky wrote:

On Wednesday, 16 May 2018 at 15:48:09 UTC, Joakim wrote:
On Wednesday, 16 May 2018 at 11:18:54 UTC, Andrei Alexandrescuwrote:
https://www.reddit.com/r/programming/comments/8js69n/validating_utf8_strings_using_as_little_as_07/
Sigh, this reminds me of the old quote about people spending abunch of time making more efficient what shouldn't be done atall.
Validating UTF-8 is super common, most text protocols and filesthese days would use it, other would have an option to do so.
I’d like our validateUtf to be fast, since right now we dovalidation every time we decode string. And THAT is slow.Trying to not validate on decode means most things should bevalidated on input...

I think you know what I'm referring to, which is that UTF-8 is abadly designed format, not that input validation shouldn't bedone.

Re: Of possible interest: fast UTF8 validation

Reply via email to