> . what do we need to do to make the verification code more efficient? I 
> think we need to address the correctness issue first, but doing so 
> should certainly make us want to improve the verification code. For 
> example, I'm wondering if it might benefit from having a tiny cache.

It has been pointed out that the verification for UTF-8 is very
inefficient, involving several function calls to first get the length,
then check characters, etc. It could be significantly improved. I don't
know whether a cache would make any useful difference.
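
To make "significantly improved" a bit more concrete, here is a rough
sketch of a single-pass check that derives the sequence length and
validates the bytes in the same loop, with no per-character function
calls. It is only an illustration: utf8_is_valid is a made-up name, not
existing code, and the byte ranges are the well-formed ranges from
RFC 3629 (no overlong forms, no surrogates, nothing above U+10FFFF).

```c
#include <stdbool.h>
#include <stddef.h>

/*
 * Hypothetical helper (not existing code): validate a whole buffer as
 * UTF-8 in one pass.  Returns true if every byte belongs to a
 * well-formed sequence.
 */
static bool
utf8_is_valid(const unsigned char *s, size_t len)
{
    size_t i = 0;

    while (i < len)
    {
        unsigned char c = s[i];
        unsigned char lo = 0x80;    /* allowed range for the 2nd byte */
        unsigned char hi = 0xBF;
        size_t n;                   /* number of continuation bytes */

        if (c < 0x80)
        {
            i++;                    /* plain ASCII, nothing more to check */
            continue;
        }
        else if (c >= 0xC2 && c <= 0xDF)
            n = 1;                  /* 0xC0 and 0xC1 would be overlong */
        else if (c >= 0xE0 && c <= 0xEF)
        {
            n = 2;
            if (c == 0xE0)
                lo = 0xA0;          /* reject overlong 3-byte forms */
            else if (c == 0xED)
                hi = 0x9F;          /* reject surrogates U+D800..U+DFFF */
        }
        else if (c >= 0xF0 && c <= 0xF4)
        {
            n = 3;
            if (c == 0xF0)
                lo = 0x90;          /* reject overlong 4-byte forms */
            else if (c == 0xF4)
                hi = 0x8F;          /* reject anything above U+10FFFF */
        }
        else
            return false;           /* stray continuation or invalid lead */

        if (len - i <= n)
            return false;           /* sequence truncated at end of buffer */

        if (s[i + 1] < lo || s[i + 1] > hi)
            return false;

        for (size_t k = 2; k <= n; k++)
        {
            if ((s[i + k] & 0xC0) != 0x80)
                return false;       /* not a continuation byte */
        }

        i += n + 1;
    }

    return true;
}
```

Whether something along these lines beats the current code by enough to
matter would need measuring, and it says nothing about the cache
question.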

