Re: UTF-8 ill-formed question

Doug Ewell Sun, 16 Dec 2012 10:08:41 -0800

I remember Marco's original post in 2002. His intent was to give peoplewith an actual U+ code point that needed converting—like James Lin tenyears later—a quick way to do so without getting immersed in all thebit-shifting math.

If this were a routine being run by a computer, or a tutorial on UTF-8,I would agree that it should have taken loose surrogates into account.But it's not. It's just a quick manual reference guide, and loosesurrogates are 0.0001% of the real-world problem for users like James.

While I note that Philippe's amended version seems straightforward andin keeping with Marco's original intent (short and simple), I'd like tosuggest that neither Marco for creating the original guide, nor anyoneelse for doing up UTF-16 and UTF-32 versions, nor Otto for repostingthem on the list this week, need to be beaten up any further over thisedge case.


--
Doug Ewell | Thornton, Colorado, USA

http://www.ewellic.org | @DougEwell

Re: UTF-8 ill-formed question

Reply via email to