Re: Freeing HTTP::Message from HTML::Parser dependency

Bjoern Hoehrmann Mon, 16 Jan 2012 19:52:29 -0800

* Christopher J. Madsen wrote:
>Dropping support for UTF-32 from HTTP::Message is a separate issue from
>removing HTML::Parser.  I've got no comment on that.


(It's not quite as black and white as that; "HTML5" could be exempted in
the algorithm, for instance.)

>> Your UTF-8 validation code seems wrong to me, you consider the sequence
>> F0 80 to be incomplete, but it's actually invalid, same for ED A0, see
>> the chart in <http://bjoern.hoehrmann.de/utf-8/decoder/dfa/#design>.
>
>I guess the RE could be improved, but I'm not sure it's worth the effort
>and added complication to catch a tiny fraction of false positives.

Why make the check at all if you don't care if it's right?
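
To make the incomplete/invalid distinction concrete, here is a minimal
sketch in Python (not the regex under discussion, and not code from either
module). It assumes the well-formed byte ranges of the Unicode Standard's
UTF-8 table, where the lead byte restricts the second byte: E0 requires
A0..BF, ED requires 80..9F, F0 requires 90..BF, and F4 requires 80..8F.
The function name classify_utf8 is made up for illustration.

```python
def classify_utf8(data: bytes) -> str:
    """Return 'valid', 'incomplete', or 'invalid' for a byte string.

    'incomplete' means the input ends in the middle of a sequence whose
    bytes so far are all acceptable; more input could still complete it.
    """
    i, n = 0, len(data)
    while i < n:
        lead = data[i]
        if lead <= 0x7F:  # ASCII
            i += 1
            continue
        # lo/hi: allowed range for the SECOND byte; length: total bytes.
        if 0xC2 <= lead <= 0xDF:
            lo, hi, length = 0x80, 0xBF, 2
        elif lead == 0xE0:
            lo, hi, length = 0xA0, 0xBF, 3  # excludes overlong forms
        elif lead == 0xED:
            lo, hi, length = 0x80, 0x9F, 3  # excludes surrogates
        elif 0xE1 <= lead <= 0xEF:
            lo, hi, length = 0x80, 0xBF, 3
        elif lead == 0xF0:
            lo, hi, length = 0x90, 0xBF, 4  # excludes overlong forms
        elif 0xF1 <= lead <= 0xF3:
            lo, hi, length = 0x80, 0xBF, 4
        elif lead == 0xF4:
            lo, hi, length = 0x80, 0x8F, 4  # nothing above U+10FFFF
        else:
            return "invalid"  # 80..C1 and F5..FF never start a sequence
        for k in range(1, length):
            if i + k >= n:
                return "incomplete"  # ran out of input mid-sequence
            if not (lo <= data[i + k] <= hi):
                return "invalid"  # no continuation can repair this
            lo, hi = 0x80, 0xBF  # later bytes use the plain range
        i += length
    return "valid"


print(classify_utf8(b"\xF0\x80"))      # invalid
print(classify_utf8(b"\xF0\x90"))      # incomplete
print(classify_utf8(b"\xE2\x82\xAC"))  # valid
```

The point of the sketch is that F0 80 fails on its second byte, so no
further input can turn it into a well-formed sequence, whereas F0 90 is
only a truncated prefix.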

>> Anyway, if people think this is the way to go, maybe HTTP::Message can
>> adopt the Content-Type header charset extraction tests in HTML::Encoding
>> so they don't get lost as my module becomes redundant?
>
>I thought it already did that?

Not as far as I can tell; links welcome though.
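
For readers unfamiliar with what such tests exercise, a rough sketch of
charset extraction from a Content-Type value follows. It is written in
Python using the standard library's email.message module, not in terms of
HTML::Encoding or HTTP::Message, and the sample header values are invented
for illustration rather than taken from any existing test suite. The
helper name charset_from_content_type is hypothetical.

```python
from email.message import Message
from typing import Optional


def charset_from_content_type(value: str) -> Optional[str]:
    """Return the lowercased charset parameter, or None if absent."""
    msg = Message()
    msg["Content-Type"] = value
    # get_content_charset() unquotes the parameter and lowercases it.
    return msg.get_content_charset()


# Invented sample cases: quoting, letter case, and a missing parameter.
cases = [
    ("text/html; charset=UTF-8", "utf-8"),
    ('text/html; charset="ISO-8859-1"', "iso-8859-1"),
    ("text/html", None),
]
for header, expected in cases:
    assert charset_from_content_type(header) == expected
```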
-- 
Björn Höhrmann · mailto:bjo...@hoehrmann.de · http://bjoern.hoehrmann.de
Am Badedeich 7 · Telefon: +49(0)160/4415681 · http://www.bjoernsworld.de
25899 Dagebüll · PGP Pub. KeyID: 0xA4357E78 · http://www.websitedev.de/
