[issue14251] [PATCH]HTMLParser decode issue

Ezio Melotti Sat, 10 Mar 2012 18:34:35 -0800

Ezio Melotti <ezio.melo...@gmail.com> added the comment:

Can you provide a minimal example to reproduce this error?


On Python 2 it's always better to decode the HTML first and then pass unicode 
to the parser.  Even though the Python 2 parser accepts byte strings too, 
there are a few corner cases where it fails on them.
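
A minimal sketch of the decode-first approach, written for Python 3 so it runs 
as is.  The names TextCollector and parse_bytes are made up for illustration, 
and the UTF-8 default is an assumption: use whatever encoding the document 
actually declares.  On Python 2 the import would be 
"from HTMLParser import HTMLParser" instead.

```python
from html.parser import HTMLParser


class TextCollector(HTMLParser):
    """Hypothetical helper: collects the text found between tags."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        # Called by the parser with text content; data is always str here.
        self.chunks.append(data)


def parse_bytes(raw, encoding="utf-8"):
    """Decode raw bytes first, then feed the resulting text to the parser."""
    text = raw.decode(encoding)  # the encoding is an assumption of this sketch
    parser = TextCollector()
    parser.feed(text)
    parser.close()
    return "".join(parser.chunks)


raw_html = "<p>caf\u00e9</p>".encode("utf-8")  # bytes, as read from a socket or file
result = parse_bytes(raw_html)
```

Decoding up front means the parser never has to mix bytes and text 
internally, which is where the corner cases tend to come from.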

On Python 3 the parser only accepts unicode, and it should work fine with it 
(especially if you have an updated clone of cpython).  Can you show what 
failure you get with Python 3?  Also, can you reproduce the error if you use 
strict=False?
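
For reference, a small sketch of what "only accepts unicode" means on 
Python 3.  The helper name feed_accepts is made up for illustration.  The 
strict argument is left out on purpose, since it was removed in Python 3.5 and 
passing it there raises an error of its own.

```python
from html.parser import HTMLParser


def feed_accepts(data):
    """Return True if HTMLParser.feed() takes the data, False on TypeError."""
    parser = HTMLParser()
    try:
        parser.feed(data)
    except TypeError:
        # On Python 3, feeding bytes fails because the parser's internal
        # buffer is a str and bytes cannot be appended to it.
        return False
    return True


str_ok = feed_accepts("<p>ok</p>")     # text input
bytes_ok = feed_accepts(b"<p>ok</p>")  # undecoded bytes input
```

A script of about this size, with the failing HTML inlined, would also do as 
the minimal example asked for above.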

----------
assignee:  -> ezio.melotti
nosy: +ezio.melotti
type: crash -> behavior

_______________________________________
Python tracker <rep...@bugs.python.org>
<http://bugs.python.org/issue14251>
_______________________________________
