[issue12008] HtmlParser non-strict goes wrong with unquoted attributes

svilen dobrev Thu, 05 May 2011 03:47:37 -0700

New submission from svilen dobrev <a...@svilendobrev.com>:

In non-strict mode the parser seems to consume too much of the data and runs 
past the endpos of the chunk being processed; it then gets confused and treats 
everything that follows as data. I have not worked out how to fix the regexp 
itself, so instead I limited its span to :endpos so that it does not consume 
too much. 
This seems to happen with unquoted attributes.
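
As a rough illustration of the workaround described above (a sketch only, not
the contents of html.parser.diff): compiled patterns in Python's re module
accept optional pos and endpos arguments to match(), and endpos bounds how far
a match may extend. The pattern LOOSE_ATTR and the sample rawdata below are
hypothetical and are not the regexp that html.parser actually uses.

```python
import re

# Hypothetical, deliberately loose attribute pattern (an assumption for this
# sketch, NOT the regexp from html.parser): an unquoted value is taken to be
# "everything up to the next whitespace".
LOOSE_ATTR = re.compile(r'\s*([a-zA-Z_][-\w.:]*)\s*=\s*(\S*)')

# Hypothetical input: a start tag with an unquoted attribute, followed by
# more markup that contains no whitespace.
rawdata = '<a href=foo>text</a><b>more</b>'

pos = 2                        # just after the tag name "a"
endpos = rawdata.index('>')    # end of the start-tag chunk (exclusive)

# Without a bound, the value group runs past the end of the tag.
unbounded = LOOSE_ATTR.match(rawdata, pos)

# With endpos, the match cannot extend beyond the chunk being processed.
bounded = LOOSE_ATTR.match(rawdata, pos, endpos)

print(unbounded.group(2))   # foo>text</a><b>more</b>
print(bounded.group(2))     # foo
```

Bounding the match only limits the damage; it does not make the pattern itself
stricter, which matches the report's note that the regexp was left unfixed.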


----------
files: html.parser.diff
keywords: patch
messages: 135182
nosy: svilend
priority: normal
severity: normal
status: open
title: HtmlParser non-strict goes wrong with unquoted attributes
Added file: http://bugs.python.org/file21893/html.parser.diff

_______________________________________
Python tracker <rep...@bugs.python.org>
<http://bugs.python.org/issue12008>
_______________________________________