New submission from svilen dobrev <a...@svilendobrev.com>:

In non-strict mode, HTMLParser seems to consume too much data: a regexp match runs past the endpos of the chunk being processed, after which the parser gets confused and treats everything that follows as data. I did not work out how to fix the regexp itself; instead, the patch limits its span to :endpos so that it cannot match too far. This seems to happen with unquoted attributes.
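To illustrate the kind of overrun described above, here is a minimal sketch. It is not the attached patch and does not use the real regexps from html.parser: the pattern `attr` and the sample string `rawdata` are made up for illustration. It relies only on the optional `pos` and `endpos` arguments that compiled patterns accept in `match()`.

```python
import re

# Hypothetical, simplified pattern for an unquoted attribute (an assumption,
# NOT the regexp used by html.parser): the value is "anything up to whitespace".
attr = re.compile(r'\s*(\w+)=([^\s]*)')

rawdata = '<a href=foo>text</a> more'
start = 2                          # position just after '<a'
endpos = rawdata.index('>', start) # end of the chunk (the start tag)

# Unbounded match: the value swallows the '>' and the data that follows it.
unbounded = attr.match(rawdata, start)
print(repr(unbounded.group(2)))

# Bounded match: the pattern sees the string as if it ended at endpos,
# so the value cannot extend past the tag.
bounded = attr.match(rawdata, start, endpos)
print(repr(bounded.group(2)))
```

Passing `endpos` leaves the pattern itself unchanged; it only caps how far a match may reach, which is the workaround described in the report rather than a fix to the regexp.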
----------
files: html.parser.diff
keywords: patch
messages: 135182
nosy: svilend
priority: normal
severity: normal
status: open
title: HtmlParser non-strict goes wrong with unquoted attributes

Added file: http://bugs.python.org/file21893/html.parser.diff

_______________________________________
Python tracker <rep...@bugs.python.org>
<http://bugs.python.org/issue12008>
_______________________________________