[issue14506] HTMLParser can't handle erronous end tags with additional info in them

2012-04-05 Thread Olaf Tomalka
Changes by Olaf Tomalka olaf.toma...@gmail.com: -- title: HTMLParser can't handle erronous end tags with additional tags in it - HTMLParser can't handle erronous end tags with additional info in them ___ Python tracker rep...@bugs.python.org

[issue14506] HTMLParser can't handle erronous end tags with additional info in them

2012-04-05 Thread R. David Murray
R. David Murray rdmur...@bitdance.com added the comment: Which version of python did you test with? There have been several improvements html parsing recently. -- nosy: +ezio.melotti, r.david.murray ___ Python tracker rep...@bugs.python.org

[issue14506] HTMLParser can't handle erronous end tags with additional info in them

2012-04-05 Thread Olaf Tomalka
Olaf Tomalka olaf.toma...@gmail.com added the comment: Python 3.2.2, which is latest on arch linux -- ___ Python tracker rep...@bugs.python.org http://bugs.python.org/issue14506 ___

[issue14506] HTMLParser can't handle erronous end tags with additional info in them

2012-04-05 Thread R. David Murray
R. David Murray rdmur...@bitdance.com added the comment: I just tested your script on 3.2.3a2+, and it raises an error. Ezio made the other parsing changes, I'll leave it to him to evaluate what if anything should be done here. -- versions: +Python 3.3

[issue14506] HTMLParser can't handle erronous end tags with additional info in them

2012-04-05 Thread Ezio Melotti
Ezio Melotti ezio.melo...@gmail.com added the comment: This is already fixed, but only in non-strict mode (and 3.2.3 iirc). You should always use HTMLParser(strict=False). The non-strict mode will probably become the default and strict=True will be deprecated. Thanks anyway for the report, and