[issue13358] HTMLParser incorrectly handles cdata elements.

2011-11-17 Thread Michael Brooks
Michael Brooks firealwayswo...@gmail.com added the comment: Has anyone else been able to verify this? On Mon, Nov 7, 2011 at 7:46 AM, Michael Brooks rep...@bugs.python.orgwrote: Michael Brooks firealwayswo...@gmail.com added the comment: This one should also have a priority change. Tested

[issue13358] HTMLParser incorrectly handles cdata elements.

2011-11-17 Thread Michael Brooks
Michael Brooks firealwayswo...@gmail.com added the comment: Ok so until you fix this bug, i'll be overriding HTMLParser with my fix, becuase this is a blocking issue for my project. My HTMLParser must behave like a browser, period end of story. Thanks. On Thu, Nov 17, 2011 at 9:24 AM, Ezio

[issue13358] HTMLParser incorrectly handles cdata elements.

2011-11-17 Thread Michael Brooks
Michael Brooks firealwayswo...@gmail.com added the comment: Oah, then there is a misunderstanding. No browser will parse the html that is declared within a javascript variable, it must be treated as a continues data segment (with cdata properties) until the exit /\s*script\s* is encountered

[issue13358] HTMLParser incorrectly handles cdata elements.

2011-11-07 Thread Michael Brooks
Michael Brooks firealwayswo...@gmail.com added the comment: This one should also have a priority change. Tested python 2.7.3 --MIke On Sun, Nov 6, 2011 at 12:54 PM, Michael Brooks rep...@bugs.python.orgwrote: Michael Brooks firealwayswo...@gmail.com added the comment: Yes I am running

[issue13357] HTMLParser parses attributes incorrectly.

2011-11-06 Thread Michael Brooks
New submission from Michael Brooks firealwayswo...@gmail.com: Open the attached file red_test.html in a browser. The bad elements are blue because the style tag isn't parsed by any known browser. However, the HTMLParser library will incorrectly recognize them. -- components

[issue13358] HTMLParser incorrectly handles cdata elements.

2011-11-06 Thread Michael Brooks
New submission from Michael Brooks firealwayswo...@gmail.com: The HTML tag at the bottom of this page correctly identified has having cdata like properties and trigger set_cdata_mode(). Due to the cdata properties of this tag, the only way to end the data segment is with a closing /script

[issue13358] HTMLParser incorrectly handles cdata elements.

2011-11-06 Thread Michael Brooks
Changes by Michael Brooks firealwayswo...@gmail.com: -- type: - behavior ___ Python tracker rep...@bugs.python.org http://bugs.python.org/issue13358 ___ ___ Python-bugs

[issue13357] HTMLParser parses attributes incorrectly.

2011-11-06 Thread Michael Brooks
Michael Brooks firealwayswo...@gmail.com added the comment: Yes, I am running the latest version, which is python 2.7.2. On Sun, Nov 6, 2011 at 12:14 PM, Ezio Melotti rep...@bugs.python.orgwrote: Ezio Melotti ezio.melo...@gmail.com added the comment: Thanks for the report. Could you try

[issue13358] HTMLParser incorrectly handles cdata elements.

2011-11-06 Thread Michael Brooks
Michael Brooks firealwayswo...@gmail.com added the comment: Yes I am running python 2.7.2. On Sun, Nov 6, 2011 at 12:52 PM, Ezio Melotti rep...@bugs.python.orgwrote: Ezio Melotti ezio.melo...@gmail.com added the comment: Have you tried with the latest 2.7? (see msg147170

[issue13357] HTMLParser parses attributes incorrectly.

2011-11-06 Thread Michael Brooks
Michael Brooks firealwayswo...@gmail.com added the comment: Python 2.7.3 is still affected by both of these issues. On Sun, Nov 6, 2011 at 12:56 PM, Ezio Melotti rep...@bugs.python.orgwrote: Ezio Melotti ezio.melo...@gmail.com added the comment: I mean 2.7.3 (i.e. the development version

[issue10599] sgmllib.parse_endtag() is not respecting quoted text

2010-12-01 Thread Michael Brooks
New submission from Michael Brooks firealwayswo...@gmail.com: In the attached example is a very simple usage of sgmllib that is trying to parse: input value=a href=http://buglink/a The bug is that sgmllib is parsing this href. Browsers on the other hand see this as the input's value. Also

[issue10599] sgmllib.parse_endtag() is not respecting quoted text

2010-12-01 Thread Michael Brooks
Michael Brooks firealwayswo...@gmail.com added the comment: Oops, I had a misnomer in my bug report. input value=\a href=http://buglink/a is not escaped and there for the href should be parsed in this condition but not parsed in the attached sgmllib_bug.py