Re: HTMLParser not parsing whole html file

2010-10-26 Thread John Nagle
On 10/24/2010 11:44 PM, Stefan Behnel wrote: josh logan, 25.10.2010 04:14: I found the error. The HTML file I'm parsing has invalid HTML at line 193. It has something like: a href=mystuff class = stuff Note there is no space between the closing quote for the href tag and the class attribute.

Re: HTMLParser not parsing whole html file

2010-10-25 Thread Stefan Behnel
josh logan, 25.10.2010 04:14: I found the error. The HTML file I'm parsing has invalid HTML at line 193. It has something like: a href=mystuff class = stuff Note there is no space between the closing quote for the href tag and the class attribute. I guess I'll go through each file and correct

HTMLParser not parsing whole html file

2010-10-24 Thread josh logan
Hello, I wanted to use python to scrub an html file for score data, but I'm having trouble. I'm using HTMLParser, and the parsing seems to fizzle out around line 192 or so. None of the event functions are being called anymore (handle_starttag, handle_endtag, etc.) and I don't understand why,

Re: HTMLParser not parsing whole html file

2010-10-24 Thread josh logan
On Oct 24, 4:36 pm, josh logan dear.jay.lo...@gmail.com wrote: Hello, I wanted to use python to scrub an html file for score data, but I'm having trouble. I'm using HTMLParser, and the parsing seems to fizzle out around line 192 or so. None of the event functions are being called anymore

Re: HTMLParser not parsing whole html file

2010-10-24 Thread josh logan
On Oct 24, 4:38 pm, josh logan dear.jay.lo...@gmail.com wrote: On Oct 24, 4:36 pm, josh logan dear.jay.lo...@gmail.com wrote: Hello, I wanted to use python to scrub an html file for score data, but I'm having trouble. I'm using HTMLParser, and the parsing seems to fizzle out around