sharifah ummu kulthum, 22.02.2010 14:24:
>   File "grabmy.py", line 63, in get_html
>     return BeautifulSoup(content)
>   File "build/bdist.linux-i686/egg/BeautifulSoup.py", line 1499, in __init__
>   File "build/bdist.linux-i686/egg/BeautifulSoup.py", line 1230, in __init__
>   File "build/bdist.linux-i686/egg/BeautifulSoup.py", line 1263, in _feed
>   File "/usr/lib/python2.6/HTMLParser.py", line 108, in feed
>     self.goahead(0)
>   File "/usr/lib/python2.6/HTMLParser.py", line 148, in goahead
>     k = self.parse_starttag(i)
>   File "/usr/lib/python2.6/HTMLParser.py", line 226, in parse_starttag
>     endpos = self.check_for_whole_start_tag(i)
>   File "/usr/lib/python2.6/HTMLParser.py", line 301, in
> check_for_whole_start_tag
>     self.error("malformed start tag")
>   File "/usr/lib/python2.6/HTMLParser.py", line 115, in error
>     raise HTMLParseError(message, self.getpos())
> HTMLParser.HTMLParseError: malformed start tag, at line 830, column 36

Just noticed this now - you seem to be using BeautifulSoup, likely version
3.1. This version does not support parsing broken HTML any well, so use
version 3.0.8 instead, or switch to the tools I indicated.

Note that switching tools means that you need to change your code to use
them. Just installing them is not enough.

Stefan

_______________________________________________
XML-SIG maillist  -  XML-SIG@python.org
http://mail.python.org/mailman/listinfo/xml-sig

Reply via email to