Comment #2 on issue 124 by EmilStenstrom: Crash when parsing three swedish
news sites with beautifulsoup treebuilder
http://code.google.com/p/html5lib/issues/detail?id=124
Also happens on the latest source chekout, but with an extra
DataLossWarning:
C:\Program Files
(x86)\python\lib\site-packages\html5lib-0.11-py2.5.egg\html5lib
\treebuilders\soup.py:139: DataLossWarning: BeautifulSoup cannot represent
eleme
nts in any namespace
warnings.warn("BeautifulSoup cannot represent elements in any namespace",
Data
LossWarning)
C:\Program Files
(x86)\python\lib\site-packages\html5lib-0.11-py2.5.egg\html5lib
\treebuilders\soup.py:161: DataLossWarning: BeautifulSoup cannot represent
elements
in any namespace
warnings.warn("BeautifulSoup cannot represent elements in any namespace",
Data
LossWarning)
Traceback (most recent call last):
File "C:\Emils\Kod\sammanfatta\html5bug.py", line 11, in <module>
doc = parser.parse(page)
File "build\bdist.win32\egg\html5lib\html5parser.py", line 211, in parse
File "build\bdist.win32\egg\html5lib\html5parser.py", line 111, in _parse
File "build\bdist.win32\egg\html5lib\html5parser.py", line 179, in
mainLoop
File "build\bdist.win32\egg\html5lib\html5parser.py", line 447, in
processStartTag
File "build\bdist.win32\egg\html5lib\html5parser.py", line 1041, in
startTagA
File "build\bdist.win32\egg\html5lib\html5parser.py", line 1437, in
endTagFormatting
File "build\bdist.win32\egg\html5lib\treebuilders\soup.py", line 96, in
removeChild
TypeError: list indices must be integers
--
You received this message because you are listed in the owner
or CC fields of this issue, or because you starred this issue.
You may adjust your issue notification preferences at:
http://code.google.com/hosting/settings
--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google Groups
"html5lib-discuss" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/html5lib-discuss?hl=en-GB
-~----------~----~----~----~------~----~------~--~---