Status: New
Owner: ----

New issue 217 by szepe.vi...@gmail.com: invalid characters in tag and attribute names
http://code.google.com/p/html5lib/issues/detail?id=217

original "HTML" code:

pedig meg is szólaltattuk. <FOTÓGALÉRIA, VIDEÓ</b>

after html5lib.HTMLParser:

pedig meg is szólaltattuk. <fotÓgalÉria, b="" videÓ<=""></fotÓgalÉria,>

then HTMLParser can't handle tag and attributes with accented characters and commas (etc.) in them:

Exception in html2text: <class 'HTMLParser.HTMLParseError'>; malformed start tag, at line 89, column 154


"What is the expected output? What do you see instead?"

<fotgalria b="" vide=""></fotgalria>

Thank you!


--
You received this message because you are subscribed to the Google Groups 
"html5lib-discuss" group.
To post to this group, send an email to html5lib-discuss@googlegroups.com.
To unsubscribe from this group, send email to 
html5lib-discuss+unsubscr...@googlegroups.com.
For more options, visit this group at 
http://groups.google.com/group/html5lib-discuss?hl=en-GB.

Reply via email to