Hello,

In pretty new to this list, and guess what, sorry for my English :)

I have an issue concerning parsing HTML files with the HTMLparser API.
The web page has attributes in tags which contain URI with ampersands
not encoded as "&".
Obviously, the parser (with the HTML_PARSE_RECOVER option) returns an error:
htmlParsEntityRef: expecting ';'

The xmlDoc created lacks of many elements.

So, I would like to know if there is a way to parse such HTML files with libxml?

Thanks,
Pierre

PS: I apology in advance if I have missed an explanation posted in the
previous posts
_______________________________________________
xml mailing list, project page  http://xmlsoft.org/
[email protected]
http://mail.gnome.org/mailman/listinfo/xml

Reply via email to