Is it possible to pass the encoding to the HTML parser through the Python API?
The question arises because some HTML files are actually XHTML, and their encoding is given in a <?xml ...?> declaration that the HTML parser ignores.
Since the HTML parser ignores this declaration, I see two possible actions here:
- Detect that the file is XHTML and use the XML parser instead of the HTML parser
- Detect that the file is XHTML and manually pass the encoding to the HTML parser (if this is possible)
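
To make the second option concrete, here is a minimal sketch of what I have in mind. The helper names (sniff_xml_declaration_encoding, parse_html_with_declared_encoding) are made up for illustration. It assumes that the binding's libxml2.htmlReadFile(filename, encoding, options) accepts an explicit encoding (or None), which is precisely the part I would like confirmed. The detection step only handles ASCII-compatible encodings, so a UTF-16 file would not be recognised.

```python
import re
from typing import Optional

# Matches an XML declaration at the very start of the data and captures the
# value of its encoding pseudo-attribute, e.g. <?xml version="1.0" encoding="ISO-8859-1"?>
_XML_DECL_ENCODING = re.compile(
    rb'^<\?xml\s[^>]*?encoding\s*=\s*["\']([A-Za-z][A-Za-z0-9._-]*)["\']'
)


def sniff_xml_declaration_encoding(head: bytes) -> Optional[str]:
    """Return the encoding named in a leading <?xml ...?> declaration, or None.

    Hypothetical helper: it only inspects the bytes it is given and assumes
    an ASCII-compatible encoding.
    """
    if head.startswith(b"\xef\xbb\xbf"):  # skip a UTF-8 byte order mark
        head = head[3:]
    match = _XML_DECL_ENCODING.match(head.lstrip())
    return match.group(1).decode("ascii") if match else None


def parse_html_with_declared_encoding(path: str):
    """Parse `path` with the HTML parser, forcing the declared encoding if any.

    Assumption: libxml2.htmlReadFile(filename, encoding, options) takes the
    encoding as its second argument and accepts None when nothing was found.
    """
    import libxml2  # imported here so the sniffing helper works without the bindings

    with open(path, "rb") as f:
        head = f.read(1024)  # the declaration, if present, is at the very start
    encoding = sniff_xml_declaration_encoding(head)
    return libxml2.htmlReadFile(path, encoding, 0)
```

For the first option, the same sniffing step could decide whether to hand the file to the XML parser instead; a file without the declaration would still go through the HTML parser as it does today.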
Thanks, César
_______________________________________________ xml mailing list, project page http://xmlsoft.org/ [email protected] http://mail.gnome.org/mailman/listinfo/xml
