Hi again,

Is it possible to provide the encoding through the python API to the html parser?

This question arises because sometimes the html files are xhtml ones, and the encoding is in a <?xml> tag that the parser ignores.
As the html parser ignores this tag, so I see two possible actions here:

  • Detect that the file is an xhtml one and use the xml parser instead the html parser
  • Detect that the file is an xhtml one and set manually the encoding to que html parser (if this is possible)
which option do you think is more suitable?

Thanks, César


_______________________________________________
xml mailing list, project page  http://xmlsoft.org/
[email protected]
http://mail.gnome.org/mailman/listinfo/xml

Reply via email to