I'm reading HTML pages from various URLs and converting them to XHTML using JTidy (as in HTML Generator). I know that I can configure JTidy to use a certain encoding, for example by calling setCharEncoding(Configuration.UTF8);.
My problem is that the character encoding is very often specified only within the HTML document itself, using tags like <meta http-equiv="content-type" content="text/html; charset=ISO-8859-1">, so I don't know which encoding to configure before the document has been parsed.

Are there any tricks for solving this problem?
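
One idea, as a rough sketch only: peek at the first few kilobytes of the stream, look for a charset declaration in a <meta> tag, rewind, and only then hand the stream to JTidy. The class and method names below (CharsetSniffer, peek, sniff) are made up for illustration and are not part of JTidy; the regular expression is a simplification and will not cover every way a page can declare its encoding.

```java
import java.io.BufferedInputStream;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of JTidy.
final class CharsetSniffer {

    // Matches charset=... inside a <meta ...> tag, for example
    // <meta http-equiv="content-type" content="text/html; charset=ISO-8859-1">
    private static final Pattern META_CHARSET = Pattern.compile(
            "<meta\\b[^>]*?charset\\s*=\\s*[\"']?([A-Za-z0-9_.:\\-]+)",
            Pattern.CASE_INSENSITIVE);

    // Reads up to 'limit' bytes from the start of the stream, then rewinds it
    // so that a later parser still sees the whole document.
    static byte[] peek(BufferedInputStream in, int limit) throws IOException {
        in.mark(limit);
        byte[] buf = new byte[limit];
        int n = 0;
        int r;
        while (n < limit && (r = in.read(buf, n, limit - n)) != -1) {
            n += r;
        }
        in.reset();
        return Arrays.copyOf(buf, n);
    }

    // Returns the charset name declared in a <meta> tag, or 'fallback' if none is found.
    static String sniff(byte[] head, String fallback) {
        // ISO-8859-1 maps every byte to exactly one char, so this decoding never
        // fails and the ASCII markup we search for survives unchanged.
        String text = new String(head, StandardCharsets.ISO_8859_1);
        Matcher m = META_CHARSET.matcher(text);
        return m.find() ? m.group(1) : fallback;
    }
}
```

The returned name (e.g. "ISO-8859-1") would still have to be mapped to whatever encoding setting the JTidy version in use accepts; that mapping is left out here. A charset given in the HTTP Content-Type header, if the server sends one, should probably be checked before falling back to the <meta> tag.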



-- DI Gernot Koller
