Hi!


I'm reading HTML pages from various URLs and converting them to XHTML using JTidy (as in HTML Generator). I know that I can configure JTidy to use a certain encoding, for example by calling setCharEncoding(Configuration.UTF8);.
My problem is that the character encoding is very often specified only within the HTML document itself, using tags like <meta http-equiv="content-type" content="text/html; charset=ISO-8859-1">, so I do not know which encoding to configure before the page is parsed.
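
One possible workaround, shown only as a rough sketch: buffer the first few kilobytes of the page, decode them as ISO-8859-1 (which maps every byte to a character, so the ASCII markup stays readable), and search for a charset declaration in a meta tag before handing the stream to JTidy. The names CharsetSniffer and detectCharset are hypothetical and not part of JTidy, and mapping the returned name to one of JTidy's encoding constants is left out here.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of JTidy: guesses the encoding from a meta tag.
final class CharsetSniffer {

    // Matches charset=NAME inside a <meta ...> tag, with or without quotes.
    // The name must start with a letter or digit, as Java charset names do.
    private static final Pattern META_CHARSET = Pattern.compile(
            "<meta[^>]*charset\\s*=\\s*[\"']?([A-Za-z0-9][A-Za-z0-9._:+-]*)",
            Pattern.CASE_INSENSITIVE);

    // head: the first bytes of the document (a few KB is usually enough).
    // fallback: the charset name to return when nothing usable is declared.
    static String detectCharset(byte[] head, String fallback) {
        // ISO-8859-1 decodes any byte sequence without errors.
        String text = new String(head, StandardCharsets.ISO_8859_1);
        Matcher m = META_CHARSET.matcher(text);
        if (m.find() && Charset.isSupported(m.group(1))) {
            return m.group(1);
        }
        return fallback;
    }
}
```

The detected name could then be used to pick the JTidy encoding setting, or to wrap the input in an InputStreamReader, before the real parse starts. This ignores the HTTP Content-Type header and byte order marks, which a more complete solution would probably check first.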


Are there any tricks for solving this problem?

Thanks,

Gernot


-- DI Gernot Koller
