El viernes, 8 agos, 2003, a las 15:24 Europe/Madrid, Serge Huber escribi�:


One last thing : for the HTML parsing I used the Java port of Tidy (http://www.sf.net/projects/jtidy). Although it's been abandonned for quite some time (2001 was the last release), it is quite good at building a DOM of even some very bad HTML. Unfortunately the DOM created is not very standard and has problems once you try to modify it. I tried to get into the code to fix this, but it's a quite complicated parser. But I think the license is freer than the LGPL so it might still be interesting.


jtidy license is OK to use in Apache licensed projects, as far as I know. It was/is used heavily inside cocoon, for instance.


Regards,
     Santiago


--------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]



Reply via email to