Yes, I was going to suggest that exact thing. JTidy is really cool because HTML parsing requires some heuristics, and the heuristics in JTidy (and tidy) are quite refined and easily tweakable.
c
At 11:58 AM 3/8/2001 -0800, you wrote:
We have found JTidy very useful for parsing HTML: http://sourceforge.net/projects/jtidy/
gordy perkins <[EMAIL PROTECTED]> writes: > Found out why HTMLDocumentImpl doesn't work - check > the close() method: > > > public void close() > { > // ! NOT IMPLEMENTED, REQUIRES PARSER ! > if ( _writer != null ) > { > _writer = null; > } > } > > Does anyone know a simple parser for HTML - all I need > is a reliable way to grab forms, some text and/or > title from a WWW page and display on text terminals. > > __________________________________________________ > Do You Yahoo!? > Get email at your own domain with Yahoo! Mail. > http://personal.mail.yahoo.com/ > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [EMAIL PROTECTED] > For additional commands, e-mail: [EMAIL PROTECTED]
--------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
--------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
