Yes, I was going to suggest that exact thing. JTidy
is really cool because HTML parsing requires some
heuristics, and the heuristics in JTidy (and tidy)
are quite refined and easily tweakable.

        c

At 11:58 AM 3/8/2001 -0800, you wrote:

We have found JTidy very useful for parsing HTML:
http://sourceforge.net/projects/jtidy/

gordy perkins <[EMAIL PROTECTED]> writes:
> Found out why HTMLDocumentImpl doesn't work - check
> the close() method:
>
>
>     public void close()
>     {
>         // ! NOT IMPLEMENTED, REQUIRES PARSER !
>         if ( _writer != null )
>         {
>             _writer = null;
>         }
>     }
>
> Does anyone know a simple parser for HTML - all I need
> is a reliable way to grab forms, some text and/or
> title from a WWW page and display on text terminals.
>
> __________________________________________________
> Do You Yahoo!?
> Get email at your own domain with Yahoo! Mail.
> http://personal.mail.yahoo.com/
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



--------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]



Reply via email to