I have a module called HTML::FormatLynx which will do this. I have
been remiss in getting it onto CPAN, but you can find it at:

  http://www-arp.harvard.edu/eng/src/

  -Norton

Gisle Aas wrote:
> Anybody have a Perl answer to this?
>
> Use of 'links -dump http://links.sourceforge.net/' seems to work
> pretty well.
> 
> Subject: Parsing tables
> Date: Sat, 16 Feb 2002 15:48:35 -0800
> From: Allen Gee <[EMAIL PROTECTED]>
> To: [EMAIL PROTECTED]
> 
> Hi Gisle,
> 
> I tried using HTML::TreeBuilder to parse some HTML pages and then print
> them as plain text.  However, this did not work on portions of the pages
> which were contained in tables.  Text contained in tables was just marked
> [TABLE NOT SHOWN].  Do you know of an easy way to parse HTML pages with
> tables in them and have all the text in the tables formatted correctly, just
> as you would get if you had saved the page as text using a web browser?
> 
> Thanks for your help.
> 
> Allen
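
For anyone who only needs the text inside the tables, here is a minimal
sketch using HTML::TreeBuilder directly. It is not how HTML::FormatLynx
or 'links -dump' work; it just walks the parsed tree and prints one line
per table row. The helper name table_rows_as_text and the " | " cell
separator are my own choices for illustration. It does no column
alignment, and nested tables are not handled (cells of an inner table
would be repeated in the outer row).

```perl
#!/usr/bin/perl
use strict;
use warnings;
use HTML::TreeBuilder;

# Hypothetical helper (not part of any module named in this thread):
# returns one string per <tr>, with cell texts joined by " | ".
sub table_rows_as_text {
    my ($html) = @_;
    my $tree = HTML::TreeBuilder->new_from_content($html);
    my @lines;
    for my $row ($tree->look_down(_tag => 'tr')) {
        # Collect both header (<th>) and data (<td>) cells under this row.
        my @cells = map { $_->as_trimmed_text }
                    $row->look_down(_tag => qr/^t[dh]$/);
        push @lines, join(' | ', @cells);
    }
    $tree->delete;    # release the parse tree explicitly
    return @lines;
}

# Small made-up sample page, for demonstration only.
my $html = '<table><tr><th>Name</th><th>Qty</th></tr>'
         . '<tr><td>apples</td><td>3</td></tr></table>';
print "$_\n" for table_rows_as_text($html);
```

To run it on a saved page instead of the sample string, read the file
into $html first. If you want browser-like layout with aligned columns,
a real formatter is still the better route.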
