I have a module called HTML::FormatLynx which will do this. I have been remiss in getting it onto CPAN, but you can find it at:
http://www-arp.harvard.edu/eng/src/ -Norton Gisle Aas wrote: > Anybody have an perl answer to this? > > Use of 'links -dump http://links.sourceforge.net/' seems to work > pretty well. > > Subject: Parsing tables > Date: Sat, 16 Feb 2002 15:48:35 -0800 > From: Allen Gee <[EMAIL PROTECTED]> > To: [EMAIL PROTECTED] > > Hi Gisle, > > I tried using HTML::TreeBuilder to parse some html pages and then print > them as plain text. However, this did not work on portions of the pages > which were contained in tables. Text contained in tables was just marked > [TABLE NOT SHOWN]. Do you know of an easy way to parse html pages with > tables in them and have all the text in the tables formated correctly, just > as you would get if you had saved the page as text using a web browser? > > Thanks for your help. > > Allen
