Thanks a lot for your help. I'll give your suggestions a try. Allen
At 08:06 PM 2/18/2002 -0500, Norton Allen wrote: >I have a module called HTML::FormatLynx which will do this. I have >been remiss in getting it onto CPAN, but you can find it at: > > http://www-arp.harvard.edu/eng/src/ > > -Norton > >Gisle Aas wrote: >> Anybody have an perl answer to this? >> >> Use of 'links -dump http://links.sourceforge.net/' seems to work >> pretty well. >> >> Subject: Parsing tables >> Date: Sat, 16 Feb 2002 15:48:35 -0800 >> From: Allen Gee <[EMAIL PROTECTED]> >> To: [EMAIL PROTECTED] >> >> Hi Gisle, >> >> I tried using HTML::TreeBuilder to parse some html pages and then print >> them as plain text. However, this did not work on portions of the pages >> which were contained in tables. Text contained in tables was just marked >> [TABLE NOT SHOWN]. Do you know of an easy way to parse html pages with >> tables in them and have all the text in the tables formated correctly, just >> as you would get if you had saved the page as text using a web browser? >> >> Thanks for your help. >> >> Allen > >
