I havent had to do it yet (thank goodness!) but if I did need to I would use "Tidy"

http://www.w3.org/People/Raggett/tidy/
http://tidy.sourceforge.net/

--
Neerav Bhatt
http://www.bhatt.id.au
Web Development & IT consultancy
Mobile: +61 (0)403 8000 27

http://www.bhatt.id.au/blog/ - Ramblings Thoughts
http://www.bookcrossing.com/mybookshelf/neerav

Lea de Groot wrote:
What are people's preferred techniques for 'screen scraping' existing sites to get the text from a tag-soup table layout?
When a page has copious links and such, simply copying the text from the browser doesn't always give enough content to be a useful quick method.


Lea
*****************************************************
The discussion list for http://webstandardsgroup.org/
See http://webstandardsgroup.org/mail/guidelines.cfm
for some hints on posting to the list & getting help
*****************************************************




Reply via email to