Depending on what you are doing... I have found a lot of great ways to pull
out tables from HTML using HTML::TableExtract and LWP::UserAgent and
HTML::TreeBuilder.  I really haven't delved in to all of the libraries under
HTML, but these have been greate.  see cpan.org or perldoc for more info.



-----Original Message-----
From: Gary Hawkins [mailto:[EMAIL PROTECTED]]
Sent: Tuesday, January 15, 2002 2:26 AM
To: [EMAIL PROTECTED]
Subject: RE: read source file of .html


> use LWP. it can be as simple as this :
> 
> 
> use LWP::Simple;
> print get("http://www.mit.edu";);
> 
> Tor.
> 

Neat.  

Along that line, I would like to be able to wind up with pages after
retrieval
as plain text without html tags, hopefully using a module. 

/g




-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to