On Thu, Jul 19, 2001 at 09:51:14AM -0400, Sparkle Williams wrote:
> I just wrote a perl program that retrieves files of type http:// and ftp:// 
> from the internet.  When it retrieves the files it
> comes up in the html syntax of head, body, text etc. Is there any way I can 
> write an addition to my script that will cause
> the text to come up in it's formatted form rather than the html syntax 
> describing it's format?

Rendering HTML is not a simple problem.  Just look at how widely variant the
various browsers can be.  If you want to simply remove tags you can use
either HTML::Parser, with just a text handler that prints out the text; or
possibly HTML::TreeBuilder, and dump the tree as text; or, the quickest and
dirtiest way, use lynx -dump; yet another way, bring it up in a browser.


Michael
--
Administrator                      www.shoebox.net
Programmer, System Administrator   www.gallanttech.com
--

-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to