Re: perl and internet files

Jos I. Boumans Thu, 19 Jul 2001 13:20:10 -0700

why yes there is... i wrote a little module for perl that does just that.

basically, you pass the module a filehandle; it strips out all the html
(leaving the links in the file intact however) and passes back a string
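
to give a rough idea of that approach, here is a minimal sketch. it is not
the code of the module itself: the name strip_html_keep_links is made up for
illustration, and it assumes the input is small enough to read into memory
and that no attribute value contains a literal ">". reading the whole
filehandle at once is what lets a single regex match a tag that is split
across several lines.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hypothetical helper (not the module mentioned above): read everything
# from a filehandle, strip the tags but keep <a ...> and </a>, and
# return the result as one string.
sub strip_html_keep_links {
    my ($fh) = @_;

    # Slurp the whole input so a tag split across lines is a single match.
    my $html = do { local $/; <$fh> };
    return '' unless defined $html;

    # Drop comments first; they may contain '>' characters.
    $html =~ s/<!--.*?-->//gs;

    # Paragraph and line-break tags become newlines.
    $html =~ s/<p\b[^>]*>/\n\n/gi;
    $html =~ s/<br\b[^>]*>/\n/gi;

    # Remove every remaining tag except opening and closing anchors.
    # [^>] also matches newlines, so multi-line tags are handled.
    $html =~ s/<(?!\/?a\b)[^>]*>//gi;

    return $html;
}

# Usage: an in-memory filehandle stands in for a real HTML file.
my $sample = "<p>See <a\n  href=\"http://japh.nu\">the site</a> for <b\nclass=\"x\">more</b>.</p>\n";
open my $in, '<', \$sample or die "Can't open in-memory file: $!";
print strip_html_keep_links($in);
close $in;
```

regexes like these are fine for simple pages, but they are only an
approximation of HTML parsing; a real parser is the safer choice for
arbitrary input.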

a sample script is also available. look at http://japh.nu/index.cgi

the site is still a bit under construction so if you're looking for the
tutorials, they're at http://japh.nu

regards,

Jos Boumans


> Is there a way to rip the HTML tags with multi-line HTML tags?


> >There's the wonderful recipe 20.5 ("Converting HTML to ASCII") in Chapter 20

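> >open HTMLFILE, "<", "the_html_file's_name" or die "Can't open that: $!";
> >while (<HTMLFILE>)
> >{
> >   chomp;
> >   s/<p\b[^>]*>/\n\n/gi;    # <p> or <p ...> starts a new paragraph
> >   s/<br\b[^>]*>/\n/gi;     # <br> or <br ...> becomes a line break
> >   s/<[^>]+>//g;            # drop any other tag that fits on one line
> >   print;
> >}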
> >
> >- but be careful! This won't work when there are multi-line HTML tags!


