why yes there is... i wrote a little module for perl that does just that.
basicly, you pass the module a filehandle, it strips out all hte html
(leaving the links in the file intact however) and passes back a string
a sample script is also available. look at http://japh.nu/index.cgi
the site is still a bit under construction so if you're looking for the
tutorials, they're at http://japh.nu
regards,
Jos Boumans
> Is there a way to rip the HTML tags with multi-line HTML tags?
> >There's the wonderful recipe 20.5 ("Converting HTML to ASCII") in Chapter
> >20
> >open HTMLFILE, "<the_html_file's_name" || die "Can't open that: $!";
> >while (<HTMLFILE>)
> >{
> > chomp;
> > s/<p[^>]+>/\n\n/gi;
> > s/<br[^>]+>/\n/gi;
> > s/<[^>]+>//g;
> > print;
> >}
> >
> >- but be careful! This won't work when there are multi-line HTML tags!
--
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]