###   What is the best way to search and replace all text
###   outside the HTML tags in an .html document?
###   Would HTML::Parser be of any use for this task?
###   All hints are welcome.

###   Detlef


###   In the following approach I only upper-case
###   all the text. (What I actually have to do is
###   some kind of spell checking on it.)
###   Which cases are not covered by this regex?
###   What about speed and efficiency?

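# Slurp the whole sample document into $_.
$_ = join "", <DATA>;
# Upper-case each run of text ($1) that precedes a tag ($2); the tag is kept as-is.
s,([^<]*)(<.*?>),uc($1).$2,ges;
print;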

__DATA__
<html><head></head><body>
hallo world
</body></html>


###   This produces:

<html><head></head><body>
HALLO WORLD
</body></html>
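
###   For comparison, a minimal sketch of the HTML::Parser route,
###   assuming the module is installed from CPAN and its version 3
###   handler API is used. The helper name upcase_text_nodes is
###   hypothetical, and uc only stands in for the real spell-checking
###   step. Text inside <script> or <style> and character entities
###   such as &amp; are left alone, two cases the plain regex above
###   would probably upper-case as well.

```perl
use strict;
use warnings;
use HTML::Parser ();

# Hypothetical helper: upper-case text nodes and copy all markup verbatim.
sub upcase_text_nodes {
    my ($html) = @_;
    my $out = '';

    my $parser = HTML::Parser->new(
        api_version => 3,
        # Text events: is_cdata is true inside literal elements such as
        # <script> and <style>, whose content should not be changed.
        text_h => [
            sub {
                my ($text, $is_cdata) = @_;
                # Keep entities like &amp; intact, upper-case everything else.
                $text =~ s/(&#?\w+;)|([^&]+|&)/defined($1) ? $1 : uc($2)/ge
                    unless $is_cdata;
                $out .= $text;
            },
            'text,is_cdata',
        ],
        # Every other event (tags, comments, declarations) is copied as-is.
        default_h => [
            sub {
                my ($text) = @_;
                $out .= $text if defined $text;
            },
            'text',
        ],
    );

    $parser->parse($html);
    $parser->eof;
    return $out;
}

print upcase_text_nodes("<html><head></head><body>\nhallo world\n</body></html>\n");
```

###   The parser decides what is markup and what is text, so the
###   substitution never sees tags, comments or attribute values.
###   I have not measured how its speed compares with the regex.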


