I've managed to strip all HTML from a document except for some strange
hangers-on:

 
’
“
”
—
¨
®

I'd like to kill all of these kinds of things, substituting whitespace or apostrophes or whatever they are supposed to be, without my having to
specify each thing.

Any suggestions? I looked on CPAN, but their modules seem very complex for my needs, unless I'm being stupid.

-- Craig


_______________________________________________
ActivePerl mailing list
[email protected]
To unsubscribe: http://listserv.ActiveState.com/mailman/mysubs

Reply via email to