1. remove all CR and LF characters.
2 remove all </p>
3 change all <p> to CR/LF
4 change all <br> to CR/LF
While I recognize this is a "first stab" heuristic, it fails because of
too many assumptions.
For line endings:
Windows/DOS use CR/LF
Unix/Linux/Mac OS X use LF
Mac classic uses CR
What is worse, many email servers "munge" line endings as they
store/forward messages.
Also HTML should be _parsed_ and not just willy-nilly remove </p> info.
An emerging
requirement for HTML is that ALL tags be paired win an <TAG ON> </TAG
OFF>.
Parsing is still required if the HTML is malformed.
--
You take your life in your own hands, and what happens?
A terrible thing: no one to blame.
-- Erica Jong, writer (1942- )
--
You take your life in your own hands, and what happens?
A terrible thing: no one to blame.
-- Erica Jong, writer (1942- )
To subscribe/unsubscribe, point your browser to: http://www.tullochgorm.com/lists.html