On 05/18/2012 07:55 PM, Alex Brelsfoard wrote:
Hi All,
I was wondering if I could get some help here. I am looking for an
existing function/method/module that will properly convert all special
characters (like those from Microsoft Word: smart quotes, mdash, ellipses,
bullet points, etc.) to either a matching simpler character, or an HTML
entity.
HTML::Entities does a close job, but it does not handle everything
correctly.
years ago someone i know wrote such a beast. it is appropriately called
the demoronizer (replaces 'smart' crapola).
http://www.fourmilab.ch/webtools/demoroniser/
it may do the trick. at least it is pure perl and would be easy for you
to hack to your specific needs.
note that it is very old and written in perl4 code!
uri
_______________________________________________
Boston-pm mailing list
[email protected]
http://mail.pm.org/mailman/listinfo/boston-pm