Thanks Uri,

Yeah, I found that one when I was Googling. Sadly, it only converts a few special characters (smart quotes and em and en dashes). I need something that handles as many as can be thought of or found.
--Alex

On Sat, May 19, 2012 at 1:26 AM, Uri Guttman <[email protected]> wrote:

> On 05/18/2012 07:55 PM, Alex Brelsfoard wrote:
>
>> Hi All,
>>
>> I was wondering if I could get some help here. I am looking for an
>> existing function/method/module that will properly convert all special
>> characters (like those from Microsoft Word: smart quotes, mdash, ellipses,
>> bullet points, etc.) to either a matching simpler character, or an HTML
>> entity.
>>
>> HTML::Entities does a close job, but it does not handle everything
>> correctly.
>>
>
> years ago someone i know wrote such a beast. it is appropriately called
> the demoronizer (replaces 'smart' crapola).
>
> http://www.fourmilab.ch/webtools/demoroniser/
>
> it may do the trick. at least it is pure perl and would be easy for you
> to hack to your specific needs.
>
> note that it is very old and written in perl4 code!
>
> uri
>
> _______________________________________________
> Boston-pm mailing list
> [email protected]
> http://mail.pm.org/mailman/listinfo/boston-pm

