On 05/18/2012 07:55 PM, Alex Brelsfoard wrote:

 Hi All,

 I was wondering if I could get some help here.  I am looking for an
 existing function/method/module that will properly convert all special
 characters (like those from Microsoft Word: smart quotes, mdash, ellipses,
 bullet points, etc.) to either a matching simpler character, or an HTML
 entity.

 HTML::Entities does a close job, but it does not handle everything
 correctly.

years ago someone i know wrote such a beast. it is appropriately called
the demoronizer (replaces 'smart' crapola).

http://www.fourmilab.ch/webtools/demoroniser/

it may do the trick. at least it is pure perl and would be easy for you
to hack to your specific needs.

note that it is very old and written in perl4 code!

uri




_______________________________________________
Boston-pm mailing list
[email protected]
http://mail.pm.org/mailman/listinfo/boston-pm

Reply via email to