Huffington Post had this issue with their MT installation and created a plugin called Naughty Word Chars.[1] It was so popular that it was essentially incorporated into MT4. What it does is run a series of regexs over the content of an object before its saved to the database replacing the "naughty characters" from Word to safe representations. Those representations vary from ASCII to numeric entities to(I think) UTF8 representations.
Anyway, it might provide some ideas and code. Naughty Word Chars is GPL and most of MT will be there shortly. <tim/> [1] http://tech.huffingtonpost.com/2006/01/naughtywordchars.html --------------------------------------------------------------------- Web Archive: http://www.mail-archive.com/[email protected]/ http://marc.theaimsgroup.com/?l=cgiapp&r=1&w=2 To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
