Converting MS Word-exported HTML to clean HTML

2007-02-27 Thread Dov Katz
Does anyone have a CF (needs to be server-side) component, or cfm script/page to convert the HTML you get from a MS Word export into safe HTML for sending a HTML email? In particular, I'm looking to have the extended characters escaped into ; references, such as the forward/backward

Re: Converting MS Word-exported HTML to clean HTML

2007-02-27 Thread Steve Bryant
Dov, I don't know of anything to strip out the styling, but DataMgr converts the extended characters automatically. The relevant code: cfscript // Replace the special characters that Microsoft uses. MyStruct[Key] = Replace(MyStruct[Key], Chr(8217), Chr(39), ALL);// apostrophe MyStruct[Key] =

Re: Converting MS Word-exported HTML to clean HTML

2007-02-27 Thread Jim Wright
Dov Katz wrote: I know there's something like this in fckEditor, but I'm looking for it to be server-side. The js used to do this in FCK is in FCKEditor/editor/dialog/fck_paste.html...it is just a bunch of regex replaces...perhaps you could massage those over to some cfscript.