Ugh. Those inline styles are a killer. Anthony, your idea of batch-processing first is nice but I really need an option which works when the client pastes directly from the WP doc to the WYSIWYG. Server-side or client-side tidying via JS seems to be the only real option.
Unless I insist the bustids just use WordPad. :) -----Original Message----- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Faruk Ates Sent: Thursday, 8 September 2005 6:23 PM To: [email protected] Subject: RE: [WSG CMS] Know any standard's compliant WYSIWYG XHTML editors for a CMS integration? > Has anyone found a fluid cost-free process to achieve clean output > from pasting via Word docs? HTML Tidy is nice but doesn't really cut > it as part of a client process. 1. WVWare -> save Word file to HTML (this has to be installed on the server, btw) 2. str_replace() out empty content (it has a tendency to leave some empty paragraphs in there for no good purpose whatsoever; see example of that output below) 3. parse with html Tidy as XHTML, then do a manual XML parse over your content and rebuild it according to the rules you set yourself (i.e. only allow content to be put back into the rebuild-construction that you approve) Voilá. Super-duper clean output from Word :) Example of empty crap that WVWare leaves as residue: <p><div name="Default" align="left" style=" padding: 0.00mm 0.00mm 0.00mm 0.00mm; "> <p style="text-indent: 0.00mm; text-align: left; line-height: 4.166667mm; color: Black; background-color: White; "> </p></div> -- Faruk Ates Web consultant, designer, developer and project manager www.kurafire.net - www.mediadesign.nl - www.happyclog.com ********************************************************* The CMS discussion list for http://webstandardsgroup.org/ ********************************************************* ********************************************************* The CMS discussion list for http://webstandardsgroup.org/ *********************************************************
