[WSG] converting WORD text into clean XHTML
Hi group. I'm wondering if there's some easy (and free) way to convert text from a WORD document into clean XHTML that retains the formatting. Thanks.
--
~john
Dr. Zeus Web Development
http://www.DrZeus.net
content without clutter
**
The discussion list for http://webstandardsgroup.org/
See http://webstandardsgroup.org/mail/guidelines.cfm for some hints on posting to the list & getting help
**
RE: [WSG] converting WORD text into clean XHTML
I do not know of a program you can download to work on your computer, but Dean Allen of Textism fame has this online: http://textism.com/wordcleaner/
Tim Hill
Computer Associates
Graphic Artist
tel: +612 9937 0792
fax: +612 9937 0546
[EMAIL PROTECTED]
-Original Message-
From: [EMAIL PROTECTED] On Behalf Of john
Sent: Tuesday, 23 November 2004 9:10 AM
To: web standards group
Subject: [WSG] converting WORD text into clean XHTML
Hi group. I'm wondering if there's some easy (and free) way to convert text from a WORD document into clean XHTML that retains the formatting. Thanks.
RE: [WSG] converting WORD text into clean XHTML
There is a tool in Dreamweaver that can auto-generate it, but I must admit I have never used it ... a plugin that works in [free, excellent] HTML-Kit - http://www.chami.com/html-kit/
RE: [WSG] converting WORD text into clean XHTML
XStandard will do this on-the-fly. It's a WYSIWYG editor plugin for CMSs, not a stand-alone product. http://www.xstandard.com/
Paul Hempsall
Web Developer
Lake Macquarie City Council
Tel: (02) 4921 0713
Fax: (02) 4958 7257
Email: [EMAIL PROTECTED]
Web: http://www.lakemac.com.au
Re: [WSG] converting WORD text into clean XHTML
Wybrow, Mark wrote: I'm wondering if there's some easy (and free) way to convert text from a WORD document into clean XHTML that retains the formatting.
I have been using this from MS: http://www.microsoft.com/downloads/details.aspx?displaylang=en&FamilyID=209adbee-3fbd-482c-83b0-96fb79b74ded
It is Office 2000 HTML Filter 2.0. Not XHTML, but clean HTML.
http://tidy.sf.net can be used as a plugin on some XHTML WYSIWYG editors (search this list).
Yours, Antti Tuppurainen
System Specialist
Timecan Finland | http://www.timecan.fi
Personal | http://antti.tuppurainen.fi
Re: [WSG] converting WORD text into clean XHTML
john wrote: I'm wondering if there's some easy (and free) way to convert text from a WORD document into clean XHTML that retains the formatting.
If you have Dreamweaver, try using the 'Clean Up Word HTML' tool, then 'Convert to XHTML'. Any gunk left over after that is easily cleaned out using a few decent regular expressions in 'Find and Replace'.
Cheers, Lachlan
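A minimal sketch of the kind of regular-expression cleanup described above, written in Python rather than Dreamweaver's 'Find and Replace'. The patterns and the name `strip_word_gunk` are assumptions for illustration, not something posted to the list; they target leftovers Word commonly produces (conditional comments, `<o:p>` tags, `Mso*` classes, inline styles, `<span>`/`<font>` wrappers, empty paragraphs). Note that it discards inline styling wholesale, so any formatting you want to keep has to be carried by real elements.

```python
import re

# Hypothetical patterns for common Word leftovers, applied in order.
# Adjust or remove entries to suit what your own document contains.
WORD_GUNK = [
    # <!--[if gte mso 9]> ... <![endif]--> conditional comments
    (re.compile(r"<!--\[if.*?<!\[endif\]-->", re.S | re.I), ""),
    # <o:p> ... </o:p> Office namespace tags (the tags only, not their content)
    (re.compile(r"</?o:p[^>]*>", re.I), ""),
    # class=MsoNormal, class="MsoNormal", class='MsoNormal'
    (re.compile(r"\s+class=(\"Mso[^\"]*\"|'Mso[^']*'|Mso\w*)", re.I), ""),
    # quoted inline style attributes
    (re.compile(r"\s+style=(\"[^\"]*\"|'[^']*')", re.I), ""),
    # presentational <span> and <font> wrappers (tags only)
    (re.compile(r"</?(span|font)\b[^>]*>", re.I), ""),
    # paragraphs left holding only whitespace or &nbsp;
    (re.compile(r"<p>(\s|&nbsp;)*</p>", re.I), ""),
]


def strip_word_gunk(html: str) -> str:
    """Apply each cleanup pattern in turn and return the result."""
    for pattern, replacement in WORD_GUNK:
        html = pattern.sub(replacement, html)
    return html


if __name__ == "__main__":
    sample = (
        '<p class=MsoNormal style="margin:0cm">'
        "<span lang=EN-AU>Hello<o:p></o:p></span></p>"
        '<p class="MsoNormal">&nbsp;</p>'
    )
    print(strip_word_gunk(sample))
```

Regular expressions are a blunt tool for HTML, so treat this as a first pass and check the output by eye.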
Re: [WSG] converting WORD text into clean XHTML
I've always found that you still need to eyeball the code, because Word does some very strange things to lists and headings in particular. Also, a lot of Word documents are not styled properly to begin with (e.g. bold + font-size instead of headings), which adds complexity to resolve.
Terrence Wood.
On 2004-11-23 12:19 PM, Lachlan Hardy wrote: If you have Dreamweaver, try using the 'Clean Up Word HTML Tool'. Then 'Convert to XHTML'. Any gunk left over after that is easily cleaned out using a few decent regular expressions in the 'Find and Replace'
--
*** Are you in the Wellington area and interested in web standards? Wellington Web Standards Group inaugural meeting 9 Dec 2004. See http://webstandardsgroup.org/go/event24.cfm for details ***
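One way to make the eyeballing quicker: a rough Python sketch that lists paragraphs whose whole content is wrapped in bold, which is how a heading styled as "bold + font-size" usually comes out of Word. The function name `find_fake_headings` and the heuristic itself are assumptions for illustration; it only flags candidates for a human to review and will miss or misreport unusual markup.

```python
import re

# A paragraph whose entire content sits inside one <b> or <strong> element.
FAKE_HEADING = re.compile(
    r"<p\b[^>]*>\s*<(b|strong)\b[^>]*>(.*?)</\1>\s*</p>", re.I | re.S
)
# Any tag, used to reduce a match to its plain text.
TAG = re.compile(r"<[^>]+>")


def find_fake_headings(html: str) -> list[str]:
    """Return the text of paragraphs that look like headings faked with bold."""
    return [TAG.sub("", m.group(2)).strip() for m in FAKE_HEADING.finditer(html)]


if __name__ == "__main__":
    sample = (
        '<p><b><span style="font-size:14.0pt">Overview</span></b></p>'
        "<p>Some <b>bold</b> text.</p>"
    )
    for text in find_fake_headings(sample):
        print("Possible heading:", text)
```

Each reported paragraph is a candidate for a real heading element; which level to use is still a judgement call.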
Re: [WSG] converting WORD text into clean XHTML
On 23/11/04 9:19 AM, Wybrow, Mark [EMAIL PROTECTED] wrote: There is a tool in Dreamweaver that can auto generate but I must admit I have never used it ...
I've found that the Clean Word HTML command in Dreamweaver helps but still leaves too much junk I don't want. If you use Mac OS, cut and paste from Word into AppleWorks and then save as an HTML document. If you use Windows, there might be another word processing app that will give you cleaner HTML. If there is still some junk code in the HTML page AppleWorks produces, I get rid of it with Find and Replace. I'd love a better system than this work-around, so I too will be interested to hear what others do.
Re: [WSG] converting WORD text into clean XHTML
Textism have a word cleaner that works quite well: http://textism.com/wordcleaner/
On Tue, 23 Nov 2004 11:45:30 +1100, Hope A. Stewart [EMAIL PROTECTED] wrote: I've found that the Clean Word HTML command in Dreamweaver helps but still leaves too much junk I don't want.
--
Gmail invites - just ask nicely
Re: [WSG] converting WORD text into clean XHTML
john wrote: I'm wondering if there's some easy (and free) way to convert text from a WORD document into clean XHTML that retains the formatting.
Another addition: I just remembered that recent versions of Word allow you to save as 'HTML, Filtered'. This is MS-speak for removing all Office-specific tags. You still get that Mso-style rubbish, but it clears some of the more awkward stuff out straight away.
Re: [WSG] converting WORD text into clean XHTML
I asked much the same question a little while back, and what I put together was this: first, have the doc saved as HTML (Filtered) if it's coming from Word 2003 (earlier versions can get the filtered thingy someone else mentioned). Then, in my case, I wrote a filter for the content management system I built to pass the Word HTML through. What you could do is use one of the implementations of Tidy ( http://tidy.sourceforge.net/ ). For a web version, try http://infohound.net/tidy/ ... As I type this I'm just testing it on a big Word filtered HTML doc... and yes, it seems to do a decent job.
Nick
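For anyone who would rather script the Tidy step than use the web version, here is a minimal Python sketch that pipes Word's filtered HTML through a locally installed `tidy` command. The function names are hypothetical, and the option set (`-asxhtml`, `--word-2000`, `--bare`, `--clean`) is one plausible choice rather than the filter described above; option availability varies between Tidy releases, so check `tidy -help-config` for your version.

```python
import shutil
import subprocess


def tidy_command(executable: str = "tidy") -> list[str]:
    """Build the Tidy command line (an assumed option set, not a canonical one)."""
    return [
        executable,
        "-quiet",                   # suppress the summary banner
        "-asxhtml",                 # write XHTML output
        "-utf8",                    # read and write UTF-8
        "--word-2000", "yes",       # strip Word 2000 specific markup
        "--bare", "yes",            # strip more Microsoft-specific HTML
        "--clean", "yes",           # replace presentational markup with CSS rules
        "--show-warnings", "no",
    ]


def word_html_to_xhtml(html: str) -> str:
    """Run Tidy over the given HTML string and return the cleaned XHTML."""
    if shutil.which("tidy") is None:
        raise RuntimeError("tidy executable not found on PATH")
    result = subprocess.run(
        tidy_command(), input=html, capture_output=True, text=True, encoding="utf-8"
    )
    # Tidy exits with 0 (ok), 1 (warnings) or 2 (errors).
    if result.returncode > 1:
        raise RuntimeError(result.stderr)
    return result.stdout


if __name__ == "__main__":
    # Only attempt the conversion when Tidy is actually installed.
    if shutil.which("tidy"):
        print(word_html_to_xhtml("<p class=MsoNormal>Hello<o:p></o:p></p>"))
```

Saving from Word as HTML (Filtered) first, as suggested above, gives Tidy less to clean up.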