RE: [WSG] converting WORD text into clean XHTML

2004-11-22 Thread Hill, Tim
I do not know of a program you can download to work on your computer.
But Dean Allen of Textism fame has this online.
http://textism.com/wordcleaner/ 


Tim Hill
Computer Associates
Graphic Artist
tel: +612 9937 0792
fax: +612 9937 0546
[EMAIL PROTECTED]
 

-Original Message-
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On
Behalf Of john
Sent: Tuesday, 23 November 2004 9:10 AM
To: web standards group
Subject: [WSG] converting WORD text into clean XHTML

Hi group.

I'm wondering if there's some easy (and free) way to convert text from a
WORD document into clean XHTML that retains the formatting.

Thanks.
-- 

~john
_
Dr. Zeus Web Development
http://www.DrZeus.net
content without clutter



**
The discussion list for  http://webstandardsgroup.org/

 See http://webstandardsgroup.org/mail/guidelines.cfm
 for some hints on posting to the list  getting help
**


**
The discussion list for  http://webstandardsgroup.org/

 See http://webstandardsgroup.org/mail/guidelines.cfm
 for some hints on posting to the list  getting help
**



RE: [WSG] converting WORD text into clean XHTML

2004-11-22 Thread Wybrow, Mark

There is a tool in Dreamweaver that can auto generate but I must admit I
have never used it ...  a plugin that works in [ free xcellent ]
HTML-Kit - http://www.chami.com/html-kit/

-Original Message-
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On
Behalf Of john
Sent: Tuesday, 23 November 2004 9:10 AM
To: web standards group
Subject: [WSG] converting WORD text into clean XHTML

Hi group.

I'm wondering if there's some easy (and free) way to convert text from a

WORD document into clean XHTML that retains the formatting.

Thanks.
--

~john
_
Dr. Zeus Web Development
http://www.DrZeus.net
content without clutter



**
The discussion list for  http://webstandardsgroup.org/

 See http://webstandardsgroup.org/mail/guidelines.cfm
 for some hints on posting to the list  getting help
**



This email and any files transmitted with it are confidential and intended 
solely for the use of the individual or entity to whom they are addressed. If 
you have received this email in error please notify the system manager. Please 
note that any views or opinions presented in this email are solely those of the 
author and do not necessarily represent those of the company. The recipient 
should check this email and any attachments for the presence of viruses. The 
company accepts no liability for any damage caused by any virus transmitted by 
this email.

**
The discussion list for  http://webstandardsgroup.org/

 See http://webstandardsgroup.org/mail/guidelines.cfm
 for some hints on posting to the list  getting help
**



RE: [WSG] converting WORD text into clean XHTML

2004-11-22 Thread Paul Hempsall
XStandard will do this on-the-fly. It's a WYSIWYG editor plugin for
CMSs, not a stand-alone product.

http://www.xstandard.com/

Paul Hempsall 
Web Developer

Lake Macquarie City Council
Tel: (02) 4921 0713
Fax: (02) 4958 7257
Email: [EMAIL PROTECTED]
Web: http://www.lakemac.com.au 



-Original Message-
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On
Behalf Of john
Sent: Tuesday, 23 November 2004 9:10 AM
To: web standards group
Subject: [WSG] converting WORD text into clean XHTML


Hi group.

I'm wondering if there's some easy (and free) way to convert text from a

WORD document into clean XHTML that retains the formatting.

Thanks.
-- 

~john
_
Dr. Zeus Web Development
http://www.DrZeus.net
content without clutter



**
The discussion list for  http://webstandardsgroup.org/

 See http://webstandardsgroup.org/mail/guidelines.cfm
 for some hints on posting to the list  getting help
**



This information is intended for the addressee only. The use, copying or
distribution of this message or any information it contains, by anyone
other than the addressee is prohibited by the sender.

Any views expressed in this communication are those of the individual
sender, except where the sender specifically states them to be the views
of Council.

**
The discussion list for  http://webstandardsgroup.org/

 See http://webstandardsgroup.org/mail/guidelines.cfm
 for some hints on posting to the list  getting help
**



Re: [WSG] converting WORD text into clean XHTML

2004-11-22 Thread Antti Tuppurainen
Wybrow, Mark wrote:
I'm wondering if there's some easy (and free) way to convert text from a
WORD document into clean XHTML that retains the formatting.
 

I have been using this from MS
http://www.microsoft.com/downloads/details.aspx?displaylang=enFamilyID=209adbee-3fbd-482c-83b0-96fb79b74ded
It is Office 2000 HTML Filter 2.0..
Not xhtml, but clean html
http://tidy.sf.net can be used as a plugin on some xhtml -wysiwyg 
editors (search this list)

Yours, Antti Tuppurainen
System Specialist
Timecan Finland | http://www.timecan.fi
Personal | http://antti.tuppurainen.fi
**
The discussion list for  http://webstandardsgroup.org/
See http://webstandardsgroup.org/mail/guidelines.cfm
for some hints on posting to the list  getting help
**


Re: [WSG] converting WORD text into clean XHTML

2004-11-22 Thread Lachlan Hardy
john wrote:
I'm wondering if there's some easy (and free) way to convert text from a 
WORD document into clean XHTML that retains the formatting.
If you have Dreamweaver, try using the 'Clean Up Word HTML Tool'. Then 
'Convert to XHTML'. Any gunk left over after that is easily cleaned out 
using a few decent regular expressions in the 'Find and Replace'

Cheers,
Lachlan
**
The discussion list for  http://webstandardsgroup.org/
See http://webstandardsgroup.org/mail/guidelines.cfm
for some hints on posting to the list  getting help
**


Re: [WSG] converting WORD text into clean XHTML

2004-11-22 Thread Terrence Wood
I've always found that you still need to eyeball the code because Word 
does some very strange things to lists, and headings in particular. Also 
a lot of Word documents are not styled properly to begin with (e.g. 
bold+font-size, instead of headings) which leads to added complexity to 
resolve.

Terrence Wood.
On 2004-11-23 12:19 PM, Lachlan Hardy wrote:
john wrote:
I'm wondering if there's some easy (and free) way to convert text from 
a WORD document into clean XHTML that retains the formatting.

If you have Dreamweaver, try using the 'Clean Up Word HTML Tool'. Then 
'Convert to XHTML'. Any gunk left over after that is easily cleaned out 
using a few decent regular expressions in the 'Find and Replace'

Cheers,
Lachlan
**
The discussion list for  http://webstandardsgroup.org/
See http://webstandardsgroup.org/mail/guidelines.cfm
for some hints on posting to the list  getting help
**
--
***
  Are you in the Wellington area and interested in web standards?
  Wellington Web Standards Group inaugural meeting 9 Dec 2004.
  See http://webstandardsgroup.org/go/event24.cfm for details
***
**
The discussion list for  http://webstandardsgroup.org/
See http://webstandardsgroup.org/mail/guidelines.cfm
for some hints on posting to the list  getting help
**


Re: [WSG] converting WORD text into clean XHTML

2004-11-22 Thread Hope A. Stewart
On 23/11/04 9:19 AM, Wybrow, Mark [EMAIL PROTECTED] wrote:

 There is a tool in Dreamweaver that can auto generate but I must admit I
 have never used it ...

I've found that the Clean Word HTML command in Dreamweaver helps but still
leaves too much junk I don't want.

If you use Mac OS, cut and paste from Word into AppleWorks and then save as
an html document. If you use Windows, there might be another word processing
app that will give you cleaner html.

If there is still some junk coding from the AppleWorks produced html page, I
get rid of it with Find and Replace.

I'd love a better system to this work-around that I use, so I too will be
interested to hear what others do.

**
The discussion list for  http://webstandardsgroup.org/

 See http://webstandardsgroup.org/mail/guidelines.cfm
 for some hints on posting to the list  getting help
**



Re: [WSG] converting WORD text into clean XHTML

2004-11-22 Thread Joseph Lindsay
Textism have a word cleaner that works quite well:
http://textism.com/wordcleaner/


On Tue, 23 Nov 2004 11:45:30 +1100, Hope A. Stewart
[EMAIL PROTECTED] wrote:
 On 23/11/04 9:19 AM, Wybrow, Mark [EMAIL PROTECTED] wrote:
 
  There is a tool in Dreamweaver that can auto generate but I must admit I
  have never used it ...
 
 I've found that the Clean Word HTML command in Dreamweaver helps but still
 leaves too much junk I don't want.
 
 If you use Mac OS, cut and paste from Word into AppleWorks and then save as
 an html document. If you use Windows, there might be another word processing
 app that will give you cleaner html.
 
 If there is still some junk coding from the AppleWorks produced html page, I
 get rid of it with Find and Replace.
 
 I'd love a better system to this work-around that I use, so I too will be
 interested to hear what others do.
 
 
 
 **
 The discussion list for  http://webstandardsgroup.org/
 
  See http://webstandardsgroup.org/mail/guidelines.cfm
  for some hints on posting to the list  getting help
 **
 
 


-- 
Gmail invites - just ask nicely
**
The discussion list for  http://webstandardsgroup.org/

 See http://webstandardsgroup.org/mail/guidelines.cfm
 for some hints on posting to the list  getting help
**



Re: [WSG] converting WORD text into clean XHTML

2004-11-22 Thread Lachlan Hardy
john wrote:
I'm wondering if there's some easy (and free) way to convert text from a 
WORD document into clean XHTML that retains the formatting.
Another addition: I just remembered that recent versions of Word allow 
you to save as HTML, Filtered. This is MS-speak for removing all Office 
specific tags. You still get that MSo-style rubbish, but it clears some 
of the more awkward stuff out straight away
**
The discussion list for  http://webstandardsgroup.org/

See http://webstandardsgroup.org/mail/guidelines.cfm
for some hints on posting to the list  getting help
**


Re: [WSG] converting WORD text into clean XHTML

2004-11-22 Thread Nick Lo
I asked much the same question a little while back and what I got 
together was:

First have the doc saved as HTML (Filtered) if it's coming from Word 
2003 (earlier versions can get the filtered thingy someone else 
mentioned).

Then in my case I wrote a filter for the content management system I 
built to pass the Word HTML through. What you could do is use one of 
the implementations of Tidy ( http://tidy.sourceforge.net/ ) e.g. For a 
web version try:

http://infohound.net/tidy/
...As I type this I'm just testing it on a big Word filtered HTML 
doc... and yes it seems to do a decent job.

Nick
Hi group.
I'm wondering if there's some easy (and free) way to convert text from 
a WORD document into clean XHTML that retains the formatting.

Thanks.
**
The discussion list for  http://webstandardsgroup.org/
See http://webstandardsgroup.org/mail/guidelines.cfm
for some hints on posting to the list  getting help
**