Thanks, I appreciate the tip. I gave it a try, and Herold works wonderfully!

Regards,
Jeff

From: Michael Fuchs [mailto:[email protected]]
Sent: Thursday, May 24, 2012 1:19 AM
To: Jeff Powanda
Cc: '[email protected]'
Subject: Re: [docbook-apps] RE: Converting MS Word documents to DocBook 5 XML

Hello Jeff,

to transform HTML to DocBook5 please use 
http://www.dbdoclet.org/archives/herold-6_0_1-68.exe. Save your Word document 
as "filtered HTML". Run herold --profile C:\Program Files 
(x86)\Herold\profiles\word.her -i <Document.htm>. I tested this procedure with 
Word 2003 documents, so if encounter any problems, please let me know.

Regards,
Michael Fuchs
http://www.dbdoclet.org

Am 24.05.2012 08:12, schrieb Jeff Powanda:
Sorry, just saw there was a recent post about saving Word to HTML and then 
using dbdoclet to convert to DocBook XML. I'll give that a try.

Regards,
Jeff Powanda
Vocera Communications, Inc.

From: Jeff Powanda [mailto:[email protected]]
Sent: Wednesday, May 23, 2012 10:39 PM
To: 
'[email protected]<mailto:[email protected]>'
Subject: [docbook-apps] Converting MS Word documents to DocBook 5 XML

What's the easiest way to convert MS Word 2007 documents to DocBook 5 XML?

I've tried using the DocBook roundtrip stylesheets. They seemed to work OK if I 
did the following:

1.       Copied the DocBook styles in template.dot to the document.

2.       Applied the DocBook styles to the document.

3.       Saved the document as a Word 2003 XML file.

4.       Converted the Word 2003 XML file to DocBook 5 XML.

This worked OK, but it was a lot of work to apply the DocBook styles to the 
document (and there are several documents to convert). Also, the resulting 
DocBook XML file has dbk namespace prefixes on all the elements. How do I 
remove them?

I'm not interested in the roundtrip aspect of the roundtrip stylesheets. I just 
want to get Word content into DocBook 5.

Regards,
Jeff Powanda
Vocera Communications, Inc.


Reply via email to