Hello Jeff,

To transform HTML to DocBook 5, please use
http://www.dbdoclet.org/archives/herold-6_0_1-68.exe. Save your Word
document as "filtered HTML", then run

    herold --profile "C:\Program Files (x86)\Herold\profiles\word.her" -i <Document.htm>

The profile path contains spaces, so it has to be quoted on the command
line. I tested this procedure with Word 2003 documents, so if you
encounter any problems, please let me know.
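
Since there are several documents to convert, the herold call could also be scripted. The following is only a minimal sketch in Python, not part of herold itself: the function names herold_command and convert are made up for illustration, the profile path is the one quoted in this message and may differ on another machine, and only the --profile and -i options mentioned here are used.

```python
import subprocess
from pathlib import PureWindowsPath

# Profile location as quoted in this message; adjust to the local install.
PROFILE = PureWindowsPath(r"C:\Program Files (x86)\Herold\profiles\word.her")


def herold_command(html_file, profile=PROFILE, exe="herold"):
    """Build the argument list for the herold call (hypothetical helper)."""
    # Only --profile and -i are used, exactly as in the message.
    return [exe, "--profile", str(profile), "-i", str(html_file)]


def convert(html_file):
    """Run herold on one filtered-HTML file (assumes herold is on PATH)."""
    # Passing a list instead of one string keeps the spaces in the
    # profile path intact without manual quoting.
    return subprocess.run(herold_command(html_file), check=True)
```

Calling convert("Document.htm") for each saved file would then run the same command as above, one document at a time.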

Regards,
Michael Fuchs
http://www.dbdoclet.org

On 24.05.2012 08:12, Jeff Powanda wrote:
>
> Sorry, just saw there was a recent post about saving Word to HTML and
> then using dbdoclet to convert to DocBook XML. I'll give that a try.
>
>  
>
> Regards,
>
> Jeff Powanda
>
> Vocera Communications, Inc.
>
>  
>
> *From:*Jeff Powanda [mailto:[email protected]]
> *Sent:* Wednesday, May 23, 2012 10:39 PM
> *To:* '[email protected]'
> *Subject:* [docbook-apps] Converting MS Word documents to DocBook 5 XML
>
>  
>
> What's the easiest way to convert MS Word 2007 documents to DocBook 5 XML?
>
>  
>
> I've tried using the DocBook roundtrip stylesheets. They seemed to
> work OK if I did the following:
>
> 1. Copied the DocBook styles in template.dot to the document.
>
> 2. Applied the DocBook styles to the document.
>
> 3. Saved the document as a Word 2003 XML file.
>
> 4. Converted the Word 2003 XML file to DocBook 5 XML.
>
>  
>
> This worked OK, but it was a lot of work to apply the DocBook styles
> to the document (and there are several documents to convert). Also,
> the resulting DocBook XML file has dbk namespace prefixes on all the
> elements. How do I remove them?
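
One possible way to drop the dbk prefixes is to re-serialize the file with the DocBook namespace as the default namespace. The sketch below uses Python's standard xml.etree.ElementTree and assumes that the dbk prefix is bound to the DocBook 5 namespace http://docbook.org/ns/docbook; the function name drop_dbk_prefix is made up for illustration. Note that this parser drops comments, processing instructions, and any DOCTYPE, so it is only a starting point.

```python
import xml.etree.ElementTree as ET

# Assumption: the dbk prefix in the converted file maps to this namespace.
DOCBOOK_NS = "http://docbook.org/ns/docbook"


def drop_dbk_prefix(xml_text):
    """Re-serialize so DocBook elements use the default namespace."""
    # Registering an empty prefix makes ElementTree write xmlns="..."
    # on the root element instead of xmlns:dbk="..." plus dbk: prefixes.
    ET.register_namespace("", DOCBOOK_NS)
    root = ET.fromstring(xml_text)
    return ET.tostring(root, encoding="unicode")


sample = (
    '<dbk:article xmlns:dbk="http://docbook.org/ns/docbook" version="5.0">'
    "<dbk:title>Example</dbk:title>"
    "</dbk:article>"
)
print(drop_dbk_prefix(sample))
```

The elements stay in the DocBook namespace; only the way the namespace is written changes, so the result is still DocBook 5.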
>
>  
>
> I'm not interested in the roundtrip aspect of the roundtrip
> stylesheets. I just want to get Word content into DocBook 5.
>
>  
>
> Regards,
>
> Jeff Powanda
>
> Vocera Communications, Inc.
>
>  
>
