Hello Jeff, to transform HTML to DocBook5 please use http://www.dbdoclet.org/archives/herold-6_0_1-68.exe. Save your Word document as "filtered HTML". Run herold --profile C:\Program Files (x86)\Herold\profiles\word.her -i <Document.htm>. I tested this procedure with Word 2003 documents, so if encounter any problems, please let me know.
Regards, Michael Fuchs http://www.dbdoclet.org Am 24.05.2012 08:12, schrieb Jeff Powanda: > > Sorry, just saw there was a recent post about saving Word to HTML and > then using dbdoclet to convert to DocBook XML. I'll give that a try. > > > > Regards, > > Jeff Powanda > > Vocera Communications, Inc. > > > > *From:*Jeff Powanda [mailto:[email protected]] > *Sent:* Wednesday, May 23, 2012 10:39 PM > *To:* '[email protected]' > *Subject:* [docbook-apps] Converting MS Word documents to DocBook 5 XML > > > > What's the easiest way to convert MS Word 2007 documents to DocBook 5 XML? > > > > I've tried using the DocBook roundtrip stylesheets. They seemed to > work OK if I did the following: > > 1. Copied the DocBook styles in template.dot to the document. > > 2. Applied the DocBook styles to the document. > > 3. Saved the document as a Word 2003 XML file. > > 4. Converted the Word 2003 XML file to DocBook 5 XML. > > > > This worked OK, but it was a lot of work to apply the DocBook styles > to the document (and there are several documents to convert). Also, > the resulting DocBook XML file has dbk namespace prefixes on all the > elements. How do I remove them? > > > > I'm not interested in the roundtrip aspect of the roundtrip > stylesheets. I just want to get Word content into DocBook 5. > > > > Regards, > > Jeff Powanda > > Vocera Communications, Inc. > > >
