Or use Word 2007 to do it for you, or save directly to HTML and use xdmp:tidy to clean up MS HTML code..
Kind regards, Geert > -----Original Message----- > From: [email protected] > [mailto:[email protected]] On Behalf Of > [email protected] > Sent: vrijdag 11 september 2009 13:24 > To: [email protected] > Subject: RE: [MarkLogic Dev General] Can we convert word 2007 > documentintoxhtml or DocBook xml using MarkLogic? > > > Geert, > > There is one option.. use openoffice API set along with xcc > for the same (Openoffice to convert 2007 to 2003). > > _________________________________________ > Pramit Ghosh > Program Manager - Consulting | Content & Digital Media > Information, Media & Entertainment | Cognizant Technology Solutions > Mobile: (201) 290-0913 | [email protected] > > -----Original Message----- > From: [email protected] > [mailto:[email protected]] On Behalf Of > Geert Josten > Sent: Friday, September 11, 2009 7:20 AM > To: General Mark Logic Developer Discussion > Subject: RE: [MarkLogic Dev General] Can we convert word 2007 > documentintoxhtml or DocBook xml using MarkLogic? > > Hi Anuj, > > Sorry for the late reply. > > > I have a requirement to convert word 2007 document into xhtml or > > DocBook xml . I know that we can do it for word 2003 documents. > > > > Is this possible for word 2007 docs? > > I am pretty certain that Word 2007 documents will be > converted to XHTML by default, so long as you make sure you > have 'Office OpenXML Extract' > pipelines added to the relevant domains. DocBook Conversion > is triggered by the 'structured-xhtml' state, so I reccon > that if you add that as well to the relevant domains, all > XHTML documents including the Word > 2007 documents will be converted to DocBook automatically. > > Kind regards, > Geert > > > Drs. G.P.H. Josten > Consultant > > > http://www.daidalos.nl/ > Daidalos BV > Source of Innovation > Hoekeindsehof 1-4 > 2665 JZ Bleiswijk > Tel.: +31 (0) 10 850 1200 > Fax: +31 (0) 10 850 1199 > http://www.daidalos.nl/ > KvK 27164984 > De informatie - verzonden in of met dit emailbericht - is > afkomstig van Daidalos BV en is uitsluitend bestemd voor de > geadresseerde. Indien u dit bericht onbedoeld hebt ontvangen, > verzoeken wij u het te verwijderen. Aan dit bericht kunnen > geen rechten worden ontleend. > > > > _______________________________________________ > General mailing list > [email protected] > http://xqzone.com/mailman/listinfo/general > > This e-mail and any files transmitted with it are for the > sole use of the intended recipient(s) and may contain > confidential and privileged information.If you are not the > intended recipient, please contact the sender by reply e-mail > and destroy all copies of the original message. > Any unauthorized review, use, disclosure, dissemination, > forwarding, printing or copying of this email or any action > taken in reliance on this e-mail is strictly prohibited and > may be unlawful. > _______________________________________________ > General mailing list > [email protected] > http://xqzone.com/mailman/listinfo/general > _______________________________________________ General mailing list [email protected] http://xqzone.com/mailman/listinfo/general
