The conversion pipelines currently apply only to documents from Office 2003 and earlier. For Office 2007/2010 formats we work with the XML directly: the Office Open XML Extract pipeline unzips the .docx and .pptx files and creates a *_parts directory containing their XML components.
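To illustrate why no separate conversion step is needed for these formats, here is a minimal Python sketch of what "unzipping into a *_parts directory" amounts to. It is not the MarkLogic pipeline itself; it only shows that a .docx or .pptx is a zip package whose members are already XML. The function names, the stand-in sample package, and the exact directory naming ("/AuthoringGuide.docx" becoming "/AuthoringGuide_docx_parts/") are assumptions made for the example; only the "_parts" suffix comes from the description above.

```python
import io
import zipfile


def list_package_parts(package_bytes, package_uri):
    """Return {part_uri: bytes} for every XML part inside an OOXML package."""
    # Hypothetical naming: "/a.docx" -> "/a_docx_parts/". Only the "_parts"
    # suffix is taken from the thread; the rest is an assumption.
    parts_dir = package_uri.replace(".", "_") + "_parts/"
    parts = {}
    with zipfile.ZipFile(io.BytesIO(package_bytes)) as package:
        for name in package.namelist():
            # Keep the XML components (.rels files are XML too).
            if name.endswith((".xml", ".rels")):
                parts[parts_dir + name] = package.read(name)
    return parts


def make_sample_docx():
    """Build a tiny stand-in package in memory (not a valid Word document)."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as package:
        package.writestr("[Content_Types].xml", "<Types/>")
        package.writestr("word/document.xml", "<w:document/>")
    return buffer.getvalue()


if __name__ == "__main__":
    sample = make_sample_docx()
    for uri in sorted(list_package_parts(sample, "/AuthoringGuide.docx")):
        print(uri)
```

Running the same function over the bytes of a real .docx would list members such as word/document.xml, which is the kind of content one would expect to find under the *_parts directory once the extract pipeline has run.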
Hope this helps,
Pete

-----Original Message-----
From: [email protected] [mailto:[email protected]] On Behalf Of Jakob Fix
Sent: Wednesday, February 22, 2012 9:37 AM
To: General Mark Logic Developer Discussion
Subject: [MarkLogic Dev General] cpf pipeline question

Hi,

So i'm experimenting with the conversion option in MarkLogic (v5.0). CPF is installed and enabled, conversion is set to true. Import of docx and pptx is via WebDAV.

However, conversion visibly doesn't take place. I set logging to "finest", so I see lots of "skipped" lines but no outright errors:

2012-02-22 15:31:17.416 Fine: TaskServer: Documents: on-any-property skipping /AuthoringGuide.docx

Uploaded documents are visible via QC's "Explore", their type is "binary", and the properties don't show any errors, e.g.:

<prop:properties xmlns:prop="http://marklogic.com/xdmp/property">
  <cpf:processing-status xmlns:cpf="http://marklogic.com/cpf">done</cpf:processing-status>
  <cpf:property-hash xmlns:cpf="http://marklogic.com/cpf">d41d8cd98f00b204e9800998ecf8427e</cpf:property-hash>
  <cpf:last-updated xmlns:cpf="http://marklogic.com/cpf">2012-02-22T15:23:04.949+01:00</cpf:last-updated>
  <cpf:state xmlns:cpf="http://marklogic.com/cpf">http://marklogic.com/states/converted</cpf:state>
  <cpf:self xmlns:cpf="http://marklogic.com/cpf">/AuthoringGuide.docx</cpf:self>
  <prop:last-modified>2012-02-22T15:23:04+01:00</prop:last-modified>
</prop:properties>

So, no _toc.xml file or _parts directory is created with XML inside.

Could somebody please tell me what else to check?

Thanks,
Jakob.

_______________________________________________
General mailing list
[email protected]
http://developer.marklogic.com/mailman/listinfo/general
