Christian K. wrote:

I've stumbled upon a few more questions when converting DOCX to DITAMAP:


1. What is the unit of attributes width and height in image tags? Images
in the rendered PDF are often outside of the page.


XMLmind Word To XML converts the image width and height specified in the DOCX file by MS-Word from a strange unit (twentieth of a point, I would say) to CSS pixels. This works well in our own documents.



--> If you use Apache FOP to generate PDF, then this is probably a FOP problem.

Unless the DPI information is found in the image metadata, FOP considers that the DPI of an image is by default 72DPI, which is ridiculously small. Other XSL-FO processors use 96DPI or 120DPI. The *modified* copy of FOP we ship with some of our products uses 120DPI too.

The workaround for this problem is to properly configure FOP. See https://xmlgraphics.apache.org/fop/2.3/configuration.html





2. Is there a way to keep explicit pagebreaks made in the DOCX file? Or
at least a way to tell that topics should start on a new page?

There is no way to express a page break in DITA XML (I mean, there is no XML element or attribute for that). Hence there is no way to convert a page break to DITA XML.

Automatically generating page breaks is the job of the DITA toolkit (e.g. the DITA Open Toolkit or our own XMLmind DITA Converter) used to convert DITA to other formats.

*Forced* page breaks *may* be supported by your DITA toolkit. For example, our in house XMLmind DITA Converter has a <?pagebreak?> processing-instruction for that.




3. Header and footer including images in those headers and footers are
not exported / converted. Is that possible?


Same answer as above.

There is no way to express headers and footers in DITA XML (I mean, there is no XML element or attribute for that). Hence there is no way to convert a DOCX header or footer to DITA XML.

Automatically generating nice headers and footers is the job of the DITA toolkit (e.g. the DITA Open Toolkit or our own XMLmind DITA Converter) used to convert DITA to other formats.




--
XMLmind Word To XML Support List
w2x-support@xmlmind.com
http://www.xmlmind.com/mailman/listinfo/w2x-support

Reply via email to