docx files are zip files containing lots of XML files. You will also find the 
image files.
You can unzip your file and look at the XML to try to find out how the images 
are referenced.
If you know what XML is being used, you can try to work out how the 
poi-ooxml-full classes can be used to read or manipulate the XML. These classes 
are generated from the XML schemas that have been made public by Microsoft. You 
can get the schemas online or run the POI build and look at the xsd files that 
are downloaded (or exploded from zips that contain the xsds).






On Monday 2 September 2024 at 13:00:11 IST, Janne Jokitalo 
<janne.jokit...@gmail.com> wrote: 





Hi,

I have a project in which I have to deal with a massive amount of Docx
documents. I need to construct one collective manual out of tens of
individual documents.

During the merging process I need to copy some images. Now normally this
should be relatively easy by calling XWPFRun.getEmbeddedPictures(), but in
my case this doesn't provide any output.

So I'm guessing the pictures aren't purely embedded, but linked or attached
in some other way. I can, however, get the required images with
XWPFDocument.getAllPictures(). But in this method I get XWPFPictureData
objects, which lack some properties an XWPFPicture has, like which XWPFRun
the image "belongs" to.

Does anyone here have an idea how that link could be made? I thought of
relationships, but apparently they're only on the physical side, ie. which
document the image belongs to, not logical side, ie. which text part (aka
run) it belongs to.

Any help would be appreciated!

Best regards,

Janne Jokitalo

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscr...@poi.apache.org
For additional commands, e-mail: user-h...@poi.apache.org

Reply via email to