docx files are zip files containing lots of XML files. You will also find the image files. You can unzip your file and look at the XML to try to find out how the images are referenced. If you know what XML is being used, you can try to work out how the poi-ooxml-full classes can be used to read or manipulate the XML. These classes are generated from the XML schemas that have been made public by Microsoft. You can get the schemas online or run the POI build and look at the xsd files that are downloaded (or exploded from zips that contain the xsds).
On Monday 2 September 2024 at 13:00:11 IST, Janne Jokitalo <janne.jokit...@gmail.com> wrote: Hi, I have a project in which I have to deal with a massive amount of Docx documents. I need to construct one collective manual out of tens of individual documents. During the merging process I need to copy some images. Now normally this should be relatively easy by calling XWPFRun.getEmbeddedPictures(), but in my case this doesn't provide any output. So I'm guessing the pictures aren't purely embedded, but linked or attached in some other way. I can, however, get the required images with XWPFDocument.getAllPictures(). But in this method I get XWPFPictureData objects, which lack some properties an XWPFPicture has, like which XWPFRun the image "belongs" to. Does anyone here have an idea how that link could be made? I thought of relationships, but apparently they're only on the physical side, ie. which document the image belongs to, not logical side, ie. which text part (aka run) it belongs to. Any help would be appreciated! Best regards, Janne Jokitalo --------------------------------------------------------------------- To unsubscribe, e-mail: user-unsubscr...@poi.apache.org For additional commands, e-mail: user-h...@poi.apache.org