Hi list, I am working on a project where I want to find out if an Office (97-2003 formats) document has any external links (i.e. not embedded, but linked) to other documents. So far I have got Word and Excel documents playing nicely, by parsing the "\1Ole" streams in them to find the links. Since PowerPoint docs don't seem to use the OLE streams, I've been trying to use HSLF to find them.
I am currently using POI 3.6 from the Debian sid repositories (libapache-poi-java 3.6+dfsg-2).

Now, my problem is that both HSLFSlideShow.getEmbeddedObjects() and the method for reading hyperlinks described at http://poi.apache.org/slideshow/how-to-shapes.html#Hyperlinks only find the Excel spreadsheets that I have linked to, not the Word documents (I've only tried those two types at this point). If I look at a hex dump of the whole ppt file, there is a record containing the filename of the linked .doc file, but I have no idea what stream it is in, or how to get at it.

Does anyone have any ideas on how to get at those links, or even why it is like this?

Cheers
Nik
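P.S. For anyone who wants to reproduce the hex-dump observation, here is a minimal sketch in plain Java (no POI involved) that reports the byte offsets at which a known filename appears in the raw .ppt file. The class name LinkNameScanner and its methods are made up for illustration, and checking both UTF-16LE and single-byte text is an assumption, since I don't know how the name is actually stored. It only locates the bytes; it does not tell you which stream or record they belong to.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;

// Hypothetical diagnostic helper, not part of POI.
public class LinkNameScanner {

    /** Returns every offset at which needle occurs in haystack. */
    public static List<Integer> findAll(byte[] haystack, byte[] needle) {
        List<Integer> hits = new ArrayList<>();
        if (needle.length == 0) {
            return hits;
        }
        for (int i = 0; i + needle.length <= haystack.length; i++) {
            boolean match = true;
            for (int j = 0; j < needle.length; j++) {
                if (haystack[i + j] != needle[j]) {
                    match = false;
                    break;
                }
            }
            if (match) {
                hits.add(i);
            }
        }
        return hits;
    }

    /**
     * Looks for the name encoded as UTF-16LE first, then as single-byte text.
     * Assumption: the linked filename is stored in one of these two forms.
     */
    public static List<Integer> findName(byte[] data, String name) {
        List<Integer> hits = new ArrayList<>(
                findAll(data, name.getBytes(StandardCharsets.UTF_16LE)));
        hits.addAll(findAll(data, name.getBytes(StandardCharsets.ISO_8859_1)));
        return hits;
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 2) {
            System.err.println("usage: LinkNameScanner <file.ppt> <linked-filename>");
            return;
        }
        byte[] data = Files.readAllBytes(Paths.get(args[0]));
        for (int offset : findName(data, args[1])) {
            System.out.printf("found at offset 0x%08X%n", offset);
        }
    }
}
```

Run it with the presentation as the first argument and the linked document's filename as the second. The match is case-sensitive, and the offsets are positions in the OLE2 container file as a whole, so they still have to be mapped back to a stream by hand.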
