Dear Damiano,

Thanks for reaching out to me, and thanks for your question. I am CC'ing the [email protected] list, as that is where development on the project occurs and where most of the devs hang out.

I'm not sure of the exact cause yet. The best bet (and it looks like you did this) is to attach the offending file to a JIRA ticket at http://issues.apache.org/jira/browse/TIKA and describe the issue you are having. Then we will work together to try to come to a resolution.

Thanks for looking at Tika and we will be sure to help out. If you'd like to subscribe to the dev list please send a blank email to the dev list, [email protected] and follow the instructions from there.

Cheers,
Chris

------------------------
Chris Mattmann
[email protected]

-----Original Message-----
From: Damiano Porta <[email protected]>
Date: Thursday, August 14, 2014 7:30 AM
To: Chris Mattmann <[email protected]>
Subject: [Tika] Embedded images in PDF documents

>Hello Chris! It is nice to meet you. I am Damiano from Italy.
>
>I just found your email on GitHub as Tika contributor. Unfortunately Tika
>has not a big community where to ask for support.
>
>
>I found a problem with PDF documents that have embedded images.
>
>
>Doing: java -jar tika-app-1.5.jar --extract tika.pdf
>
>
>Tika can not find the image.
>
>
>Is this a PDF related problem? Because if i do the same thing with DOC
>documents it works perfectly.
>
>
>I am looking forward to your reply.
>Have a great day
>
>
>Damiano
>
