Hi,
On 14.08.2013 20:25, [email protected] wrote:
Hello!
I tried to extract images from PDFs generated by a scanner in .NET. It works
for TIF but fails for JPGs.
Sounds like a .NET-related issue. Have you tried the Java version? Which
version of PDFBox are you using?
I've also tested it with the compiled ExtractImages.exe and got the following error:
Exception in thread "main" java.lang.NullPointerException
    at org.apache.pdfbox.util.ImageIOUtil.addResolution(ImageIOUtil.java:213)
    at org.apache.pdfbox.util.ImageIOUtil.createMetadata(ImageIOUtil.java:204)
    at org.apache.pdfbox.util.ImageIOUtil.writeImage(ImageIOUtil.java:158)
    at org.apache.pdfbox.util.ImageIOUtil.writeImage(ImageIOUtil.java:114)
    at org.apache.pdfbox.util.ImageIOUtil.writeImage(ImageIOUtil.java:97)
    at org.apache.pdfbox.pdmodel.graphics.xobject.PDJpeg.write2OutputStream(PDJpeg.java:298)
    at org.apache.pdfbox.pdmodel.graphics.xobject.PDXObjectImage.write2file(PDXObjectImage.java:224)
    at org.apache.pdfbox.ExtractImages.processResources(ExtractImages.java:202)
    at org.apache.pdfbox.ExtractImages.extractImages(ExtractImages.java:160)
    at org.apache.pdfbox.ExtractImages.main(ExtractImages.java:66)
br,
Martin
(See attached file: TIF.pdf)(See attached file: JPG.pdf)
Your attachments didn't make it through due to restrictions on the mailing list.
Please upload the PDFs to a file-hosting service or something similar.
BR
Andreas Lehmkühler