Thanks to Tilman for pointing me to PDFBOX-3246, I now have 2 pdfs with 
embedded jp2 to work with!

How can I extract those?  I've effectively copied/pasted PDFBox's ExtractImages 
into Tika, and I'm using ImageIOUtil.writeImage(image, suffix, out) to write 
non-jpeg images. 

When I run this against the file on PDFBOX-3246 and my other jp2 file[1], 
writeImage returns false and my outputstream has 0 bytes written to it.

Note that rendering of the full page via PDFRenderer basically works.

Thank you!

Best,

                   Tim

[1] 
https://github.com/openpreserve/format-corpus/blob/master/pdfCabinetOfHorrors/balloon_a1b_jp2k.pdf


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to