Thanks to Tilman for pointing me to PDFBOX-3246, I now have two PDFs with
embedded JP2 (JPEG 2000) images to work with!
How can I extract those images? I've effectively copied and pasted PDFBox's
ExtractImages into Tika, and I'm using ImageIOUtil.writeImage(image, suffix, out)
to write non-JPEG images.
When I run this against the file on PDFBOX-3246 and my other JP2 file [1],
writeImage returns false and 0 bytes are written to my output stream.
Note that rendering of the full page via PDFRenderer basically works.
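
One way to narrow this down, offered as a sketch and not as a confirmed diagnosis: javax.imageio.ImageIO.write returns false when no registered writer plugin matches the requested format name, so it may be worth checking whether the JVM has any JPEG 2000 writer at all. The class and method names below (ImageWriterCheck, hasWriterForSuffix, hasWriterForFormat, tryWrite) are hypothetical helpers, and the assumption that the failure is caused by a missing writer is untested. The code uses only the JDK, not PDFBox. Note that ImageIO.write looks writers up by format name, not by file suffix, so both lookups are probed.

```java
import java.awt.image.BufferedImage;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import javax.imageio.ImageIO;

// Hypothetical diagnostic helper; not part of PDFBox or Tika.
public class ImageWriterCheck {

    // True if at least one ImageIO writer plugin is registered for this file suffix.
    static boolean hasWriterForSuffix(String suffix) {
        return ImageIO.getImageWritersBySuffix(suffix).hasNext();
    }

    // True if at least one ImageIO writer plugin is registered for this format name.
    static boolean hasWriterForFormat(String formatName) {
        return ImageIO.getImageWritersByFormatName(formatName).hasNext();
    }

    // Returns the number of bytes written, or -1 if ImageIO found no suitable writer.
    static int tryWrite(BufferedImage image, String formatName) throws IOException {
        // Keep everything in memory instead of using a temporary cache file.
        ImageIO.setUseCache(false);
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        boolean written = ImageIO.write(image, formatName, out);
        return written ? out.size() : -1;
    }

    public static void main(String[] args) throws IOException {
        // The names probed here are guesses at what a JPEG 2000 plugin might register.
        for (String name : new String[] {"png", "jp2", "jpx", "jpeg2000"}) {
            System.out.println(name
                    + ": by suffix=" + hasWriterForSuffix(name)
                    + ", by format name=" + hasWriterForFormat(name));
        }
        BufferedImage probe = new BufferedImage(4, 4, BufferedImage.TYPE_INT_RGB);
        System.out.println("jp2 write result: " + tryWrite(probe, "jp2"));
    }
}
```

If every JPEG 2000 lookup reports false, the 0-byte output would be consistent with a missing writer plugin and not with a problem in the PDFs themselves. That would also fit with page rendering working, since rendering only needs a reader.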
Thank you!
Best,
Tim
[1] https://github.com/openpreserve/format-corpus/blob/master/pdfCabinetOfHorrors/balloon_a1b_jp2k.pdf