Tilman Hausherr created PDFBOX-1916:
---------------------------------------
Summary: java.lang.ArrayIndexOutOfBoundsException in inlineimage
Key: PDFBOX-1916
URL: https://issues.apache.org/jira/browse/PDFBOX-1916
Project: PDFBox
Issue Type: Bug
Affects Versions: 2.0.0
Reporter: Tilman Hausherr
Priority: Minor
I get this with page 8 of the attached file:
13.02.2014 20:10:10.809 WARN [main] org.apache.pdfbox.util.PDFStreamEngine:546
- java.lang.ArrayIndexOutOfBoundsException: 4
java.lang.ArrayIndexOutOfBoundsException: 4
at
org.apache.pdfbox.filter.TIFFFaxDecoder.decodeT6(TIFFFaxDecoder.java:1153)
at
org.apache.pdfbox.filter.CCITTFaxDecodeFilter.decode(CCITTFaxDecodeFilter.java:126)
at
org.apache.pdfbox.pdmodel.graphics.xobject.PDInlinedImage.createImage(PDInlinedImage.java:161)
at
org.apache.pdfbox.util.operator.pagedrawer.BeginInlineImage.process(BeginInlineImage.java:60)
at
org.apache.pdfbox.util.PDFStreamEngine.processOperator(PDFStreamEngine.java:533)
at
org.apache.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:261)
at
org.apache.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:236)
at
org.apache.pdfbox.pdfviewer.PageDrawer.drawType3String(PageDrawer.java:444)
at
org.apache.pdfbox.pdfviewer.PageDrawer.processTextPosition(PageDrawer.java:295)
at
org.apache.pdfbox.util.PDFStreamEngine.processEncodedText(PDFStreamEngine.java:489)
at org.apache.pdfbox.util.operator.ShowText.process(ShowText.java:44)
at
org.apache.pdfbox.util.PDFStreamEngine.processOperator(PDFStreamEngine.java:533)
at
org.apache.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:261)
at
org.apache.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:227)
at
org.apache.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.java:209)
at org.apache.pdfbox.pdfviewer.PageDrawer.drawPage(PageDrawer.java:151)
at org.apache.pdfbox.util.RenderUtil.renderPage(RenderUtil.java:212)
at org.apache.pdfbox.util.RenderUtil.convertToImage(RenderUtil.java:177)
at pdfboxpageimageextraction.ExtractImages.doPdf(ExtractImages.java:273)
at pdfboxpageimageextraction.ExtractImages.main(ExtractImages.java:77)
Some observations:
- I can't see what image is missing when rendered
- The data read between ID and EI includes the LF (0x0A). I tried to remove
that in debugging, but the exception came anyway.
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)