[ 
https://issues.apache.org/jira/browse/PDFBOX-2163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14046255#comment-14046255
 ] 

Tilman Hausherr commented on PDFBOX-2163:
-----------------------------------------

Fixed for the trunk in http://svn.apache.org/r1606177

I now look in the output stream to see whether the bytes before a candidate EI 
are 70 ASCII85 bytes. If they are, then this EI doesn't count as the end of the 
inline image. All the files above now render properly, and so do all the older 
files with inline images that I kept.
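
To illustrate the idea, here is a minimal sketch of such a look-back check. It is 
not the committed code: the class name, the method names and the choice to treat 
whitespace and the 'z' shorthand as non-ASCII85 are assumptions made for the 
example. Only the figure of 70 bytes comes from the comment above.

```java
/** Hypothetical helper, not the PDFBox API: illustrates the look-back heuristic. */
final class Ascii85LookBack {
    /** Number of trailing bytes to inspect; 70 is the figure mentioned above. */
    static final int LOOK_BACK = 70;

    // ASCII85 encodes data with the printable characters '!' (33) through 'u' (117).
    // Assumption: whitespace and the 'z' shorthand are treated as non-matching here.
    static boolean isAscii85Char(byte b) {
        return b >= '!' && b <= 'u';
    }

    /**
     * Returns true if the last LOOK_BACK bytes collected before a candidate "EI"
     * all look like ASCII85 characters. In that case the candidate is assumed to be
     * part of the encoded image data rather than the end-of-image operator.
     */
    static boolean hasPrecedingAscii85Data(byte[] collected) {
        if (collected.length < LOOK_BACK) {
            return false; // too little data to decide; leave it to other checks
        }
        for (int i = collected.length - LOOK_BACK; i < collected.length; i++) {
            if (!isAscii85Char(collected[i])) {
                return false;
            }
        }
        return true;
    }
}
```

A parser would call something like this with the image bytes gathered so far each 
time it meets "EI", and keep reading if the result is true.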

> inline image with EI in the middle incorrectly parsed
> -----------------------------------------------------
>
>                 Key: PDFBOX-2163
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-2163
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>            Reporter: Tilman Hausherr
>            Assignee: Tilman Hausherr
>
> This PDF
> http://digitalcorpora.org/corp/nps/files/govdocs1/876/876636.pdf
> causes an exception because the end of an inline image is improperly detected. 
> The stream looks like this:
> {code}
> BI
>   /W 452
>   /H 169
>   /BPC 8
>   /CS /RGB
>   /D [0.0 1.0 0.0 1.0 0.0 1.0]
>   /F [/A85 /Fl]
> ID
> ......................................................
> ....................................................EI
> ......................................................
> ...
> ....
> EI Q
> {code}
> The inline images are handled in PDFStreamParser. This is tricky: we look for 
> follow-up binary data to check that an EI isn't in the middle of the image, 
> but here what follows isn't binary data but ASCII85 text. We also can't 
> require an LF before the EI, because I remember a PDF at work, created by a 
> well-known company, that doesn't use one.



--
This message was sent by Atlassian JIRA
(v6.2#6252)
