[
https://issues.apache.org/jira/browse/PDFBOX-2521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14225868#comment-14225868
]
Andreas Lehmkühler commented on PDFBOX-2521:
--------------------------------------------
[~ssteiner1] Thanks for the patch. It indicates that either your PDF has a
malformed hybrid xref table or PDFBox has an issue reading it correctly. Your
patch isn't optimal, as it just guesses that something may be wrong. I'd like
to improve the parser itself so that it detects such cases on its own and we
know for certain that something is wrong. Is there any chance you could share
the relevant parts of the PDF or, better still, send me the PDF?
> Don't throw IOException if stream length is missing in lenient mode
> -------------------------------------------------------------------
>
> Key: PDFBOX-2521
> URL: https://issues.apache.org/jira/browse/PDFBOX-2521
> Project: PDFBox
> Issue Type: Bug
> Components: Parsing
> Affects Versions: 1.8.7, 2.0.0
> Reporter: Andreas Lehmkühler
> Assignee: Andreas Lehmkühler
> Fix For: 1.8.8, 2.0.0
>
> Attachments: nonseq.patch
>
>
> The non-sequential parser throws an IOException if the length of a stream is
> missing. This should be avoided in lenient mode, especially as the parser
> checks the length afterwards and uses the fallback readUntilEnd if the length
> is invalid.
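
The behaviour requested in the description can be sketched roughly as follows.
This is a minimal, self-contained illustration and not the PDFBox code: the
class LenientStreamLengthSketch, its methods, and the in-memory byte[] input
are all hypothetical, and only the name readUntilEnd is taken from the issue
text. It assumes that a missing length is fatal only in strict mode and that an
invalid length always triggers the fallback scan for "endstream".

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Hypothetical sketch; none of these names are PDFBox API.
final class LenientStreamLengthSketch {
    private static final byte[] END_MARKER =
            "endstream".getBytes(StandardCharsets.ISO_8859_1);

    // True if the bytes at 'pos' spell out the end marker.
    static boolean startsWithEndMarker(byte[] data, int pos) {
        if (pos < 0 || pos + END_MARKER.length > data.length) {
            return false;
        }
        for (int j = 0; j < END_MARKER.length; j++) {
            if (data[pos + j] != END_MARKER[j]) {
                return false;
            }
        }
        return true;
    }

    // A declared length is accepted only if "endstream" follows the body,
    // optionally preceded by end-of-line characters.
    static boolean isLengthValid(byte[] data, int start, long length) {
        if (length < 0 || start + length > data.length) {
            return false;
        }
        int pos = (int) (start + length);
        while (pos < data.length && (data[pos] == '\r' || data[pos] == '\n')) {
            pos++;
        }
        return startsWithEndMarker(data, pos);
    }

    // Fallback: scan forward for "endstream" and drop one trailing EOL.
    static byte[] readUntilEnd(byte[] data, int start) throws IOException {
        for (int end = start; end < data.length; end++) {
            if (startsWithEndMarker(data, end)) {
                if (end > start && data[end - 1] == '\n') {
                    end--;
                }
                if (end > start && data[end - 1] == '\r') {
                    end--;
                }
                return Arrays.copyOfRange(data, start, end);
            }
        }
        throw new IOException("Missing endstream marker.");
    }

    // declaredLength == null models a stream whose length is missing.
    static byte[] readStreamBody(byte[] data, int start, Long declaredLength,
                                 boolean lenient) throws IOException {
        if (declaredLength == null) {
            if (!lenient) {
                // Assumption: only strict mode treats a missing length as fatal.
                throw new IOException("Missing length for stream.");
            }
        } else if (isLengthValid(data, start, declaredLength)) {
            return Arrays.copyOfRange(data, start, (int) (start + declaredLength));
        }
        // Missing (lenient mode) or invalid length: use the fallback.
        return readUntilEnd(data, start);
    }
}
```

With a buffer holding "Hello" followed by a newline and "endstream", calling
readStreamBody with a null length in lenient mode, with the correct length 5,
or with a wrong length such as 3 yields the same five body bytes in each case.
Only the combination of a null length and strict mode throws.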