[ 
https://issues.apache.org/jira/browse/PDFBOX-2250?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14093106#comment-14093106
 ] 

Andreas Lehmkühler commented on PDFBOX-2250:
--------------------------------------------

In contrast to the old parser, the non-sequential one didn't parse 
cross-reference streams. I've added that feature, so object references 
for compressed objects in particular can now be found.
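
As background on why cross-reference streams matter for compressed objects, here is a minimal sketch of how their entries are laid out according to the PDF specification. It is not the PDFBox implementation. The class name XRefStreamSketch and its helpers are hypothetical, and the sketch assumes the stream data has already been decompressed and the /W field widths have been read from the stream dictionary. Only type 2 entries point into object streams, which is why a parser that skips cross-reference streams cannot locate compressed objects.

```java
import java.util.ArrayList;
import java.util.List;

public class XRefStreamSketch {

    /** One decoded row: type 0 = free, 1 = uncompressed, 2 = stored in an object stream. */
    static final class Entry {
        final int type;
        final long field2; // type 1: byte offset; type 2: object number of the object stream
        final long field3; // type 1: generation number; type 2: index within the object stream

        Entry(int type, long field2, long field3) {
            this.type = type;
            this.field2 = field2;
            this.field3 = field3;
        }

        @Override
        public String toString() {
            return "Entry[type=" + type + ", field2=" + field2 + ", field3=" + field3 + "]";
        }
    }

    /** Reads a big-endian unsigned integer that is 'width' bytes long. */
    static long readField(byte[] data, int offset, int width) {
        long value = 0;
        for (int i = 0; i < width; i++) {
            value = (value << 8) | (data[offset + i] & 0xFF);
        }
        return value;
    }

    /** Splits already decoded stream data into rows using the three /W field widths. */
    static List<Entry> decode(byte[] data, int[] w) {
        int rowLength = w[0] + w[1] + w[2];
        List<Entry> entries = new ArrayList<>();
        if (rowLength == 0) {
            return entries; // nothing to decode, and avoids an endless loop
        }
        for (int pos = 0; pos + rowLength <= data.length; pos += rowLength) {
            // A zero-width type field means the type defaults to 1.
            int type = w[0] == 0 ? 1 : (int) readField(data, pos, w[0]);
            long field2 = readField(data, pos + w[0], w[1]);
            long field3 = readField(data, pos + w[0] + w[1], w[2]);
            entries.add(new Entry(type, field2, field3));
        }
        return entries;
    }

    public static void main(String[] args) {
        // Made-up sample rows for /W [1 2 1]
        byte[] data = {
            1, 0x01, 0x2C, 0, // uncompressed object at byte offset 300, generation 0
            2, 0x00, 0x0A, 3  // compressed object: object stream 10, index 3
        };
        for (Entry e : decode(data, new int[] {1, 2, 1})) {
            System.out.println(e);
        }
    }
}
```

A real parser would additionally have to undo any stream filter and predictor and map the rows to object numbers via the /Index array, which this sketch leaves out.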

This should further improve the parser when it comes to PDFs using object 
streams. I've used this [sample 
pdf|http://bewerbung.fh-kaernten.at/fileadmin/Anleitung-PDF-erstellen.pdf]  
provided by Martin Tappler on dev@pdfboxf

> Improve XRef self healing mechanism
> -----------------------------------
>
>                 Key: PDFBOX-2250
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-2250
>             Project: PDFBox
>          Issue Type: Improvement
>          Components: Parsing
>    Affects Versions: 1.8.6, 1.8.7, 2.0.0
>            Reporter: Andreas Lehmkühler
>            Assignee: Andreas Lehmkühler
>
> PDFBOX-1769 introduced a "self healing" mechanism to repair corrupt XRef 
> offsets. But that one was just a starter and there remain a lot of issues to 
> be solved. I'm planning to solve at least some of them.
> All fixes and improvements are targeting the non-sequential parser and I 
> won't port those changes to the old parser.



--
This message was sent by Atlassian JIRA
(v6.2#6252)
