[ 
https://issues.apache.org/jira/browse/PDFBOX-5490?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17578510#comment-17578510
 ] 

Tim Allison commented on PDFBOX-5490:
-------------------------------------

My initial request would be for whether or not the xref table had to be 
rebuilt...largely because I'm somewhat interested in that at the moment. 

Any info at the pre-DOM stage for what had to be guessed or assumed -- alleged 
obj stream length != actual object stream.

Other places where PDFBox currently logs warnings (missing font, missing 
unicode mappings etc) after the DOM has been built would also be useful.

> Add reconstruction information to the PDDocument
> ------------------------------------------------
>
>                 Key: PDFBOX-5490
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-5490
>             Project: PDFBox
>          Issue Type: Wish
>          Components: Parsing
>            Reporter: Tim Allison
>            Priority: Minor
>
> When the xref has to be rebuilt or there are other anomalies in the parsing 
> of the PDDocument, the results are currently logged.  In a multithreaded 
> environment it is not easy to reconstruct which documents had which problems.
> It would be helpful if a PDF was able to be successfully loaded to include 
> information about what had to be fixed in order to load it successfully.  
> Certainly, rebuilding the xref table comes to mind, but any other info would 
> also be useful.
> This is a wish for 3.x.  I don't think I'll have time to contribute. :(



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to