[
https://issues.apache.org/jira/browse/PDFBOX-1037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13056478#comment-13056478
]
Abraham Farris commented on PDFBOX-1037:
----------------------------------------
I made the modification to PDFParser.java and it returns the correct page
count. You are correct in your theory.
I appreciate your help. One last question - is there a public method to get a
list of conflicts or if the PDF cannot be parsed 100%? The reason I ask is I
am using pdfbox as sort of a clean up tool on uploaded pdfs. If I could log
that a certain pdf could not be parsed that would be excellent!
Thanks
> PDF with multiple %%EOF only parses one page
> --------------------------------------------
>
> Key: PDFBOX-1037
> URL: https://issues.apache.org/jira/browse/PDFBOX-1037
> Project: PDFBox
> Issue Type: Bug
> Components: Parsing
> Affects Versions: 1.5.0
> Environment: Windows XP - Java SE 1.6
> Reporter: Abraham Farris
> Attachments: blankpageproblemmod.pdf, blankpageproblemmod.png
>
>
> Any type of page counts (getDocumentCatalog().getPages().getCount()) only
> return int 1. Doing a simple .load and .save will strip out all pages after
> the first.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira