[ https://issues.apache.org/jira/browse/PDFBOX-1037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13056478#comment-13056478 ]
Abraham Farris commented on PDFBOX-1037: ---------------------------------------- I made the modification to PDFParser.java and it returns the correct page count. You are correct in your theory. I appreciate your help. One last question - is there a public method to get a list of conflicts or if the PDF cannot be parsed 100%? The reason I ask is I am using pdfbox as sort of a clean up tool on uploaded pdfs. If I could log that a certain pdf could not be parsed that would be excellent! Thanks > PDF with multiple %%EOF only parses one page > -------------------------------------------- > > Key: PDFBOX-1037 > URL: https://issues.apache.org/jira/browse/PDFBOX-1037 > Project: PDFBox > Issue Type: Bug > Components: Parsing > Affects Versions: 1.5.0 > Environment: Windows XP - Java SE 1.6 > Reporter: Abraham Farris > Attachments: blankpageproblemmod.pdf, blankpageproblemmod.png > > > Any type of page counts (getDocumentCatalog().getPages().getCount()) only > return int 1. Doing a simple .load and .save will strip out all pages after > the first. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira