[ 
https://issues.apache.org/jira/browse/PDFBOX-1037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13056478#comment-13056478
 ] 

Abraham Farris commented on PDFBOX-1037:
----------------------------------------

I made the modification to PDFParser.java and it returns the correct page 
count.  You are correct in your theory.  

I appreciate your help.  One last question - is there a public method to get a 
list of conflicts or if the PDF cannot be parsed 100%?  The reason I ask is I 
am using pdfbox as sort of a clean up tool on uploaded pdfs.  If I could log 
that a certain pdf could not be parsed that would be excellent!

Thanks

> PDF with multiple %%EOF only parses one page
> --------------------------------------------
>
>                 Key: PDFBOX-1037
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1037
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>    Affects Versions: 1.5.0
>         Environment: Windows XP - Java SE 1.6
>            Reporter: Abraham Farris
>         Attachments: blankpageproblemmod.pdf, blankpageproblemmod.png
>
>
> Any type of page counts (getDocumentCatalog().getPages().getCount()) only 
> return int 1.  Doing a simple .load and .save will strip out all pages after 
> the first.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to