[ 
https://issues.apache.org/jira/browse/PDFBOX-1174?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13160849#comment-13160849
 ] 

Timo Boehme commented on PDFBOX-1174:
-------------------------------------

@Maruan
While the PDF spec can be used to validate a PDF, we have to cope with content 
after %%EOF if PDFBOX should be able to process all PDF documents which are 
readable by standard PDF readers (this is what most users/customers expect). 
Quite a large number of documents in our collection have this extra content - 
in most cases articles of publishers (it seems that the HTML download modules 
often add this content).

@Thomas
It seems to me that the garbage is created while updating the document. Thus to 
generate such a document one first has to create a PDF and change some content 
afterward. However I'm not sure since we do not use this software but only have 
to cope with the produced results :-(
                
> i have problem in  BaseParser.readInt
> -------------------------------------
>
>                 Key: PDFBOX-1174
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1174
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing, PDModel
>    Affects Versions: 1.6.0
>            Reporter: ahmad makram
>
> i can't load PDF to PDDocument.load( )
> it give me this exception
> java.io.IOException: Error: Expected an integer type, actual='Fatal'
>       at org.apache.pdfbox.pdfparser.BaseParser.readInt(BaseParser.java:1384)
>       at org.apache.pdfbox.pdfparser.PDFParser.parseObject(PDFParser.java:517)
>       at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:184)
>       at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1069)
>       at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1036)
>       at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1007)

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to