[ 
https://issues.apache.org/jira/browse/PDFBOX-3866?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16083952#comment-16083952
 ] 

Quentin Caillard commented on PDFBOX-3866:
------------------------------------------

Indeed, the PDF seems to be corrupted. Some readers manage to read it, 
others don't. I don't know whether PDFBox should try to support this case.

Thanks for your time and your explanations.

> Pdf not read by PdfBox 
> -----------------------
>
>                 Key: PDFBOX-3866
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-3866
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>    Affects Versions: 2.0.6
>            Reporter: Quentin Caillard
>
> Evince can read this PDF without problems.
> The preceding log output and the stack trace:
> 09:38:20.135 [main] DEBUG org.apache.pdfbox.pdfparser.COSParser - Stop 
> checking xref offsets as at least one (10 0 R) couldn't be dereferenced
> 09:38:20.177 [main] DEBUG org.apache.pdfbox.pdfparser.COSParser - Replaced 
> read xref table with the results of a brute force search
> 09:38:20.180 [main] WARN org.apache.pdfbox.pdfparser.COSParser - The end of 
> the stream doesn't point to the correct offset, using workaround to read the 
> stream, stream start position: 2932, length: 891, expected end position: 3823
> 09:38:20.182 [main] WARN org.apache.pdfbox.pdfparser.COSParser - stream ends 
> with 'endobj' instead of 'endstream' at offset 4321
> 09:38:20.191 [main] ERROR 
> com.ariadnext.pki.seal.scheduler.server.utils.PdfUtils - Invalid PDF document
> java.io.IOException: Error: Expected a long type at offset 4096, instead got 
> 'omanPS-BoldMT'
>       at org.apache.pdfbox.pdfparser.BaseParser.readLong(BaseParser.java:1384)
>       at 
> org.apache.pdfbox.pdfparser.BaseParser.readObjectNumber(BaseParser.java:1312)
>       at 
> org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:760)
>       at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:742)
>       at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:673)
>       at 
> org.apache.pdfbox.pdfparser.COSParser.parseDictObjects(COSParser.java:633)
>       at 
> org.apache.pdfbox.pdfparser.PDFParser.initialParse(PDFParser.java:241)
>       at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:276)
>       at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1213)
>       at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1190)
>       at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1171)
>       at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1154)
>       ....
> Caused by: java.lang.NumberFormatException: For input string: "omanPS-BoldMT"
>       at 
> java.lang.NumberFormatException.forInputString(NumberFormatException.java:65)
>       at java.lang.Long.parseLong(Long.java:589)
>       at java.lang.Long.parseLong(Long.java:631)
>       at org.apache.pdfbox.pdfparser.BaseParser.readLong(BaseParser.java:1379)
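
The exception chain in the trace above (an IOException caused by a 
NumberFormatException) can be reproduced in isolation with the JDK alone. 
The sketch below is only an illustration: the class ReadLongSketch and its 
readLong method are hypothetical and merely mimic the message format seen 
in the log; they are not PDFBox's implementation. It assumes the parser 
was pointed at an offset that holds text such as 'omanPS-BoldMT' (which 
looks like the tail of a font name) where an object number was expected.

```java
import java.io.IOException;

public class ReadLongSketch {

    // Hypothetical stand-in for a parser step that expects a number at a
    // given offset. Not PDFBox code; it only mirrors the logged message.
    static long readLong(String token, long offset) throws IOException {
        try {
            return Long.parseLong(token);
        } catch (NumberFormatException e) {
            // Wrap the low-level failure so callers see an I/O level error
            // while the original cause stays attached.
            throw new IOException("Error: Expected a long type at offset "
                    + offset + ", instead got '" + token + "'", e);
        }
    }

    public static void main(String[] args) {
        try {
            readLong("omanPS-BoldMT", 4096);
        } catch (IOException e) {
            System.out.println(e.getMessage());
            System.out.println("Caused by: " + e.getCause());
        }
    }
}
```

A wrong xref offset (here presumably one recovered by the brute force 
search mentioned in the DEBUG lines) would produce exactly this kind of 
failure: the bytes at the offset are valid PDF content, just not the start 
of an object.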


