[ 
https://issues.apache.org/jira/browse/PDFBOX-3866?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16084026#comment-16084026
 ] 

Maruan Sahyoun commented on PDFBOX-3866:
----------------------------------------

Could it be that while transferring/reading the file there was an error or the 
chunks have not been read completely?

If your fine I'd close this as won't fix as 
a) Adobe Reader can't read it
b) even we find a way to skip the corrupt section likely some font information 
will be lost

Let me know what you think.

> Pdf not read by PdfBox 
> -----------------------
>
>                 Key: PDFBOX-3866
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-3866
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>    Affects Versions: 2.0.6
>            Reporter: Quentin Caillard
>
> Evince can read this PDF without problems.
> The previous logs and the stack trace :
> 09:38:20.135 [main] DEBUG org.apache.pdfbox.pdfparser.COSParser - Stop 
> checking xref offsets as at least one (10 0 R) couldn't be dereferenced
> 09:38:20.177 [main] DEBUG org.apache.pdfbox.pdfparser.COSParser - Replaced 
> read xref table with the results of a brute force search
> 09:38:20.180 [main] WARN org.apache.pdfbox.pdfparser.COSParser - The end of 
> the stream doesn't point to the correct offset, using workaround to read the 
> stream, stream start position: 2932, length: 891, expected end position: 3823
> 09:38:20.182 [main] WARN org.apache.pdfbox.pdfparser.COSParser - stream ends 
> with 'endobj' instead of 'endstream' at offset 4321
> 09:38:20.191 [main] ERROR 
> com.ariadnext.pki.seal.scheduler.server.utils.PdfUtils - Invalid PDF document
> java.io.IOException: Error: Expected a long type at offset 4096, instead got 
> 'omanPS-BoldMT'
>       at org.apache.pdfbox.pdfparser.BaseParser.readLong(BaseParser.java:1384)
>       at 
> org.apache.pdfbox.pdfparser.BaseParser.readObjectNumber(BaseParser.java:1312)
>       at 
> org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:760)
>       at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:742)
>       at 
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:673)
>       at 
> org.apache.pdfbox.pdfparser.COSParser.parseDictObjects(COSParser.java:633)
>       at 
> org.apache.pdfbox.pdfparser.PDFParser.initialParse(PDFParser.java:241)
>       at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:276)
>       at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1213)
>       at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1190)
>       at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1171)
>       at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1154)
>       ....
> Caused by: java.lang.NumberFormatException: For input string: "omanPS-BoldMT"
>       at 
> java.lang.NumberFormatException.forInputString(NumberFormatException.java:65)
>       at java.lang.Long.parseLong(Long.java:589)
>       at java.lang.Long.parseLong(Long.java:631)
>       at org.apache.pdfbox.pdfparser.BaseParser.readLong(BaseParser.java:1379)



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to