[ https://issues.apache.org/jira/browse/PDFBOX-3591?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15703050#comment-15703050 ]
Andreas Lehmkühler commented on PDFBOX-3591:
--------------------------------------------
At first glance this looked like an easy fix: extend the lenient mode to
convert the float to an int, and that's it. But the referenced font has the
object id {{10 0}} instead of {{1 0}}. So should we simply drop the decimal
point (reading {{1.0}} as {{10}})? Or is it better to omit the whole
dictionary entry?
It doesn't matter for the given PDF, as the font isn't used: the text is
invisible (rendering mode 3).
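
To make the two repair options concrete, here is a minimal standalone sketch.
It is not PDFBox code: the class {{LenientObjectNumber}} and both method names
are made up for illustration, and it only shows how the same malformed token
yields a different object number depending on which option is chosen.

```java
import java.math.BigDecimal;

// Hypothetical helper, not part of PDFBox: compares two ways of repairing
// a float token found where an integer object number was expected.
final class LenientObjectNumber {
    private LenientObjectNumber() {}

    // Option A: treat the token as a number and keep only its integer part.
    static long truncate(String token) {
        return new BigDecimal(token).longValue();
    }

    // Option B: treat the point as a stray character and delete it.
    static long dropPoint(String token) {
        return Long.parseLong(token.replace(".", ""));
    }

    public static void main(String[] args) {
        String token = "1.0"; // the token reported in the exception
        System.out.println("truncate:  " + truncate(token));
        System.out.println("dropPoint: " + dropPoint(token));
    }
}
```

For the token {{1.0}}, truncating gives object number 1, while dropping the
point gives 10, which is the id the font object actually has in this file.
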
> IOException "expected number, actual=COSFloat{1.0}" on a valid PDF
> ------------------------------------------------------------------
>
> Key: PDFBOX-3591
> URL: https://issues.apache.org/jira/browse/PDFBOX-3591
> Project: PDFBox
> Issue Type: Bug
> Components: Parsing
> Affects Versions: 2.0.3
> Environment: Windows 7 x64, JVM 1.8.0_101
> Reporter: Seva Alekseyev
> Attachments: Vagueness and the Rule of Law- Reconsidering Installment Land Con.pdf
>
>
> On the attached PDF document, which opens fine with Adobe Reader, the
> PDDocument.load() method throws the following:
> java.io.IOException: expected number, actual=COSFloat{1.0} at offset 577113
> at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryValue:162
> at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryNameValuePair:274
> at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionary:207
> at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject:854
> at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryValue:150
> at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryNameValuePair:274
> at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionary:207
> at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject:854
> at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryValue:150
> at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryNameValuePair:274
> at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionary:207
> at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject:854
> at org.apache.pdfbox.pdfparser.COSParser.parseFileObject:757
> at org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically:726
> at org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically:657
> at org.apache.pdfbox.pdfparser.COSParser.parseDictObjects:617
> at org.apache.pdfbox.pdfparser.PDFParser.initialParse:215
> at org.apache.pdfbox.pdfparser.PDFParser.parse:249
> at org.apache.pdfbox.pdmodel.PDDocument.load:891
> at org.apache.pdfbox.pdmodel.PDDocument.load:831
> at org.apache.tika.parser.pdf.PDFParser.parse:129