[ 
https://issues.apache.org/jira/browse/PDFBOX-2919?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15080433#comment-15080433
 ] 

Andreas Lehmkühler commented on PDFBOX-2919:
--------------------------------------------

One of the four fonts used in the file is broken (Tc5, cmti10). I tried 
skipping the unknown operand but then ran into another exception. It looks 
like the parser needs more work to support at least some corrupted fonts, 
either by skipping malformed data or by replacing such fonts.
FontForge also complains about this font.
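
To illustrate the "skipping malformed data" option, here is a minimal sketch 
of a lenient CFF DICT reader. It is not FontBox code: the class 
LenientDictSketch, its Entry type and the readDict method are hypothetical 
names, and it assumes the exception is triggered by a byte value that the 
DICT encoding reserves (22 to 27, 31 or 255). Instead of throwing, the sketch 
drops the entry that contains the bad byte and continues with the next one.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch, not FontBox code: a lenient reader for CFF DICT data.
class LenientDictSketch {

    /** One DICT entry: an operator plus the operands that preceded it. */
    static final class Entry {
        final int operator; // two-byte operators (12 x) are stored as 0x0C00 | x
        final List<Number> operands;

        Entry(int operator, List<Number> operands) {
            this.operator = operator;
            this.operands = operands;
        }
    }

    /** Reads all entries, dropping any entry that contains malformed data. */
    static List<Entry> readDict(byte[] data) {
        List<Entry> entries = new ArrayList<>();
        List<Number> operands = new ArrayList<>();
        boolean damaged = false; // true once the current entry saw bad data
        int i = 0;
        while (i < data.length) {
            int b0 = data[i++] & 0xFF;
            if (b0 <= 21) {
                // Operator; 12 is the escape byte for two-byte operators.
                int op = b0;
                if (b0 == 12) {
                    if (i >= data.length) {
                        break; // truncated two-byte operator
                    }
                    op = 0x0C00 | (data[i++] & 0xFF);
                }
                if (!damaged) {
                    entries.add(new Entry(op, operands));
                }
                operands = new ArrayList<>();
                damaged = false;
            } else if (b0 == 28 || b0 == 29) {
                // 2-byte (28) or 4-byte (29) big-endian signed integer.
                int size = (b0 == 28) ? 2 : 4;
                if (i + size > data.length) {
                    break; // truncated integer
                }
                int value = 0;
                for (int k = 0; k < size; k++) {
                    value = (value << 8) | (data[i++] & 0xFF);
                }
                operands.add(size == 2 ? (int) (short) value : value);
            } else if (b0 == 30) {
                // Real number: packed nibbles, terminated by nibble 0xF.
                StringBuilder sb = new StringBuilder();
                boolean done = false;
                while (!done && i < data.length) {
                    int b = data[i++] & 0xFF;
                    for (int n : new int[] {b >> 4, b & 0x0F}) {
                        if (n == 0xF) {
                            done = true;
                            break;
                        }
                        if (n <= 9) sb.append(n);
                        else if (n == 0xA) sb.append('.');
                        else if (n == 0xB) sb.append('E');
                        else if (n == 0xC) sb.append("E-");
                        else if (n == 0xE) sb.append('-');
                        else damaged = true; // nibble 0xD is reserved
                    }
                }
                try {
                    if (!done) {
                        throw new NumberFormatException("unterminated real");
                    }
                    operands.add(Double.parseDouble(sb.toString()));
                } catch (NumberFormatException e) {
                    damaged = true; // malformed real: drop the whole entry
                }
            } else if (b0 >= 32 && b0 <= 246) {
                operands.add(b0 - 139);
            } else if (b0 >= 247 && b0 <= 254) {
                if (i >= data.length) {
                    break; // truncated two-byte integer
                }
                int b1 = data[i++] & 0xFF;
                operands.add(b0 <= 250
                        ? (b0 - 247) * 256 + b1 + 108
                        : -(b0 - 251) * 256 - b1 - 108);
            } else {
                // Reserved byte (22..27, 31, 255): skip it instead of throwing.
                damaged = true;
            }
        }
        return entries;
    }

    public static void main(String[] args) {
        // Three entries; the second one contains the reserved byte 0x16.
        byte[] dict = {(byte) 0x8C, 0x00, 0x16, (byte) 0x8D, 0x01, (byte) 0xF7, 0x00, 0x02};
        for (Entry e : readDict(dict)) {
            System.out.println("operator " + e.operator + " operands " + e.operands);
        }
    }
}
```

With the sample bytes in main, the entries for operators 0 and 2 are kept and 
the entry for operator 1 is dropped. Dropping the whole entry rather than 
only the bad byte avoids pairing an operator with the wrong operands. As the 
second exception mentioned above suggests, a real font may need more than 
this, for example falling back to a replacement font.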

> org.apache.fontbox.cff.CFFParser.readEntry sometimes throws 
> java.lang.IllegalArgumentException
> ----------------------------------------------------------------------------------------------
>
>                 Key: PDFBOX-2919
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-2919
>             Project: PDFBox
>          Issue Type: Bug
>          Components: FontBox
>    Affects Versions: 2.0.0
>         Environment: Mac OS 10.10.4
> Java(TM) SE Runtime Environment (build 1.8.0_31-b13)
>            Reporter: V
>              Labels: pdfbox
>         Attachments: p5.pdf
>
>
> It happened on one of my PDFs. Traceback:
> {quote}
> Processing p5.pdf...
> Exception in thread "main" java.lang.reflect.InvocationTargetException
>       at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>       at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>       at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>       at java.lang.reflect.Method.invoke(Method.java:483)
>       at org.eclipse.jdt.internal.jarinjarloader.JarRsrcLoader.main(JarRsrcLoader.java:58)
> Caused by: java.lang.IllegalArgumentException
>       at org.apache.fontbox.cff.CFFParser.readEntry(CFFParser.java:209)
>       at org.apache.fontbox.cff.CFFParser.readDictData(CFFParser.java:177)
>       at org.apache.fontbox.cff.CFFParser.parseType1Dicts(CFFParser.java:608)
>       at org.apache.fontbox.cff.CFFParser.parseFont(CFFParser.java:466)
>       at org.apache.fontbox.cff.CFFParser.parse(CFFParser.java:115)
>       at org.apache.pdfbox.pdmodel.font.PDType1CFont.<init>(PDType1CFont.java:99)
>       at org.apache.pdfbox.pdmodel.font.PDFontFactory.createFont(PDFontFactory.java:60)
>       at org.apache.pdfbox.pdmodel.PDResources.getFont(PDResources.java:121)
>       at org.apache.pdfbox.contentstream.operator.text.SetFontAndSize.process(SetFontAndSize.java:50)
>       at org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(PDFStreamEngine.java:798)
>       at org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:460)
>       at org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:437)
>       at org.apache.pdfbox.contentstream.PDFStreamEngine.processPage(PDFStreamEngine.java:148)
>       at org.apache.pdfbox.text.PDFTextStreamEngine.processPage(PDFTextStreamEngine.java:117)
>       at org.apache.pdfbox.text.PDFTextStripper.processPage(PDFTextStripper.java:367)
>       at org.apache.pdfbox.text.PDFTextStripper.processPages(PDFTextStripper.java:303)
>       at org.apache.pdfbox.text.PDFTextStripper.writeText(PDFTextStripper.java:248)
> {quote}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
