[
https://issues.apache.org/jira/browse/PDFBOX-2145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14034837#comment-14034837
]
Tilman Hausherr commented on PDFBOX-2145:
-----------------------------------------
org.apache.pdfbox.pdmodel.font.PDType1Font addMetric
Schwerwiegend: Something went wrong when reading the adobe afm files
java.lang.NullPointerException
at
org.apache.pdfbox.pdmodel.font.PDType1Font.addMetric(PDType1Font.java:140)
at
org.apache.pdfbox.pdmodel.font.PDType1Font.addMetric(PDType1Font.java:125)
at
org.apache.pdfbox.pdmodel.font.PDType1Font.getAdobeFontMetrics(PDType1Font.java:99)
at
org.apache.pdfbox.pdmodel.font.PDType1Font.<clinit>(PDType1Font.java:94)
at
org.apache.pdfbox.pdmodel.font.PDFontFactory.createFont(PDFontFactory.java:56)
at org.apache.pdfbox.pdmodel.PDResources.getFonts(PDResources.java:209)
at
org.apache.pdfbox.util.PDFStreamEngine.getFonts(PDFStreamEngine.java:535)
at
org.apache.pdfbox.util.operator.SetTextFont.process(SetTextFont.java:53)
at
org.apache.pdfbox.util.PDFStreamEngine.processOperator(PDFStreamEngine.java:505)
at
org.apache.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:256)
at
org.apache.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:223)
at
org.apache.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.java:206)
at
org.apache.pdfbox.util.PDFTextStripper.processPage(PDFTextStripper.java:417)
at
org.apache.pdfbox.util.PDFTextStripper.processPages(PDFTextStripper.java:341)
at
org.apache.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:301)
at
org.apache.pdfbox.tools.ExtractText.startExtraction(ExtractText.java:274)
at org.apache.pdfbox.tools.ExtractText.main(ExtractText.java:84)
at
org.apache.pdfbox.tools.TestExtractText.testEmbeddedPDFs(TestExtractText.java:43)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:606)
at junit.framework.TestCase.runTest(TestCase.java:176)
at junit.framework.TestCase.runBare(TestCase.java:141)
at junit.framework.TestResult$1.protect(TestResult.java:122)
at junit.framework.TestResult.runProtected(TestResult.java:142)
at junit.framework.TestResult.run(TestResult.java:125)
at junit.framework.TestCase.run(TestCase.java:129)
at junit.framework.TestSuite.runTest(TestSuite.java:255)
at junit.framework.TestSuite.run(TestSuite.java:250)
at
org.junit.internal.runners.JUnit38ClassRunner.run(JUnit38ClassRunner.java:84)
at
org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:264)
at
org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:153)
at
org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:124)
at
org.apache.maven.surefire.booter.ForkedBooter.invokeProviderInSameClassLoader(ForkedBooter.java:200)
at
org.apache.maven.surefire.booter.ForkedBooter.runSuitesInProcess(ForkedBooter.java:153)
at
org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:103)
> Clean up PDFStreamEngine and PDFTextStripper
> --------------------------------------------
>
> Key: PDFBOX-2145
> URL: https://issues.apache.org/jira/browse/PDFBOX-2145
> Project: PDFBox
> Issue Type: Improvement
> Components: Text extraction
> Affects Versions: 2.0.0
> Reporter: John Hewson
> Assignee: John Hewson
> Priority: Minor
>
> PDFStreamEngine and PDFTextStripper don't really meet our coding conventions
> and have several unused methods and deprecated code which can safely be
> removed.
> This should clear the way to fixing some bugs in PDFStreamEngine,
> PDFTextStripper and the various PDFont classes related to text encoding.
--
This message was sent by Atlassian JIRA
(v6.2#6252)