Perfect. Thank you!

-----Original Message-----
From: Andreas Lehmkuehler [mailto:andr...@lehmi.de]
Sent: Thursday, September 15, 2016 8:31 AM
To: dev@pdfbox.apache.org
Subject: Re: PDFBox 2.0.3 TIKA comparison

On 15.09.2016 at 13:52, Allison, Timothy B. wrote:
>> The one apparent major new exception for PDF files was apparently fixed
>> before 2.0.3. So, please ignore that one!
>
> Wait...if possible, please confirm that you did fix this recently (within the
> last week or two). I ran pdfbox app's (2.0.3) on a handful of triggering
> files and didn't get the exception...however, it is possible that
> multithreading might trigger this exception.

I fixed that 2 days ago; it's part of the RC.

BR
Andreas

>
> java.lang.NullPointerException
>     at org.apache.pdfbox.pdmodel.font.encoding.Encoding.overwrite(Encoding.java:118)
>     at org.apache.pdfbox.pdmodel.font.encoding.DictionaryEncoding.applyDifferences(DictionaryEncoding.java:151)
>     at org.apache.pdfbox.pdmodel.font.encoding.DictionaryEncoding.<init>(DictionaryEncoding.java:128)
>     at org.apache.pdfbox.pdmodel.font.PDSimpleFont.readEncoding(PDSimpleFont.java:129)
>     at org.apache.pdfbox.pdmodel.font.PDTrueTypeFont.<init>(PDTrueTypeFont.java:209)
>     at org.apache.pdfbox.pdmodel.font.PDFontFactory.createFont(PDFontFactory.java:75)
>     at org.apache.pdfbox.pdmodel.PDResources.getFont(PDResources.java:143)
>     at org.apache.pdfbox.contentstream.operator.text.SetFontAndSize.process(SetFontAndSize.java:60)
>     at org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(PDFStreamEngine.java:815)
>     at org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:472)
>     at org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:446)
>     at org.apache.pdfbox.contentstream.PDFStreamEngine.processPage(PDFStreamEngine.java:149)
>     at org.apache.pdfbox.text.LegacyPDFStreamEngine.processPage(LegacyPDFStreamEngine.java:139)
>     at org.apache.pdfbox.text.PDFTextStripper.processPage(PDFTextStripper.java:391)
>     at org.apache.tika.parser.pdf.PDF2XHTML.processPage(PDF2XHTML.java:143)
>     at org.apache.pdfbox.text.PDFTextStripper.processPages(PDFTextStripper.java:319)
>     at org.apache.pdfbox.text.PDFTextStripper.writeText(PDFTextStripper.java:266)
>     at org.apache.tika.parser.pdf.PDF2XHTML.process(PDF2XHTML.java:111)
>     at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:146)
>     at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
>     at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
>     at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
>     at org.apache.tika.parser.ParserDecorator.parse(ParserDecorator.java:188)
>     at org.apache.tika.parser.DigestingParser.parse(DigestingParser.java:74)
>     at org.apache.tika.parser.RecursiveParserWrapper.parse(RecursiveParserWrapper.java:158)
>     at org.apache.tika.batch.FileResourceConsumer.parse(FileResourceConsumer.java:407)
>     at org.apache.tika.batch.fs.RecursiveParserWrapperFSConsumer.processFileResource(RecursiveParserWrapperFSConsumer.java:104)
>     at org.apache.tika.batch.FileResourceConsumer._processFileResource(FileResourceConsumer.java:182)
>     at org.apache.tika.batch.FileResourceConsumer.call(FileResourceConsumer.java:115)
>     at org.apache.tika.batch.FileResourceConsumer.call(FileResourceConsumer.java:50)
>     at java.util.concurrent.FutureTask.run(FutureTask.java:266)
>     at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
>     at java.util.concurrent.FutureTask.run(FutureTask.java:266)
>     at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
>     at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
>     at java.lang.Thread.run(Thread.java:745)
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
> For additional commands, e-mail: dev-h...@pdfbox.apache.org