[2nd attempt]

I think that one is this:
https://issues.apache.org/jira/browse/TIKA-3112

Tilman

Am 11.07.2020 um 00:32 schrieb Jim Garrison:
Tika App started with

     java -jar tika-app-1.24.1.jar -g

Fails the same way no matter what I try to parse.

Checking here before I submit an issue...

Stack Trace:

Apache Tika was unable to parse the document
at D:\Users\jim\Data\Scans\Receipts\20200706-WinCo.pdf.

The full exception stack trace is included below:

org.apache.tika.exception.TikaException: Unexpected RuntimeException
from org.apache.tika.parser.pdf.PDFParser@7aff13b
     at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:293)
     at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
     at
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143)
     at
org.apache.tika.parser.ParserDecorator.parse(ParserDecorator.java:188)
     at org.apache.tika.parser.DigestingParser.parse(DigestingParser.java:84)
     at org.apache.tika.gui.TikaGUI.handleStream(TikaGUI.java:358)
     at org.apache.tika.gui.TikaGUI.openFile(TikaGUI.java:309)
     at
org.apache.tika.gui.ParsingTransferHandler.importFiles(ParsingTransferHandler.java:94)
     at
org.apache.tika.gui.ParsingTransferHandler.importData(ParsingTransferHandler.java:77)
     at javax.swing.TransferHandler.importData(Unknown Source)
     at javax.swing.TransferHandler$DropHandler.drop(Unknown Source)
     at java.awt.dnd.DropTarget.drop(Unknown Source)
     at javax.swing.TransferHandler$SwingDropTarget.drop(Unknown Source)
     at sun.awt.dnd.SunDropTargetContextPeer.processDropMessage(Unknown
Source)
     at
sun.awt.dnd.SunDropTargetContextPeer$EventDispatcher.dispatchDropEvent(Unknown
Source)
     at
sun.awt.dnd.SunDropTargetContextPeer$EventDispatcher.dispatchEvent(Unknown
Source)
     at sun.awt.dnd.SunDropTargetEvent.dispatch(Unknown Source)
     at java.awt.Component.dispatchEventImpl(Unknown Source)
     at java.awt.Container.dispatchEventImpl(Unknown Source)
     at java.awt.Component.dispatchEvent(Unknown Source)
     at java.awt.LightweightDispatcher.retargetMouseEvent(Unknown Source)
     at java.awt.LightweightDispatcher.processDropTargetEvent(Unknown Source)
     at java.awt.LightweightDispatcher.dispatchEvent(Unknown Source)
     at java.awt.Container.dispatchEventImpl(Unknown Source)
     at java.awt.Window.dispatchEventImpl(Unknown Source)
     at java.awt.Component.dispatchEvent(Unknown Source)
     at java.awt.EventQueue.dispatchEventImpl(Unknown Source)
     at java.awt.EventQueue.access$500(Unknown Source)
     at java.awt.EventQueue$3.run(Unknown Source)
     at java.awt.EventQueue$3.run(Unknown Source)
     at java.security.AccessController.doPrivileged(Native Method)
     at
java.security.ProtectionDomain$JavaSecurityAccessImpl.doIntersectionPrivilege(Unknown
Source)
     at
java.security.ProtectionDomain$JavaSecurityAccessImpl.doIntersectionPrivilege(Unknown
Source)
     at java.awt.EventQueue$4.run(Unknown Source)
     at java.awt.EventQueue$4.run(Unknown Source)
     at java.security.AccessController.doPrivileged(Native Method)
     at
java.security.ProtectionDomain$JavaSecurityAccessImpl.doIntersectionPrivilege(Unknown
Source)
     at java.awt.EventQueue.dispatchEvent(Unknown Source)
     at java.awt.EventDispatchThread.pumpOneEventForFilters(Unknown Source)
     at java.awt.EventDispatchThread.pumpEventsForFilter(Unknown Source)
     at java.awt.EventDispatchThread.pumpEventsForHierarchy(Unknown Source)
     at java.awt.EventDispatchThread.pumpEvents(Unknown Source)
     at java.awt.EventDispatchThread.pumpEvents(Unknown Source)
     at java.awt.EventDispatchThread.run(Unknown Source)
Caused by: java.lang.NullPointerException
     at
org.apache.tika.parser.pdf.AbstractPDF2XHTML.extractXMPXFA(AbstractPDF2XHTML.java:209)
     at
org.apache.tika.parser.pdf.AbstractPDF2XHTML.endDocument(AbstractPDF2XHTML.java:678)
     at
org.apache.pdfbox.text.PDFTextStripper.writeText(PDFTextStripper.java:267)
     at org.apache.tika.parser.pdf.PDF2XHTML.process(PDF2XHTML.java:96)
     at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:174)
     at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
     ... 43 more



Reply via email to