Hello,
I recently upgraded to the latest Tika and am no longer able to parse PDF, at
least the 6 files i just tested, due to:
Caused by: java.lang.NoSuchFieldError: HAS_XMP
at
org.apache.tika.parser.pdf.PDMetadataExtractor.extract(PDMetadataExtractor.java:60)
at
org.apache.tika.parser.pdf.PDFParser.extractMetadata(PDFParser.java:227)
at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:147)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
at
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143)
Trying to work-around the problem i upgraded PDFBox from 2.0.17 to 2.0.19, but
this did not help.
There are no other PDFBox libraries anywhere on the classpath.
Any suggestions?
Many thanks,
Markus