Hello Tim, Good find. I left some part somewhere with tika-core 1.22. With that being fixed, i can parse PDFs again.
Many thanks, Markus -----Original message----- > From:Tim Allison <[email protected]> > Sent: Monday 2nd March 2020 16:50 > To: [email protected] > Subject: Re: Unable to parse PDF due to NoSuchFieldError: HAS_XMP > > Y, thats a Tika field. Is there a chance that your tika-parsers version does > not match your tika-core version? Which versions of each are you using? > > If this is a problem with Tika, well have time to fix it before the 1.24 > release...coming soon... > > Cheers, > > Tim > > On Mon, Mar 2, 2020 at 9:44 AM Markus Jelsma <[email protected] > <mailto:[email protected]>> wrote: > Hello, > > I recently upgraded to the latest Tika and am no longer able to parse PDF, at > least the 6 files i just tested, due to: > > Caused by: java.lang.NoSuchFieldError: HAS_XMP > at > org.apache.tika.parser.pdf.PDMetadataExtractor.extract(PDMetadataExtractor.java:60) > at > org.apache.tika.parser.pdf.PDFParser.extractMetadata(PDFParser.java:227) > at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:147) > at > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280) > at > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280) > at > org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143) > > Trying to work-around the problem i upgraded PDFBox from 2.0.17 to 2.0.19, > but this did not help. > > There are no other PDFBox libraries anywhere on the classpath. > > Any suggestions? > > Many thanks, > Markus
