RE: Unable to parse PDF due to NoSuchFieldError: HAS_XMP

Markus Jelsma Mon, 02 Mar 2020 08:07:54 -0800

Hello Tim,

Good find. I left some part somewhere with tika-core 1.22. With that being 
fixed, i can parse PDFs again.


Many thanks,
Markus


 
-----Original message-----
> From:Tim Allison <[email protected]>
> Sent: Monday 2nd March 2020 16:50
> To: [email protected]
> Subject: Re: Unable to parse PDF due to NoSuchFieldError: HAS_XMP
> 
> Y, thats a Tika field.  Is there a chance that your tika-parsers version does 
> not match your tika-core version?  Which versions of each are you using?  
> 
> If this is a problem with Tika, well have time to fix it before the 1.24 
> release...coming soon... 
> 
> Cheers, 
> 
>             Tim
> 
> On Mon, Mar 2, 2020 at 9:44 AM Markus Jelsma <[email protected] 
> <mailto:[email protected]>> wrote:
> Hello,
 
> 
 
> I recently upgraded to the latest Tika and am no longer able to parse PDF, at 
> least the 6 files i just tested, due to:
 
> 
 
> Caused by: java.lang.NoSuchFieldError: HAS_XMP
 
>         at 
> org.apache.tika.parser.pdf.PDMetadataExtractor.extract(PDMetadataExtractor.java:60)
 
>         at 
> org.apache.tika.parser.pdf.PDFParser.extractMetadata(PDFParser.java:227)
 
>         at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:147)
 
>         at 
> org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
 
>         at 
> org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
 
>         at 
> org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143)
 
> 
 
> Trying to work-around the problem i upgraded PDFBox from 2.0.17 to 2.0.19, 
> but this did not help.
 
> 
 
> There are no other PDFBox libraries anywhere on the classpath.
 
> 
 
> Any suggestions?
 
> 
 
> Many thanks,
 
> Markus

RE: Unable to parse PDF due to NoSuchFieldError: HAS_XMP

Reply via email to