[ https://issues.apache.org/jira/browse/TIKA-2789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jawahar closed TIKA-2789.
-------------------------
Resolution: Workaround
> Apache Tika - java.lang.NoClassDefFoundError
> --------------------------------------------
>
> Key: TIKA-2789
> URL: https://issues.apache.org/jira/browse/TIKA-2789
> Project: Tika
> Issue Type: Bug
> Components: parser
> Affects Versions: 1.19.1
> Reporter: Jawahar
> Priority: Minor
>
> I am running a Tomcat-based server that extracts attachment text using Apache
> Tika (tika-app-1.19.1.jar, https://www.apache.org/dyn/closer.cgi/tika/tika-app-1.19.1.jar).
> When extracting some .docx files (not all of them), Tika throws the following
> exception. The JDK version is 1.8.0_162.
> java.lang.NoClassDefFoundError: Could not initialize class org.openxmlformats.schemas.wordprocessingml.x2006.main.DocumentDocument
>     at org.openxmlformats.schemas.wordprocessingml.x2006.main.DocumentDocument$Factory.parse(Unknown Source)
>     at org.apache.poi.xwpf.usermodel.XWPFDocument.onDocumentRead(XWPFDocument.java:178)
>     at org.apache.poi.ooxml.POIXMLDocument.load(POIXMLDocument.java:184)
>     at org.apache.poi.xwpf.usermodel.XWPFDocument.<init>(XWPFDocument.java:138)
>     at org.apache.poi.xwpf.extractor.XWPFWordExtractor.<init>(XWPFWordExtractor.java:60)
>     at org.apache.poi.ooxml.extractor.ExtractorFactory.createExtractor(ExtractorFactory.java:228)
>     at org.apache.tika.parser.microsoft.ooxml.OOXMLExtractorFactory.parse(OOXMLExtractorFactory.java:116)
>     at org.apache.tika.parser.microsoft.ooxml.OOXMLParser.parse(OOXMLParser.java:110)
>     at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
>     at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
>     at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143)
>     at org.apache.tika.Tika.parseToString(Tika.java:527)
>     at org.apache.tika.Tika.parseToString(Tika.java:602)
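
Note on the error message: on the JVM, "Could not initialize class X" generally
means that the static initializer of X already failed on an earlier attempt.
That first failure is reported once as ExceptionInInitializerError; later uses
of the class report only NoClassDefFoundError. If that is what happened here,
the root cause would be an earlier entry in the server logs, not the trace
quoted above. The sketch below only reproduces this general JVM behaviour.
InitFailureDemo and Broken are hypothetical names chosen for illustration and
say nothing about Tika or POI internals.

```java
public class InitFailureDemo {

    // Hypothetical class whose static initializer always fails.
    static class Broken {
        // Not a compile-time constant, so reading it triggers class initialization.
        static final int VALUE = compute();

        static int compute() {
            throw new IllegalStateException("simulated static-init failure");
        }
    }

    // Touches Broken and returns the name of whatever error the JVM raises.
    static String touch() {
        try {
            return "value=" + Broken.VALUE;
        } catch (Throwable t) {
            return t.getClass().getName();
        }
    }

    public static void main(String[] args) {
        // First use: the real failure, wrapped in ExceptionInInitializerError.
        System.out.println(touch());
        // Later uses: only NoClassDefFoundError ("Could not initialize class ...").
        System.out.println(touch());
    }
}
```

When diagnosing a trace like the one in this issue, it may therefore help to
search the logs from the start of the JVM process for the first
ExceptionInInitializerError that mentions the same class.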