[ https://issues.apache.org/jira/browse/SOLR-7430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Eric Pugh resolved SOLR-7430.
-----------------------------
Resolution: Won't Fix
In Solr 10 we are leveraging either Tika Server (running in its own separate
server process) or maybe Tika Pipes (again, running in a separate JVM).
Please revalidate your issue against Solr 10 with one of those options. If
it is still a need, we are happy to work with you on a fix using the new
approach for Tika.
> Encrypted pptx/xlsx causes a ClassNotFoundException
> ---------------------------------------------------
>
> Key: SOLR-7430
> URL: https://issues.apache.org/jira/browse/SOLR-7430
> Project: Solr
> Issue Type: Bug
> Components: contrib - Solr Cell (Tika extraction)
> Affects Versions: 5.1
> Environment: Windows 7 (64bit) jre 1.8.0_40-b26 (64 bit)
> Reporter: Jon Scharff
> Priority: Major
>
> When indexing an encrypted pptx or xlsx file via the command
> solr-home>java -Dc=<core> -Dauto=yes -Ddata=files -jar example\exampledocs\post.jar file.pptx
> on a server started with
> solr-home>bin\solr start
> a ClassNotFoundException results instead of an EncryptedDocumentException.
> It appears that POI is using reflection to get the proper encryption handler,
> but the necessary jar files are not supplied by Jetty's ClassLoader.
> A portion of the resulting error trace is below.
> org.apache.solr.common.SolrException: org.apache.tika.exception.TikaException: TIKA-198: Illegal IOException from org.apache.tika.parser.microsoft.OfficeParser@2e973e0f
>     at org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(ExtractingDocumentLoader.java:227)
>     ...
> Caused by: org.apache.tika.exception.TikaException: TIKA-198: Illegal IOException from org.apache.tika.parser.microsoft.OfficeParser@2e973e0f
>     at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:262)
>     at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:256)
>     at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
>     at org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(ExtractingDocumentLoader.java:221)
>     ... 31 more
> Caused by: java.io.IOException: java.lang.ClassNotFoundException: org.apache.poi.poifs.crypt.agile.AgileEncryptionInfoBuilder
>     at org.apache.poi.poifs.crypt.EncryptionInfo.<init>(EncryptionInfo.java:69)
>     at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:228)
>     at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:172)
>     at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:256)
>     ... 34 more
> Caused by: java.lang.ClassNotFoundException: org.apache.poi.poifs.crypt.agile.AgileEncryptionInfoBuilder
>     at java.net.URLClassLoader.findClass(URLClassLoader.java:381)
>     at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
>     at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
>     at org.eclipse.jetty.webapp.WebAppClassLoader.loadClass(WebAppClassLoader.java:430)
>     at org.eclipse.jetty.webapp.WebAppClassLoader.loadClass(WebAppClassLoader.java:383)
>     at org.apache.poi.poifs.crypt.EncryptionInfo.getBuilder(EncryptionInfo.java:150)
>     at org.apache.poi.poifs.crypt.EncryptionInfo.<init>(EncryptionInfo.java:67)
>     ... 37 more
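
For readers unfamiliar with the failure mode in the quoted report, here is a
minimal sketch of a reflective class lookup that fails and gets wrapped in an
IOException, which matches the shape of the trace above. It is an illustration
only, not POI's actual code: the class ReflectiveLookupDemo and the method
loadBuilder are hypothetical names, and the sketch assumes the builder is
resolved by class name through a given ClassLoader.

```java
import java.io.IOException;

public class ReflectiveLookupDemo {

    // Hypothetical helper (not a POI API): resolve a class by name through
    // the given loader and instantiate it. A failed lookup is wrapped in an
    // IOException, mirroring the "IOException: ClassNotFoundException" shape
    // seen in the trace.
    static Object loadBuilder(String className, ClassLoader loader) throws IOException {
        try {
            Class<?> cls = Class.forName(className, true, loader);
            return cls.getDeclaredConstructor().newInstance();
        } catch (ReflectiveOperationException e) {
            // ClassNotFoundException is a ReflectiveOperationException.
            throw new IOException(e);
        }
    }

    public static void main(String[] args) {
        ClassLoader loader = ReflectiveLookupDemo.class.getClassLoader();
        try {
            // Class name taken from the trace; the lookup fails whenever the
            // jar that contains it is not visible to this loader.
            loadBuilder("org.apache.poi.poifs.crypt.agile.AgileEncryptionInfoBuilder", loader);
            System.out.println("builder class found");
        } catch (IOException e) {
            System.out.println("lookup failed: " + e.getCause());
        }
    }
}
```

The point of the sketch is that the caller only sees an IOException, so a
missing jar on the web application's classpath surfaces as a parse error
rather than as a clear "document is encrypted" signal.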
--
This message was sent by Atlassian Jira
(v8.20.10#820010)