[
https://issues.apache.org/jira/browse/CONNECTORS-1294?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15220084#comment-15220084
]
Karl Wright commented on CONNECTORS-1294:
-----------------------------------------
Hi Jack,
This is indeed occurring in Solr (that's the 500 error). You can make Solr
ignore Solr Cell errors with a configuration setting, though -- it will mean
that Solr just skips documents with Tika problems.
> 7Zip files are being detected as ucar/nc2/NetcdfFile
> ----------------------------------------------------
>
> Key: CONNECTORS-1294
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1294
> Project: ManifoldCF
> Issue Type: Bug
> Components: Active Directory authority, Tika extractor
> Affects Versions: ManifoldCF 2.3
> Environment: Windows Server 2008 R2, ManifoldCF 2.3, Windows Shares
> Connector, DFS, SOLR 5.4, Isilon OneFS 7.2.0.2
> Reporter: Jack Faust
> Priority: Minor
> Labels: newbie, windows
>
> When running crawl across Windows Shares throgh DFS hosted on an Isilon
> (OneFS 7.2.0.2) this error frequently ocurrs:
> WARN 2016-03-31 13:51:26,759 (Worker thread '14') - Solr exception during
> indexing file://///<Path>/FiberSim_demo.7z (500): Error from server at
> http://localhost:8984/solr/CORE: java.lang.NoClassDefFoundError:
> ucar/nc2/NetcdfFile
> org.apache.solr.client.solrj.impl.HttpSolrClient$RemoteSolrException: Error
> from server at http://localhost:8984/solr/CORE:
> java.lang.NoClassDefFoundError: ucar/nc2/NetcdfFile
> I hope this is a ManifoldCF exception rather than Solr, if it isn;t my humble
> apologies - a bit new to all of this.
> File is available upon request - can't see a way of attaching it here but
> it's only 8Mb.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)