[ 
https://issues.apache.org/jira/browse/CONNECTORS-1294?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15220084#comment-15220084
 ] 

Karl Wright commented on CONNECTORS-1294:
-----------------------------------------

Hi Jack,

This is indeed occurring in Solr (that's the 500 error).  You can make Solr 
ignore Solr Cell errors with a configuration setting, though -- it will mean 
that Solr just skips documents with Tika problems.


> 7Zip files are being detected as ucar/nc2/NetcdfFile
> ----------------------------------------------------
>
>                 Key: CONNECTORS-1294
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1294
>             Project: ManifoldCF
>          Issue Type: Bug
>          Components: Active Directory authority, Tika extractor
>    Affects Versions: ManifoldCF 2.3
>         Environment: Windows Server 2008 R2, ManifoldCF 2.3, Windows Shares 
> Connector, DFS, SOLR  5.4, Isilon OneFS 7.2.0.2
>            Reporter: Jack Faust
>            Priority: Minor
>              Labels: newbie, windows
>
> When running crawl across Windows Shares throgh DFS hosted on an Isilon 
> (OneFS 7.2.0.2) this error frequently ocurrs:
> WARN 2016-03-31 13:51:26,759 (Worker thread '14') - Solr exception during 
> indexing file://///<Path>/FiberSim_demo.7z (500): Error from server at 
> http://localhost:8984/solr/CORE: java.lang.NoClassDefFoundError: 
> ucar/nc2/NetcdfFile
> org.apache.solr.client.solrj.impl.HttpSolrClient$RemoteSolrException: Error 
> from server at http://localhost:8984/solr/CORE: 
> java.lang.NoClassDefFoundError: ucar/nc2/NetcdfFile
> I hope this is a ManifoldCF exception rather than Solr, if it isn;t my humble 
> apologies - a bit new to all of this.
> File is available upon request - can't see a way of attaching it here but 
> it's only 8Mb.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to