What's going on here? What version of tika should I use?

The version that comes with Solr/SolrCell.

Try sending various document types directly to the Solr Extracting Request Handler and see if it might be related to your parameters or specific document types. Maybe the document isn't what it seems or is a newer version.

-- Jack Krupansky

-----Original Message----- From: Mattmann, Chris A (388J)
Sent: Tuesday, December 11, 2012 4:31 PM
To: solr-user@lucene.apache.org
Subject: Re: Too many Tika errors

Hi there -- you may want to post this to the d...@tika.apache.org list.

Cheers,
Chris

On 12/11/12 11:08 AM, "eShard" <zim...@yahoo.com> wrote:

I'm running Solr 4.0 on Tomcat 7.0.8 and I'm running the solr/example
single
core as well with manifoldcf v1.1
I had everything working but then the crawler stops and I have Tika errors
in the solr log
I had tika 1.1 and that produces these errors:
org.apache.solr.common.SolrException:
org.apache.tika.exception.TikaException: Unexpected RuntimeException from
org.apache.tika.parser.microsoft.OfficeParser@17bc9c03

So, I upgraded to tika 1.2 and again everything seemed to be working (I
indexed 24,000 files) then I recrawled the repository and again it stops;
this time the tika errors are:
null:java.lang.RuntimeException: java.lang.NoClassDefFoundError:
org/mozilla/universalchardet/CharsetListener at
org.apache.solr.servlet.SolrDispatchFilter.sendError(SolrDispatchFilter.ja
va:456)

What's going on here? What version of tika should I use?



--
View this message in context:
http://lucene.472066.n3.nabble.com/Too-many-Tika-errors-tp4026126.html
Sent from the Solr - User mailing list archive at Nabble.com.

Reply via email to