Tika Colleagues (Tika'ers, Tikis?),

Is this the right answer:

Drop the relevant parsers from the tika.config file and make sure to point solr 
to this file in your solr request handler definition: <str 
name="tika.config">/my/path/to/tika.config</str>?

  I only have experience as a programmatic user of Tika and would use a 
DocumentSelector, but would the above work?

-----Original Message-----
From: keeblerh [mailto:[email protected]] 
Sent: Thursday, September 18, 2014 10:15 AM
To: [email protected]
Subject: Re: How to exclude a mimetype in tika?

eShard wrote
> Good afternoon,
> I'm using solr 4.0 Final
> I need movies "hidden" in zip files that need to be excluded from the
> index.
> I can't filter movies on the crawler because then I would have to exclude
> all zip files.
> I was told I can have tika skip the movies.
> the details are escaping me at this point.
> How do I exclude a file in the tika configuration?
> I assume it's something I add in the update/extract handler but I'm not
> sure.
> 
> Thanks,

I am having the same issue.  I need to exlcude some mime types from the zip
files and using SOLR 4.8.  Did you ever get an answer to this?  THanks.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/How-to-exclude-a-mimetype-in-tika-tp4127168p4159676.html
Sent from the Solr - User mailing list archive at Nabble.com.

Reply via email to