Tika Colleagues (Tika'ers, Tikis?), Is this the right answer:
Drop the relevant parsers from the tika.config file and make sure to point solr to this file in your solr request handler definition: <str name="tika.config">/my/path/to/tika.config</str>? I only have experience as a programmatic user of Tika and would use a DocumentSelector, but would the above work? -----Original Message----- From: keeblerh [mailto:[email protected]] Sent: Thursday, September 18, 2014 10:15 AM To: [email protected] Subject: Re: How to exclude a mimetype in tika? eShard wrote > Good afternoon, > I'm using solr 4.0 Final > I need movies "hidden" in zip files that need to be excluded from the > index. > I can't filter movies on the crawler because then I would have to exclude > all zip files. > I was told I can have tika skip the movies. > the details are escaping me at this point. > How do I exclude a file in the tika configuration? > I assume it's something I add in the update/extract handler but I'm not > sure. > > Thanks, I am having the same issue. I need to exlcude some mime types from the zip files and using SOLR 4.8. Did you ever get an answer to this? THanks. -- View this message in context: http://lucene.472066.n3.nabble.com/How-to-exclude-a-mimetype-in-tika-tp4127168p4159676.html Sent from the Solr - User mailing list archive at Nabble.com.
