Re: How to exclude a mimetype in tika?

2014-09-20 Thread Jorge Luis Betancourt Gonzalez
Which crawler are you using? On Sep 18, 2014, at 10:14 AM, keeblerh keebl...@yahoo.com wrote: eShard wrote Good afternoon, I'm using solr 4.0 Final I need movies hidden in zip files that need to be excluded from the index. I can't filter movies on the crawler because then I would have to

RE: How to exclude a mimetype in tika?

2014-09-19 Thread Allison, Timothy B.
One option (I think--answer is untested!) is to remove the parsers you don't want from the tika config file. Make sure to specify the tika.config file parameter in your ExtractingRequestHandler in Solr (https://wiki.apache.org/solr/ExtractingRequestHandler). In response to this question, I

Re: How to exclude a mimetype in tika?

2014-09-18 Thread keeblerh
eShard wrote Good afternoon, I'm using solr 4.0 Final I need movies hidden in zip files that need to be excluded from the index. I can't filter movies on the crawler because then I would have to exclude all zip files. I was told I can have tika skip the movies. the details are escaping me

How to exclude a mimetype in tika?

2014-03-26 Thread eShard
Good afternoon, I'm using solr 4.0 Final I need movies hidden in zip files that need to be excluded from the index. I can't filter movies on the crawler because then I would have to exclude all zip files. I was told I can have tika skip the movies. the details are escaping me at this point. How do