Hello everyone, I'm wondering about Solr/Nutch that uses Tika. As far as I found out, I'm correct here with my need:
I'd like to index a bunch of webs (like 100 or so).But *only* index a webpage if it contains a certain word (or better: a certain regular expression).
Is it possible via a custom parser? And where and how do I put/deploy the parser? Thank you in advance Bye, Chris
