Hello everyone,

I'm wondering about Solr/Nutch that uses Tika.
As far as I found out, I'm correct here with my need:

I'd like to index a bunch of webs (like 100 or so).
But *only* index a webpage if it contains a certain word (or better: a certain regular expression).
Is it possible via a custom parser?
And where and how do I put/deploy the parser?

Thank you in advance
Bye, Chris

Reply via email to