When using nutch to crawl some sites, I want to index fetched contents
selectively only when the urls to these contents fit my filter, for other
urls I just want nutch to crawl them and parse them without index.
How can I achieve this? Which extension point should I extend?

Reply via email to