When using Nutch to crawl some sites, I want to index fetched content selectively: a page should be indexed only when its URL matches my filter. For all other URLs, I just want Nutch to crawl and parse them without indexing them. How can I achieve this? Which extension point should I extend?
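To make the requirement concrete, here is a minimal sketch of the check I have in mind. It is plain Java with no Nutch dependency: the class name `SelectiveIndexPolicy`, the method `shouldIndex`, and the example patterns are all hypothetical names of my own, and where this check would be hooked into Nutch is exactly what I am asking.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical helper, not part of Nutch: decides whether a fetched URL
// should be indexed. Every URL would still be crawled and parsed.
class SelectiveIndexPolicy {
    private final List<Pattern> indexable = new ArrayList<>();

    // Each regex describes a family of URLs whose content should be indexed.
    SelectiveIndexPolicy(List<String> regexes) {
        for (String regex : regexes) {
            indexable.add(Pattern.compile(regex));
        }
    }

    // Returns true if the whole URL matches at least one configured pattern.
    boolean shouldIndex(String url) {
        for (Pattern p : indexable) {
            if (p.matcher(url).matches()) {
                return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        List<String> rules = new ArrayList<>();
        rules.add("^https?://example\\.com/docs/.*"); // assumed example pattern
        SelectiveIndexPolicy policy = new SelectiveIndexPolicy(rules);

        // A URL under /docs/ matches the pattern; the login page does not.
        System.out.println(policy.shouldIndex("http://example.com/docs/a.html"));
        System.out.println(policy.shouldIndex("http://example.com/login"));
    }
}
```

Pages for which `shouldIndex` returns false should still be fetched and parsed so that their outlinks are followed; they should only be left out of the index.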
- Which extension point should I extend? Elwin
- Re: Which extension point should I extend? Stefan Groschupf
- Re: Which extension point should I extend? Elwin
- Re: Which extension point should I extend? Stefan Groschupf
