We're running a few crawlers in parallel (different seed lists and topN parameters, using different crawl dirs), and we'd like to set the regex-urlfilter separately for each fetch/seed list. As of now, the urlfilter is global for every invocation of bin/nutch. Is there anything you can do besides making two instances of the nutch dir structure?


