Hi All,

Can I increase the number of reducers for the deduplication job on the crawldb?
It currently runs with 1 reducer.
Will this impact the crawling in any way?

Current command in the crawl script:
__bin_nutch dedup "$CRAWL_PATH"/crawldb

Can I update it to:
__bin_nutch dedup "$CRAWL_PATH"/crawldb mapreduce.job.reduces=32

Thanks in advance.

Suraj Singh
