Hi All, Can I increase the number of reducer in Deduplication on crawldb? Currently it is running with 1 reducer. Will it impact the crawling in any way?
Current command in crawl script: __bin_nutch dedup "$CRAWL_PATH"/crawldb Can I update it to: __bin_nutch dedup "$CRAWL_PATH"/crawldb mapreduce.job.reduces=32 Thanks it advance. Regards, Suraj Singh