Hi All,

I am running Nutch on a single node Hadoop cluster  , I do not use a
indexing URL and I have disabled the LinkInversion phase as I do not need
any scores to be attached to any URL.

My question is that if LinkInversion phase in Nutch is the only phase that
requires the Reduce task to be run , as since I have disabled it in the
Crawl.java class, can I go ahead and set the number of reduce tasks in
Hadoop job submission to zero, or is there any other phase that still
requires a reduce tasks.

Reply via email to