How can I use more than one "Num Tasks" in Hadoop?

beneo_7 Sun, 26 Dec 2010 23:20:45 -0800

Hi all,

I use Hadoop to run Mahout, and "Total input paths to process:" is always 1.
Why?


I have already set the "Map Task Capacity" to 32 and the "Reduce Task Capacity"
to 16; however, Mahout always runs in a single task.

But when I run the command "bin/hadoop jar hadoop-*-examples.jar grep input
output 'dfs[a-z.]+'", "Total input paths to process:" is 14.

I have 32 CPUs and 32 GB of memory, but only one CPU is used for clustering.
What should I do?
