Hi all,

I use Hadoop to run Mahout, and "Total input paths to process: " is
always 1. Why?

I have already set the "Map Task Capacity" to 32 and the "Reduce Task Capacity"
to 16; however, Mahout always runs in a single task.

But when I run the command "bin/hadoop jar hadoop-*-examples.jar grep input output
'dfs[a-z.]+'", "Total input paths to process:" is 14.

What should I do? I have 32 CPUs and 32 GB of memory, but only one CPU is used
for clustering.
