I have a map/reduce job that has a total of 6000 map tasks. The issue is
that the number of maps that is "running" at any given time is 6 (number of
nodes) and rest are pending. Does anyone know how to force the cluster to
run more maps in parallel to increase the throughput? This is the only job
that is running on this cluster.

Cluster summary:  0.19.2, 6 nodes, Map tasks capacity: 192, Avg tasks/Node:
64

Thanks,
Zeev

Reply via email to