I have a map/reduce job with a total of 6000 map tasks. The issue is that only 6 map tasks (equal to the number of nodes) are "running" at any given time, and the rest are pending. Does anyone know how to force the cluster to run more maps in parallel to increase throughput? This is the only job running on this cluster.
Cluster summary: 0.19.2, 6 nodes, Map tasks capacity: 192, Avg tasks/Node: 64

Thanks,
Zeev
