Got the swimlanes tool working. It revealed a problem. On the left-hand side of the graph (container ID) it shows a job (1TB data scan) kicking off 200 containers. Cool, right? The problem is that YARN is only reporting 100 available containers. Happy to share the SVG if I can host it somewhere (JIRA perhaps?).
My cluster configuration:

- 10 datanodes/nodemanagers
- 20 CPUs allocated per node to the NMs
- 20GB RAM allocated per node to the NMs
- 2GB min, 8GB max allocation per container

When I spin up a full MapReduce job doing the same 1TB data scan, it shows 100 containers in use in the ResourceManager web UI. The Tez job showed exactly the same number there; only swimlanes is different.

Thoughts?

Thad
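
For reference, a rough back-of-the-envelope check of the numbers above. This is only a sketch: it assumes every container is sized at the 2GB minimum allocation and, for the CPU bound, that each container asks for one vcore (the per-container vcore request is my assumption, not something I have verified on this cluster). The names are just illustrative.

```python
# Figures from the cluster configuration in this message.
NODES = 10
MEM_PER_NODE_GB = 20
VCORES_PER_NODE = 20
MIN_ALLOC_GB = 2
MAX_ALLOC_GB = 8

# Assumption (not from the cluster config): one vcore per container.
VCORES_PER_CONTAINER = 1


def max_containers_by_memory(container_gb):
    """Upper bound on concurrent containers if only memory is considered."""
    per_node = MEM_PER_NODE_GB // container_gb  # whole containers per node
    return NODES * per_node


def max_containers_by_vcores(vcores_per_container):
    """Upper bound on concurrent containers if only vcores are considered."""
    per_node = VCORES_PER_NODE // vcores_per_container
    return NODES * per_node


by_mem_min = max_containers_by_memory(MIN_ALLOC_GB)      # 2GB containers
by_mem_max = max_containers_by_memory(MAX_ALLOC_GB)      # 8GB containers
by_vcores = max_containers_by_vcores(VCORES_PER_CONTAINER)

print("memory bound, 2GB containers:", by_mem_min)  # 100
print("memory bound, 8GB containers:", by_mem_max)  # 20
print("vcore bound, 1 vcore each:", by_vcores)      # 200
```

The memory bound at the minimum allocation lines up with the 100 containers the ResourceManager reports. The vcore-only bound comes out to 200 under the one-vcore assumption, which may or may not be related to what swimlanes is drawing.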
