Are there recommendations regarding master / scheduler machines resources as function of cluster size?
Say I have a cluster with hundreds of slave machines and thousands of CPUs, with a single framework that will schedule millions of tasks. How does the strength of the master & scheduler machines affect the overall cluster performance? Thanks, - Itamar.

