Hello,

As part of the gang scheduling work in 0.9, are you also looking at topology awareness at the rack and cluster level, so that the scheduler can be used for distributed deep learning training? There are use cases where a cluster has smaller affinity zones of nodes (CPU + accelerators) with a lower network hop count, so you want to schedule smaller jobs within these zones, as well as larger jobs that run across the full cluster (CPU + accelerators).

Greg
