> Slurm version 17.11.0-pre1 is the first pre-release of version 17.11, to > be > released in November 2017. This version contains the support for > scheduling of > a workload across a set (federation) of clusters which is described in > some > detail here: > https://slurm.schedmd.com/SLUG16/FederatedScheduling.pdf
Something that seems to be missing in the PDF (unless it's in the "Magic: TBD" part) is the ability for a federated job to have dependencies on sibling jobs - is this sill part of the workflow? ie Federation = MySite sibling cluster 1 = BigCray sibling cluster 2 = PrePostCluster ideallly we'd like a user who probably logged into BigCray as their local cluster to submiit a job with step1 - serial work on PrePostCluster step2 - large srun on BigCray, dependency = afterok: PrePostCluster:step1 step3 - small parallel cleanup on PrePostCluster, dependency=afterok: BigCray:step2 Is this still on schedule for the initial 17.11 release or will it land in a later update or release? Andrew (trying to work out if we'll have time to test 17.11 before upgrading all clusters the 1st week in Jan)
