> Slurm version 17.11.0-pre1 is the first pre-release of version 17.11, to
> be
> released in November 2017. This version contains the support for
> scheduling of
> a workload across a set (federation) of clusters which is described in
> some
> detail here:
> https://slurm.schedmd.com/SLUG16/FederatedScheduling.pdf

Something that seems to be missing in the PDF (unless it's in the
"Magic: TBD" part) is the ability for a federated job to have
dependencies on sibling jobs - is this sill part of the workflow?

ie
Federation = MySite
sibling cluster 1 = BigCray
sibling cluster 2 = PrePostCluster

ideallly we'd like a user who probably logged into BigCray as their
local cluster to submiit a job with
step1 - serial work on PrePostCluster
step2 - large srun on BigCray, dependency = afterok: PrePostCluster:step1
step3 - small parallel cleanup on PrePostCluster, dependency=afterok:
BigCray:step2


Is this still on schedule for the initial 17.11 release or will it
land in a later update or release?


Andrew (trying to work out if we'll have time to test 17.11 before
upgrading all clusters the 1st week in Jan)

Reply via email to