Greetings, We run multiple types of jobs on our cluster and many of our nodes have 48 cores or more. We have found that some jobs are idle on such nodes when are accessing data in aggregate over the available bandwidth. I was thinking of trying to create a “high-bandwidth” queue which would only allow say 10 such processes to run on each node so the bandwidth wouldn’t become a problem. Is such a thing possible with the slurm scheduler? If not any suggestions on how to solve such a problem?
Regards, Brian
smime.p7s
Description: S/MIME cryptographic signature
