This email actually reminded me of what we run here. We've been
operating with Slurm for the past 3 years in an environment similar to
your own. We schedule about 50k cores across 90 partitions on a
heterogeneous architecture: AMD and Intel, high-memory versus
low-memory nodes, and GPUs. Slurm has handled it all, and a lot of the
lessons we learned have been pushed back to the community. Thus the
current version of Slurm should be able to handle all that you have
defined there.
The only tricky part that I am aware of may be the Kerberos piece. We
use LDAP for our auth on the boxes, and Slurm does have a PAM module
for controlling who gets access. That plus cgroups should take care of
your security concerns.
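For readers unfamiliar with the pieces mentioned above: the PAM module is pam_slurm_adopt (shipped in Slurm's contribs), and the cgroup confinement is enabled via slurm.conf and cgroup.conf. A minimal sketch of what that setup might look like (paths and option choices are illustrative, not taken from this thread):

```
# /etc/pam.d/sshd -- deny SSH to a node unless the user has a job running there
account    required     pam_slurm_adopt.so

# slurm.conf -- track and confine tasks with cgroups
ProctrackType=proctrack/cgroup
TaskPlugin=task/cgroup

# cgroup.conf -- fence each job into its allocated cores, memory, and devices
ConstrainCores=yes
ConstrainRAMSpace=yes
ConstrainDevices=yes
```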
Anyways, suffice it to say Slurm can work for your environment, as
your environment is fairly similar to ours.
-Paul Edmon-
On 05/24/2016 07:33 AM, Šimon Tóth wrote:
Architectural constraints of Slurm
Hello,
I'm looking for information on Slurm's architectural constraints, as
we are considering a switch to Slurm.
We are currently running a heavily modified version of Torque with a
custom scheduler.
Our system (~13k CPU cores) is heavily heterogeneous (~40 clusters)
with complex operational constraints. The system generally handles
10k-50k enqueued jobs.
We are currently scheduling CPU cores, memory, GPU cards, scratch
space (local, SSD, and NFS, with different machines having access to
different combinations of these), and software licenses.
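(For reference, the Slurm-side equivalents of these would typically be generic resources (GRES) for GPUs and local scratch, and cluster-wide countable Licenses. A hypothetical slurm.conf fragment, with made-up node names and counts:

```
# slurm.conf -- illustrative fragment, names and counts are hypothetical
GresTypes=gpu
NodeName=gpu[01-08] CPUs=32 RealMemory=256000 Gres=gpu:4
Licenses=matlab:10,ansys:4
```

Jobs would then request these with options such as `--gres=gpu:2` and `--licenses=matlab:1`.)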
Machines are described by a set of physical and software properties
(which users can request) and by their speed (users can request ranges
of machine performance).
Jobs carry complex requests. Each job can request sets of machines,
where each set carries a different specification. Each set is
described by the amount of resources requested, machine properties
(negative specification is supported, to select nodes that do not have
a specific property), and the number of nodes with that specification.
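(In Slurm terms, a multi-set request with property constraints, including negation, could be sketched as a heterogeneous job, where node properties map to features and `--constraint` supports `&` and `!` operators. All names below are hypothetical:

```
#!/bin/bash
# Component 1: 4 nodes that are Intel but NOT high-memory
#SBATCH --nodes=4 --constraint="intel&!highmem"
#SBATCH hetjob
# Component 2: 2 AMD nodes with 2 GPUs each
#SBATCH --nodes=2 --constraint=amd --gres=gpu:2

# Launch one step per component
srun --het-group=0 ./cpu_part : --het-group=1 ./gpu_part
```

Each `hetjob` component carries its own resource specification, which is close in spirit to the per-set specifications described above.)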
Nodes can be allocated exclusively (in which case the specification
describes the minimum amount) or shared, with each resource still
being allocated exclusively (jobs cannot overlap in cores, memory, GPU
cards, ...).
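(This exclusive-versus-shared distinction maps fairly directly onto Slurm's `--exclusive` flag versus its consumable-resource selector, under which jobs share nodes while each core, memory block, and GRES is still handed out to exactly one job. A sketch, with illustrative values:

```
# slurm.conf -- allocate individual cores/memory/GRES rather than whole nodes
SelectType=select/cons_tres
SelectTypeParameters=CR_Core_Memory
```

On the submission side, `sbatch --exclusive` requests whole nodes, while an ordinary `sbatch -n 8 --mem=32G` job shares nodes but still gets its cores and memory dedicated to it.)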
We are relying on Kerberos, which is used for both identification and
authentication. Each running task inside a job has a nanny process
that periodically refreshes the Kerberos ticket for that particular
process.
For scalability reasons, our scheduler relies on the server to keep an
up-to-date and complete state of the system. The server therefore
tracks the current resource allocation state for each resource on
each node.
Please let me know if this sounds like something Slurm could handle,
or if there are any limitations in Slurm that would make this
impossible to support.
Sincerely,
Simon Toth