What you are asking for does not exist today, but it would be relatively simple to add. There are already some RPCs that communicate only with a local slurmd on the compute node (see "scontrol listpids"). Note this would only work once the job has actually tried to launch something on the node (e.g. "salloc" by itself would not work unless Slurm was configured to launch a Prolog on every compute node at job allocation time).

Quoting Olli-Pekka Lehto <[email protected]>:
Hi,

I would like to regularly poll the resources reserved by SLURM on each compute node with a very short interval. Is there some simple way to get this information locally on each node, without having to poll the server. The only thing that comes to mind immediately is to start digging into the cgroups configuration on each node but I’m hoping there would be a simpler solution.

Best regards,
Olli-Pekka=


--
Morris "Moe" Jette
CTO, SchedMD LLC
Commercial Slurm Development and Support

Reply via email to