The slurmctld log will report when a host's slurm.conf is out of sync with the one slurmctld is using.  That said, it doesn't report on cgroup.conf consistency in the same way.  It's possible that dialing up the verbosity of the slurmd logs would give that info, but I haven't seen it in normal operation.
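
For the "dump the files and diff them" approach, a rough sketch along these lines might help. It pulls cgroup.conf from each node over ssh and compares checksums against the local copy. The path /etc/slurm/cgroup.conf, passwordless ssh to the nodes, and the function names are all assumptions for illustration, not anything Slurm provides. Note that it only compares the files on disk; it cannot tell you what a running slurmd actually loaded.

```python
import hashlib
import subprocess
import sys

# Assumed location of the file; adjust for your site.
CONF_PATH = "/etc/slurm/cgroup.conf"


def digest(data):
    """Return the SHA-256 hex digest of a config file's bytes."""
    return hashlib.sha256(data).hexdigest()


def fetch_remote(host, path=CONF_PATH):
    """Read a file from a node over ssh (assumes key-based login works)."""
    result = subprocess.run(
        ["ssh", "-o", "BatchMode=yes", host, "cat", path],
        capture_output=True,
        check=True,
        timeout=30,
    )
    return result.stdout


def out_of_sync(reference_digest, digests):
    """Return the sorted hosts whose digest differs from the reference."""
    return sorted(h for h, d in digests.items() if d != reference_digest)


if __name__ == "__main__":
    # Node names are given on the command line, e.g. node01 node02 ...
    with open(CONF_PATH, "rb") as fh:
        reference = digest(fh.read())
    digests = {host: digest(fetch_remote(host)) for host in sys.argv[1:]}
    for host in out_of_sync(reference, digests):
        print(f"{host}: {CONF_PATH} differs from the local copy")
```

Run it from the head node with the compute node names as arguments; any node it prints has a cgroup.conf that differs from the head node's copy.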

-Paul Edmon-

On 10/6/22 5:47 PM, Davide DelVento wrote:
Is there a simple way to check that what slurm is running is what the
config says it should be?

For example, my understanding is that changing cgroup.conf should be
followed by 'systemctl stop slurmd' on all compute nodes, then
'systemctl restart slurmctld' on the head node, then 'systemctl start
slurmd' on the compute nodes.

Assuming this is correct, is there a way to query the nodes and ask if
they are indeed running what the config is saying (or alternatively
have them dump their config files somewhere for me to manually run a
diff on)?

Thanks,
Davide

