Hello,

I've set a basic configuration using slurm with a master node, backup node,
a login node and eight compute node. The login node is considered so thanks
to pam module which restricts access to the compute nodes. So in the eight
compute nodes, I edited /etc/pam.d/system-auth and /etc/pam.d/password-auth
and added
"account required pam_slurm.so debug"
Everything in slurm is working fine. I can issue jobs and see the state of
the eight nodes as Idle. However, when I try to connect to a compute node
with a user, even if he has a job running on, I get rejected. The log shows
that the pam module is unable to contact slurm controller and the error is
returned by slurm_load_job_user. But the squeue command which uses the same
routine as I read works fine on the same node...

Do you have any idea about the origin of the problem ? I can post further
details about my configuration if required.

Thank you.

Reply via email to