Hello, I've set a basic configuration using slurm with a master node, backup node, a login node and eight compute node. The login node is considered so thanks to pam module which restricts access to the compute nodes. So in the eight compute nodes, I edited /etc/pam.d/system-auth and /etc/pam.d/password-auth and added "account required pam_slurm.so debug" Everything in slurm is working fine. I can issue jobs and see the state of the eight nodes as Idle. However, when I try to connect to a compute node with a user, even if he has a job running on, I get rejected. The log shows that the pam module is unable to contact slurm controller and the error is returned by slurm_load_job_user. But the squeue command which uses the same routine as I read works fine on the same node...
Do you have any idea about the origin of the problem ? I can post further details about my configuration if required. Thank you.
