Re: [slurm-users] srun: Error generating job credential

2019-10-09 Thread Marcus Wagner
Damn, I almost always forget, that most of the submission part is done on the master :/ Best Marcus On 10/8/19 11:45 AM, Eddy Swan wrote: Hi Sean, Thank you so much for your additional information. The issue is indeed due to missing user on the head node. After i configured ldap client on

Re: [slurm-users] srun: Error generating job credential

2019-10-08 Thread Eddy Swan
Hi Sean, Thank you so much for your additional information. The issue is indeed due to missing user on the head node. After i configured ldap client on slurm-master, srun command is now working using ldap account. Best regards, Eddy Swan On Tue, Oct 8, 2019 at 4:15 PM Sean Crosby wrote: >

Re: [slurm-users] srun: Error generating job credential

2019-10-08 Thread Sean Crosby
Looking at the SLURM code, it looks like it is failing with a call to getpwuid_r on the ctld What is (on slurm-master): getent passwd turing getent passwd 1000 Sean -- Sean Crosby | Senior DevOpsHPC Engineer and HPC Team Lead Research Platform Services | Business Services CoEPP Research

Re: [slurm-users] srun: Error generating job credential

2019-10-07 Thread Eddy Swan
Hi Marcus, I did not restarted munge previously. So I restarted munge and follow by slurmd, but the issue still persists. I ran the following test from piglet-17 to verify the munge installation, it looks good. $ munge -n | unmunge STATUS: Success (0) ENCODE_HOST:

Re: [slurm-users] srun: Error generating job credential

2019-10-07 Thread Marcus Wagner
Hmm, that is strange. I asked because of the errors below: On 10/7/19 9:36 AM, Eddy Swan wrote: [2019-10-07T13:38:49.260] error: slurm_cred_create: getpwuid failed for uid=1000 [2019-10-07T13:38:49.260] error: slurm_cred_create error and "id" uses the same call (ltrace excerpt):