Thanks Daniel. But somehow the error is gone. I will look into the nsswitch conf when I see the same error next time.
Koji On Tue, Jan 5, 2016 at 2:42 PM, Daniel Letai <[email protected]> wrote: > > What's the nsswitch like on the node? > from the node, can you do: > # getent passwd | grep <ldap user> > > > On 01/05/2016 08:31 PM, Koji Tanaka wrote: >> >> Hello Slurm Community, >> >> I get following errors when I run a job as a LDAP user. However, as a >> local user, everything works fine. >> >> $ srun -N1 hostname >> srun: error: Task launch for 23.3 failed on node e1c001: User not found on >> host >> srun: error: Application launch failed: User not found on host >> srun: Job step aborted: Waiting up to 32 seconds for job step to finish. >> e1c001 >> >> And, there's following logs on slurmctld.log. >> >> [2016-01-05T12:45:41.182] job_step_signal step 23.2 not found >> [2016-01-05T12:45:41.198] step_partial_comp: StepID=23.2 invalid >> >> What could cause this error? I would appreciate any comments and >> suggestions. >> >> Thank you, >> Koji
