05.11.2014, 22:38, "Uwe Sauter" <[email protected]>:
> Hi Anna,
>
> I'm sorry to inform you that you have to have the user information on
> all nodes. You cannot run jobs with UIDs from users the local system
> does not know.
>
> If you don't want to distribute your /etc/passwd, /etc/shadow and
> /etc/group every time a user is added or removed, the best option would
> probably be a centralized user directory, e.g. LDAP.

Another approach is to use diskless slurmd nodes that boot from a TFTP
server (your slurmctld node can take that role) and then mount their root
directory over NFS. With diskless nodes you only need to update the relevant
files (/etc/passwd, /etc/shadow, etc.) in the nodes' shared root directory
(for example, /tftpboot/rootfs); slurmctld will then be able to spawn jobs on
the slurmd nodes under the new user's account, and no reboot is needed.
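
To check that the diskless root has caught up with the master after adding
users, a comparison along these lines may help. This is only a sketch in
Python: the function names are made up for illustration, and it assumes both
files use the standard colon-separated passwd format. It reports accounts
that are in the master's file but missing, or present with a different UID,
in the nodes' copy.

```python
def parse_passwd(text):
    """Map login name -> numeric UID from passwd-format text."""
    users = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        fields = line.split(":")
        # Skip malformed entries (fewer than 3 fields or a non-numeric UID).
        if len(fields) < 3 or not fields[2].isdigit():
            continue
        users[fields[0]] = int(fields[2])
    return users


def missing_on_nodes(master_text, node_text):
    """Return sorted (name, uid) pairs the nodes' file lacks or has with another UID."""
    master = parse_passwd(master_text)
    node = parse_passwd(node_text)
    return sorted(
        (name, uid) for name, uid in master.items() if node.get(name) != uid
    )
```

For example, pass it the contents of /etc/passwd on the slurmctld host and of
/tftpboot/rootfs/etc/passwd (the example path above); an empty list means the
two files agree on every account in the master's file.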

LDAP may well be the better approach, though; I do not have such an
installation at hand to compare.
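
Whichever approach is used, what matters is that each node can resolve the
job's UID; the getpwuid_r error in the quoted log is that lookup failing. A
minimal Python sketch to run on a node follows. resolve_uid is a hypothetical
helper name; pwd.getpwuid goes through the system's name service, so it
should cover local files as well as a directory such as LDAP.

```python
import pwd


def resolve_uid(uid, lookup=pwd.getpwuid):
    """Return the login name for uid, or None if this host cannot resolve it."""
    try:
        return lookup(uid).pw_name
    except KeyError:
        # pwd.getpwuid raises KeyError when no entry exists for the UID.
        return None


if __name__ == "__main__":
    uid = 1007  # the UID from the log quoted below
    name = resolve_uid(uid)
    if name is None:
        print(f"uid {uid} not found on this node")
    else:
        print(f"uid {uid} resolves to {name}")
```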

>
> Regards,
>
>         Uwe
>
> Am 05.11.2014 um 20:32 schrieb Anna Kostikova:
>> Dear list,
>>
>> Am I correct in assuming that if I use munge for slurm, then only slurmd
>> and munged need to be running on all slurm nodes, and I can keep all
>> unix users on another server, for instance the one running slurmctld?
>> For instance, I create a user with useradd, and when this user runs a
>> job in slurm, then, with the help of munge, the slurm node will recognise
>> his uid and gid, even though this unix user has not been created on that
>> node. If yes, then why might I be getting this error:
>>
>> [2014-11-05T10:24:30.729] launch task 602.0 request from 1007.1007@IP (port 22410)
>> [2014-11-05T10:24:30.751] error: _send_slurmstepd_init getpwuid_r: No error
>> [2014-11-05T10:24:30.751] error: Unable to init slurmstepd
>> [2014-11-05T10:24:30.751] uid 1007 not found on system
>> [2014-11-05T10:24:30.752] _step_setup: no job returned
>> [2014-11-05T10:24:30.752] Unable to send "fail" to slurmd
>> [2014-11-05T10:24:30.752] done with job
>> Munge keys are exactly the same on the slurm server and all nodes.
>>
>> From the description here (http://linux.die.net/man/7/munge) it looks
>> like all slurm users are kept in one place. But if not, then what is
>> munge for?
>> Or must a user with the specific uid and gid exist on each node?
>>
>> Thanks a lot for your help,
>> Anna

--
Vsevolod Nikonorov, JSC NIKIET
