-------- Original Message --------
From: Kevin <[email protected]>
Sent: Thu Jan 17 20:21:12 MST 2013
To: [email protected]
Subject: Support Question

Dear Slurm Dev support,


I\'ve setup and configured slurm 2.5.0 with 1 slurm controller and 9 nodes 
(total 10 computers in a cluster, all RHEL6 64 bit) .

Things i\'ve set up:
- using munge
- all computers are joined to windows Active Directory domain, so users can use 
their Active directory credentials to login to any RHEL6 computer in the 
cluster.
- samba + winbind
- NFS share

I have successfully set all the RHEL computers to use the same UID and GID for 
each users, using RID in samba config:
     idmap uid = 17000000-33554431
     idmap gid = 17000000-33554431
     idmap backend = rid:domain=100000-100000000

So when a user log in to node01, node02, etc, the user will always get the same 
UID and GID, thus, the user can run the srun command.


However, i am facing a challenge here.
If everytime a new user (never logged in to any computer in the cluster) try to 
login to the controller and run a srun command, for example \"srun -n30 -c1 
pwd\", the user will get this error:

srun: error: Task launch for 430.0 failed on node node01: User not found on host
srun: error: Task launch for 430.0 failed on node node02: User not found on host


I believe this is due to, the user account is not yet \"created\" on the node01 
and node02.

So the \"temporary\" solution to this is, i ask the user to login to all the 
slurm nodes in order to force all the nodes to create the user\'s 
profile/account.

This cluster will grow in future and we will add more slurm nodes.
I cannot imagine if i need to add 100 nodes, what the users will react if they 
need to login to additional 100 computers, just to create their accounts on 
each node.

I tried to implement NIS, but i think NIS only sync the local linux users 
account. In my case, the domain user accounts are not saved on the local linux 
account, but in winbind database (/var/lib64/samba/*.tdb).


I believe i am not the only one using this approach (using windows AD for 
authentication) for the linux machines together with slurm.

Can you give me a clue how to solve this challenge?


Thanks in advance....

Regards,

Kevin

Reply via email to