Thanks, this might be helpful. Two questions: 1) I don’t see how credentials 
are renewed. Do you use another plugin for that? 2) You are calling 
kdeposit. Is that something in-house? 
_______________
Gedaliah Wolosh
IST Academic and Research Computing Systems (ARCS)
NJIT
GITC 2203
973 596 5437
[email protected]

> On Jul 11, 2017, at 2:29 AM, Daniel Ahlin <[email protected]> wrote:
> 
> Hi,
> 
> We are running Kerberos/AFS on some of our systems. If it is of any help, feel 
> free to take a look at our implementation in 
> /afs/pdc.kth.se/cluster/tegner/src/spank_krb5_propagate/spank_krb5_propagate.c,
> primarily lines 248-263. We are not running AUKS though, so perhaps there 
> are limited reuse possibilities.
> 
> Best regards,
> Daniel
> 
> On Mon, Jul 10, 2017 at 10:29 PM, Glenn (Gedaliah) Wolosh <[email protected]> wrote:
> Hello;
> 
> I’ve installed slurm 16.05 on SL 7.3 using ohpc. I also have the latest 
> version of AUKS. I was able to hack auks so that aklog successfully runs when 
> either obtaining or renewing a krb5 ticket.
> 
> For example:
> p-slogin.p-stheno.tartan.njit.edu-77 guest24>: kinit
> Password for [email protected]:
> p-slogin.p-stheno.tartan.njit.edu-78 guest24>: tokens
> 
> Tokens held by the Cache Manager:
> 
>    --End of list--
> p-slogin.p-stheno.tartan.njit.edu-79 guest24>: auks -g
> Auks API request succeed
> p-slogin.p-stheno.tartan.njit.edu-80 guest24>: tokens
> 
> Tokens held by the Cache Manager:
> 
> User's (AFS ID 22967) tokens for [email protected] [Expires Jul 10 22:34]
>    --End of list--
> 
> Works just as well with auks -R loop
> 
> I also added a slurm_spank_task_init() function to the auks SPANK plugin that 
> calls aklog. Unfortunately, this does not work. 
> I get the following error:
> p-slogin.p-stheno.tartan.njit.edu-81 guest24>: srun hostname
> aklog: Couldn't determine realm of user:aklog: unknown RPC error (-1765328189)  while getting realm
> 
> My guess is that in this case the user running aklog is not “guest24”.
> 
> Here are some relevant lines from the log:
> [2017-07-10T16:19:34.763] [78.0] debug3: Entering _handle_request
> [2017-07-10T16:19:34.763] [78.0] debug3: Leaving  _handle_accept
> [2017-07-10T16:19:34.773] [78.0] debug:  mpi type = (null)
> [2017-07-10T16:19:34.773] [78.0] debug:  Using mpi/none
> [2017-07-10T16:19:34.773] [78.0] debug:  task_p_pre_launch: 78.0, task 0
> [2017-07-10T16:19:34.773] [78.0] spank-auks: running aklog
> [2017-07-10T16:19:34.781] [78.0] debug2: spank: auks.so: task_init = 0
> [2017-07-10T16:19:34.781] [78.0] debug:  [job 78] attempting to run slurm task_prolog [/opt/local/bin/TaskProlog]
> [2017-07-10T16:19:34.813] [78.0] debug2: _set_limit: conf setrlimit RLIMIT_CPU no change in value: 18446744073709551615
> [2017-07-10T16:19:34.813] [78.0] debug2: _set_limit: conf setrlimit RLIMIT_FSIZE no change in value: 18446744073709551615
> [2017-07-10T16:19:34.813] [78.0] debug2: _set_limit: conf setrlimit RLIMIT_DATA no change in value: 18446744073709551615
> [2017-07-10T16:19:34.813] [78.0] debug2: _set_limit: RLIMIT_STACK  : max:inf cur:inf req:8388608
> [2017-07-10T16:19:34.813] [78.0] debug2: _set_limit: conf setrlimit RLIMIT_STACK succeeded
> [2017-07-10T16:19:34.813] [78.0] debug2: _set_limit: conf setrlimit RLIMIT_CORE no change in value: 0
> [2017-07-10T16:19:34.813] [78.0] debug2: _set_limit: conf setrlimit RLIMIT_RSS no change in value: 18446744073709551615
> [2017-07-10T16:19:34.813] [78.0] debug2: _set_limit: conf setrlimit RLIMIT_NPROC no change in value: 4096
> [2017-07-10T16:19:34.813] [78.0] debug2: _set_limit: RLIMIT_NOFILE : max:51200 cur:51200 req:1024
> [2017-07-10T16:19:34.813] [78.0] debug2: _set_limit: conf setrlimit RLIMIT_NOFILE succeeded
> [2017-07-10T16:19:34.813] [78.0] debug:  Couldn't find SLURM_RLIMIT_MEMLOCK in environment
> [2017-07-10T16:19:34.813] [78.0] debug2: _set_limit: conf setrlimit RLIMIT_AS no change in value: 18446744073709551615
> [2017-07-10T16:19:34.815] [78.0] task 0 (5305) exited with exit code 0.
> 
> Note that the TaskProlog also calls aklog. This will get me a token using 
> srun but will not get me a token when using sbatch. 
> 
> I also have “UsePAM=1” in my slurm.conf, with the following slurm PAM file:
> 
> auth    required        pam_localuser.so
> account required        pam_unix.so
> session required        pam_limits.so
> session required        pam_afs_session.so
> 
> This doesn’t work either.
> 
> Any advice would be greatly appreciated.
> _______________
> Gedaliah Wolosh
> IST Academic and Research Computing Systems (ARCS)
> NJIT
> GITC 2203
> 973 596 5437
> [email protected]
> 
> 
