[slurm-dev] Re: X11 plugin problems

2016-11-07 Thread Wensheng Deng
Hi,

I am relatively to Slurm -facing the same issue. As an user I could ssh
back from the compute node to the login node without being asked for a
password.

My OS is CentOS 7.2, and Slurm 16.05.4.

In slurmd.log, there are message as the following:

[2016-11-07T11:04:31.958] [10004626.0] debug2: Before call to spank_init()

[2016-11-07T11:04:31.958] [10004626.0] debug:  spank: opening plugin stack
/opt/slurm/etc/plugstack.conf

[2016-11-07T11:04:31.959] [10004626.0] debug3: Couldn't find sym
'slurm_spank_slurmd_init' in the plugin

[2016-11-07T11:04:31.959] [10004626.0] debug3: Couldn't find sym
'slurm_spank_job_prolog' in the plugin

[2016-11-07T11:04:31.959] [10004626.0] debug3: Couldn't find sym
'slurm_spank_init_post_opt' in the plugin

[2016-11-07T11:04:31.959] [10004626.0] debug3: Couldn't find sym
'slurm_spank_task_init_privileged' in the plugin

[2016-11-07T11:04:31.959] [10004626.0] debug3: Couldn't find sym
'slurm_spank_task_init' in the plugin

[2016-11-07T11:04:31.959] [10004626.0] debug3: Couldn't find sym
'slurm_spank_task_post_fork' in the plugin

[2016-11-07T11:04:31.959] [10004626.0] debug3: Couldn't find sym
'slurm_spank_task_exit' in the plugin

[2016-11-07T11:04:31.959] [10004626.0] debug3: Couldn't find sym
'slurm_spank_job_epilog' in the plugin

[2016-11-07T11:04:31.959] [10004626.0] debug3: Couldn't find sym
'slurm_spank_slurmd_exit' in the plugin

[2016-11-07T11:04:31.959] [10004626.0] debug:  spank:
/opt/slurm/etc/plugstack.conf:35: Loaded plugin x11.so

[2016-11-07T11:04:31.959] [10004626.0] debug:  SPANK: appending plugin
option "x11"

[2016-11-07T11:04:31.959] [10004626.0] debug2: spank: x11.so: init = 0

[2016-11-07T11:04:31.959] [10004626.0] debug2: After call to spank_init()

[2016-11-07T11:04:31.959] [10004626.0] debug:  mpi type = (null)

[2016-11-07T11:04:31.959] [10004626.0] debug:  mpi/none: slurmstepd prefork

Also when running 'grep x11 ...' to the slurmd.log, I got the following:

[2016-11-07T11:04:31.921] debug2: spank: /opt/slurm/lib64/slurm/x11.so: no
callbacks in this context

[2016-11-07T11:04:31.925] debug:  unsetenv (_SLURM_SPANK_OPTION_x11_x11)

[2016-11-07T11:04:31.959] [10004626.0] debug:  spank:
/opt/slurm/etc/plugstack.conf:35: Loaded plugin x11.so

[2016-11-07T11:04:31.959] [10004626.0] debug:  SPANK: appending plugin
option "x11"

[2016-11-07T11:04:31.959] [10004626.0] debug2: spank: x11.so: init = 0

[2016-11-07T11:04:31.972] [10004626.0] error: x11: unable to read DISPLAY
value

[2016-11-07T11:04:31.972] [10004626.0] debug2: spank: x11.so: user_init = -4

I appreciated any insight!


Thanks,
Wensheng



On Tue, Sep 20, 2016 at 7:27 PM, Christopher Samuel 
wrote:

>
> On 21/09/16 06:13, Simpson, Claire L wrote:
>
> > srun: error: x11: unable to connect node node010
>
> Can users ssh back from the compute node to the login node without being
> prompted for a password/passphrase or to accept an ssh key?
>
> That's usually the source of those issues in my experience.
>
> SSH host based authentication within the cluster helps with that, along
> with caching SSH keys in /etc/ssh/ssh_known_hosts.
>
> Best of luck,
> Chris
> --
>  Christopher SamuelSenior Systems Administrator
>  VLSCI - Victorian Life Sciences Computation Initiative
>  Email: sam...@unimelb.edu.au Phone: +61 (0)3 903 55545
>  http://www.vlsci.org.au/  http://twitter.com/vlsci
>


[slurm-dev] Re: X11 plugin problems

2016-09-20 Thread Christopher Samuel

On 21/09/16 06:13, Simpson, Claire L wrote:

> srun: error: x11: unable to connect node node010

Can users ssh back from the compute node to the login node without being
prompted for a password/passphrase or to accept an ssh key?

That's usually the source of those issues in my experience.

SSH host based authentication within the cluster helps with that, along
with caching SSH keys in /etc/ssh/ssh_known_hosts.

Best of luck,
Chris
-- 
 Christopher SamuelSenior Systems Administrator
 VLSCI - Victorian Life Sciences Computation Initiative
 Email: sam...@unimelb.edu.au Phone: +61 (0)3 903 55545
 http://www.vlsci.org.au/  http://twitter.com/vlsci


[slurm-dev] Re: X11 plugin problems

2016-09-20 Thread Simpson, Claire L
Hi Justin,

Yeah, stupid Outlook. Sorry, I am using the double dash, if you see any singles 
just assume that’s Outlook not me. :-)

I’ve tried using salloc but it doesn’t seem to help:

salloc --partition=interactive --job-name=testx11 srun --pty --x11 
--preserve-env /bin/bash

still gives the same error:

salloc: Granted job allocation 97
srun: Job step created
srun: error: x11: unable to connect node node010

Thanks,

Claire

Dr Claire L. Simpson, PhD
Assistant Professor
University of Tennessee Health Science Center
Dept. of Genetics, Genomics and Informatics (Primary)
Dept. of Ophthalmology (Secondary)
71 S. Manassas Room 417
Memphis, TN  38163

phone 901 448-3084
fax 901 448-3500
email:clairelsimp...@uthsc.edu

From: Senseney, Justin S. (Fed) [mailto:justin.sense...@nist.gov]
Sent: Tuesday, September 20, 2016 3:22 PM
To: slurm-dev <slurm-dev@schedmd.com>
Subject: [slurm-dev] Re: X11 plugin problems

Claire,

*Are you using salloc before hand  with --x11 specified?

*Are you using --pty and not –pty?  Your email program probably converted that 
incorrectly to a single dash.

-Justin


From: "Simpson, Claire L" <csimp...@uthsc.edu<mailto:csimp...@uthsc.edu>>
Reply-To: slurm-dev <slurm-dev@schedmd.com<mailto:slurm-dev@schedmd.com>>
Date: Tuesday, September 20, 2016 at 4:13 PM
To: slurm-dev <slurm-dev@schedmd.com<mailto:slurm-dev@schedmd.com>>
Subject: [slurm-dev] X11 plugin problems

I’m trying to get the x11 plugin to work and I’m having some difficulties.

ssh --X

to one of my compute nodes works fine and

env | grep DIS

displays the correct IP address for that node.

But

srun --x11 –pty bash

gives me an error:

srun: error: x11: unable to connect node node010

and

env | grep DIS

now says DISPLAY=:10.0. So I checked the values in sshd_config and 
X11Forwarding yes and X11UseLocalhost no, so that seems to be correct. I’ve 
checked the locations of x11.so, plugstack.conf, and slurm-spank-x11 and 
they’re all where I believe they are supposed to be. So I’m kind of stuck (and 
new to Slurm) and I really need some help figuring out why this isn’t working.

Thanks,

Claire

Dr Claire L. Simpson, PhD
Assistant Professor
University of Tennessee Health Science Center
Dept. of Genetics, Genomics and Informatics (Primary)
Dept. of Ophthalmology (Secondary)
71 S. Manassas Room 417
Memphis, TN  38163

phone 901 448-3084
fax 901 448-3500
email:clairelsimp...@uthsc.edu<mailto:clairelsimp...@uthsc.edu>



[slurm-dev] Re: X11 plugin problems

2016-09-20 Thread Senseney, Justin S. (Fed)
Claire,

*Are you using salloc before hand  with --x11 specified?

*Are you using --pty and not –pty?  Your email program probably converted that 
incorrectly to a single dash.

-Justin


From: "Simpson, Claire L" >
Reply-To: slurm-dev >
Date: Tuesday, September 20, 2016 at 4:13 PM
To: slurm-dev >
Subject: [slurm-dev] X11 plugin problems

I’m trying to get the x11 plugin to work and I’m having some difficulties.

ssh --X

to one of my compute nodes works fine and

env | grep DIS

displays the correct IP address for that node.

But

srun --x11 –pty bash

gives me an error:

srun: error: x11: unable to connect node node010

and

env | grep DIS

now says DISPLAY=:10.0. So I checked the values in sshd_config and 
X11Forwarding yes and X11UseLocalhost no, so that seems to be correct. I’ve 
checked the locations of x11.so, plugstack.conf, and slurm-spank-x11 and 
they’re all where I believe they are supposed to be. So I’m kind of stuck (and 
new to Slurm) and I really need some help figuring out why this isn’t working.

Thanks,

Claire

Dr Claire L. Simpson, PhD
Assistant Professor
University of Tennessee Health Science Center
Dept. of Genetics, Genomics and Informatics (Primary)
Dept. of Ophthalmology (Secondary)
71 S. Manassas Room 417
Memphis, TN  38163

phone 901 448-3084
fax 901 448-3500
email:clairelsimp...@uthsc.edu