Re: [elasticluster] SLURM sbatch error

Riccardo Murri Thu, 19 Apr 2018 12:55:18 -0700

Hi Orxan,

I tried starting a cluster with your config, but I still cannot
reproduce the error:


* the only differences are:

- I only started 1 compute node
- I used dthe public ebian-9-stretch-v20180401 as a base OS since I
cannot use the image-23 snapshot from your project
- I use my own credentials and project on GCE, of course

* the cluster setup finishes this way::

PLAY RECAP 
***************************************************************************************************************************************************
compute001                 : ok=121  changed=79   unreachable=0    failed=0
frontend001                : ok=124  changed=87   unreachable=0    failed=0

2018-04-19 21:46:09 monia gc3.elasticluster[9250] INFO Cluster
correctly configured.

* I can regularly start jobs on SLURM:

riccardo.murri@frontend001:~$ sinfo -N
NODELIST    NODES PARTITION STATE
compute001      1     main* idle

riccardo.murri@frontend001:~$ vi hello.sh
riccardo.murri@frontend001:~$ cat hello.sh
#!/bin/sh

echo hello

riccardo.murri@frontend001:~$ sbatch hello.sh
Submitted batch job 2

riccardo.murri@frontend001:~$ cat slurm-2.out
hello

I guess there was some error during the setup, which went unnoticed.
Can you dump the entire set session (STDOUT + STDERR) of a problematic
cluster to a file  and send it to me?

Ciao,
R

-- 
You received this message because you are subscribed to the Google Groups 
"elasticluster" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/d/optout.

Re: [elasticluster] SLURM sbatch error

Reply via email to