Hi Orxan,

I tried starting a cluster with your config, but I still cannot reproduce the error:

* the only differences are:

  - I only started 1 compute node
  - I used the public debian-9-stretch-v20180401 image as the base OS, since I cannot use the image-23 snapshot from your project
  - I use my own credentials and project on GCE, of course

* the cluster setup finishes this way::

    PLAY RECAP ***************************************************************************************************************************************************
    compute001                 : ok=121  changed=79   unreachable=0    failed=0
    frontend001                : ok=124  changed=87   unreachable=0    failed=0

    2018-04-19 21:46:09 monia gc3.elasticluster[9250] INFO Cluster correctly configured.

* I can run jobs on SLURM without problems::

    riccardo.murri@frontend001:~$ sinfo -N
    NODELIST    NODES PARTITION STATE
    compute001      1     main* idle
    riccardo.murri@frontend001:~$ vi hello.sh
    riccardo.murri@frontend001:~$ cat hello.sh
    #!/bin/sh
    echo hello
    riccardo.murri@frontend001:~$ sbatch hello.sh
    Submitted batch job 2
    riccardo.murri@frontend001:~$ cat slurm-2.out
    hello

I guess there was some error during the setup that went unnoticed. Can you dump the entire setup session (STDOUT + STDERR) of a problematic cluster to a file and send it to me?

Ciao,
R
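P.S. In case it helps, here is a minimal sketch of one way to capture the whole session, assuming a POSIX shell. The helper name `log_session`, the file name `setup.log`, and the cluster name `mycluster` are placeholders of mine, not anything taken from your config; adapt them as needed.

```shell
#!/bin/sh
# log_session LOGFILE COMMAND [ARGS...]
# (hypothetical helper) Run COMMAND, merge its STDERR into STDOUT,
# and write the combined stream both to the terminal and to LOGFILE.
log_session() {
    logfile="$1"
    shift
    # 2>&1 redirects STDERR into STDOUT before the pipe, so `tee`
    # receives both streams and copies them to the log file.
    "$@" 2>&1 | tee "$logfile"
}

# Example usage (assumed cluster name; uncomment and adjust):
# log_session setup.log elasticluster setup mycluster
```

Note that, because of the pipe, the exit status you see is the one from `tee`, not from the command being logged, so look for failures in the log itself rather than in `$?`.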
