Thanks.
Here is the output.
Everything seems normal, doesn't it?
agunst@couperin:~$ salloc -vvvvv -w bruckner
salloc: defined options for program `salloc'
salloc: --------------- ---------------------
salloc: user : `agunst'
salloc: uid : 10001
salloc: gid : 10007
salloc: ntasks : 1 (default)
salloc: cpus_per_task : 1 (default)
salloc: nodes : 1 (default)
salloc: partition : default
salloc: job name : `bash'
salloc: reservation : `(null)'
salloc: wckey : `(null)'
salloc: distribution : unknown
salloc: verbose : 5
salloc: immediate : false
salloc: overcommit : false
salloc: account : (null)
salloc: comment : (null)
salloc: dependency : (null)
salloc: network : (null)
salloc: qos : (null)
salloc: constraints : mincpus=1 nodelist=bruckner
salloc: geometry : (null)
salloc: reboot : yes
salloc: rotate : no
salloc: mail_type : NONE
salloc: mail_user : (null)
salloc: sockets-per-node : -2
salloc: cores-per-socket : -2
salloc: threads-per-core : -2
salloc: ntasks-per-node : 0
salloc: ntasks-per-socket : -2
salloc: ntasks-per-core : -2
salloc: plane_size : 4294967294
salloc: cpu_bind : default
salloc: mem_bind : default
salloc: user command : `/bin/bash'
salloc: debug: Entering slurm_allocation_msg_thr_create()
salloc: debug: port from net_stream_listen is 45043
salloc: debug: Entering _msg_thr_internal
salloc: debug4: eio: handling events for 1 objects
salloc: debug3: Called eio_message_socket_readable 0 3
salloc: debug3: Trying to load plugin /usr/lib/slurm/auth_none.so
salloc: Null authentication plugin loaded
salloc: debug3: Success.
salloc: debug3: Trying to load plugin /usr/lib/slurm/select_cons_res.so
salloc: debug3: Success.
salloc: debug3: Trying to load plugin /usr/lib/slurm/select_bgq.so
salloc: debug3: Success.
salloc: debug3: Trying to load plugin /usr/lib/slurm/select_cray.so
salloc: debug3: Success.
salloc: debug3: Trying to load plugin /usr/lib/slurm/select_linear.so
salloc: debug3: Success.
salloc: debug3: Trying to load plugin /usr/lib/slurm/select_bluegene.so
salloc: debug3: Success.
salloc: debug4: eio: handling events for 1 objects
salloc: debug3: Called eio_message_socket_readable 0 3
salloc: Granted job allocation 323
salloc: debug: laying out the 8 tasks on 1 hosts bruckner
salloc: Relinquishing job allocation 323
salloc: debug3: Called eio_msg_socket_accept
salloc: debug2: got message connection from 192.168.96.104:37736 7
salloc: debug3: job complete message received
salloc: Job allocation 323 has been revoked.
salloc: debug4: eio: handling events for 1 objects
salloc: debug3: Called eio_message_socket_readable 0 3
salloc: debug2: slurm_allocation_msg_thr_destroy: clearing up message thread
salloc: debug4: eio: handling events for 1 objects
salloc: debug3: Called eio_message_socket_readable 1 3
salloc: debug2: false, shutdown
salloc: debug: Leaving _msg_thr_internal
--
Alexis GÜNST HORN
System administrator
Exascale Computing Research
On 12/07/2011 13:26, Andy Riebs wrote:
Hi Alexis,
Try adding "-vvvvv" to the command line; it may offer a hint about what
is going wrong.
Andy
On 07/12/2011 04:53 AM, Alexis GÜNST HORN wrote:
Here are some more details, if you want.
First, I connect to the service node. Then I run salloc:
agunst@couperin:~/nfs$ salloc -w borodine bash
salloc -w borodine bash
salloc: Granted job allocation 90
... bash opens, so it works
agunst@couperin:~/nfs$ exit
salloc: Relinquishing job allocation 90
salloc: Job allocation 90 has been revoked.
Then, if I try to do it again immediately afterwards, it does not work
anymore...
agunst@couperin:~/nfs$ salloc -w borodine bash
salloc -w borodine bash
salloc: Granted job allocation 91
salloc: Relinquishing job allocation 91
salloc: Job allocation 91 has been revoked.
agunst@couperin:~/nfs$
The problem is that in both cases, everything looks OK in the logs.
Any ideas?