Hello,
I'm sorry, but it still doesn't work :(

There is no trace of errors in the log. The only symptom is that when I run salloc, I can't get a node:

salloc: Pending job allocation 21
salloc: job 21 queued and waiting for resources
salloc: job 21 has been allocated resources
salloc: Granted job allocation 21
salloc: Relinquishing job allocation 21
salloc: Job allocation 21 has been revoked.


I'm now running version 2.2.7.

--
Alexis GÜNST HORN
System administrator
Exascale Computing Research


On 11/07/2011 22:58, [email protected] wrote:
There were some changes in salloc operation for Cray systems. More
information about the changes is available here:
https://github.com/SchedMD/slurm/blob/slurm-2.2/NEWS

Could you try SLURM v2.2.7? It definitely has some changes that might
fix this.

Moe Jette
SchedMD LLC

Quoting Alexis GÜNST HORN <[email protected]>:

Hello,

I've just upgraded my Debian Cluster from Stable (Squeeze) to Testing
(Wheezy).

In Squeeze, SLURM is 2.1.11
In Wheezy SLURM is 2.2.6

Before, on Squeeze, everything was working fine.
But now, it doesn't work anymore.

In fact, the "salloc" command does not work any more.

For example, if I run this on the management node:

$ salloc -w node2
salloc: Pending job allocation 9
salloc: job 9 queued and waiting for resources
salloc: job 9 has been allocated resources
salloc: Granted job allocation 9
salloc: Relinquishing job allocation 9
salloc: Job allocation 9 has been revoked.


So the allocation is revoked as soon as it is granted. Moreover, right
after that, a "bash" process is created on the management node and
takes 99% of the CPU.

Is it a bug? Or is there a change I need to make in the config file?
Please help me!

--
Alexis GÜNST HORN



