Hello there!
There is a problem with power save, it's experienced in SLURM 2.2.3.
When node went into NODE_STATE_POWER_SAVE and slurmctld restarted then
node will stay in NODE_STATE_UNKNOWN | NODE_STATE_POWER_SAVE forever in
slurmctld after it restarted. Solution is simple - node in that state
should be awakened so it change state from NODE_STATE_UNKNOWN to state
NODE_STATE_IDLE (if there is no error of course). Patch attached.
Andrej.
diff -udpr slurm-2.2.3.new2/src/slurmctld/node_mgr.c slurm-2.2.3/src/slurmctld/node_mgr.c
--- slurm-2.2.3.new2/src/slurmctld/node_mgr.c 2011-02-08 18:24:16.000000000 +0200
+++ slurm-2.2.3/src/slurmctld/node_mgr.c 2011-03-30 17:35:58.000000000 +0300
@@ -414,6 +414,7 @@ extern int load_all_node_state ( bool st
NODE_STATE_FAIL;
if (node_state & NODE_STATE_POWER_SAVE) {
if (power_save_mode) {
+ node_ptr->last_idle = now;
node_ptr->node_state |=
NODE_STATE_POWER_SAVE;
} else if (hs)