Hi all

We have upgraded to version 25.11.2 and are now noticing that nodes are removed 
getting removed from the topology/tree when the node gets suspended.

In the slurmctld.log we see things like this:

[2026-01-29T23:31:32.517] POWER: power_save: suspending nodes node1
[2026-01-29T23:32:37.536] debug2: _update_node_features_post: nodes node1 
active features set to: feature1
[2026-01-29T23:32:37.536] debug2: topology/tree: topology_p_add_rm_node: 
topology_p_add_rm_node: remove node1 from switch1

If we later on run a job with:

srun -w node1 --pty bash

then we see the following:

[2026-01-30T00:08:04.004] debug2: NodeSet for JobId=1234
[2026-01-30T00:08:04.005] _pick_best_nodes: JobId=1234 never runnable in 
partition part1
[2026-01-30T00:08:04.005] sched: schedule: JobId=1234 non-runnable: Requested 
topology configuration is not available
[2026-01-30T00:08:04.005] debug2: _schedule: setting JobId=1234 to 
"BadConstraints" (Requested topology configuration is not available)

This worked without any issues in the previous version 25.05.2, so something 
has change in the newer version 25.11.2.
My guess is something with dynamic nodes and/or active versus available 
features, but it’s just not clear :-(
Ideas anyone???


Our setup includes in slurm.conf (trimmed):

TopologyPlugin = topology/tree
NodeName=node1 Features=feature1
NodeSet=set1 Feature=feature1
PartitionName=part1 Nodes=set1 [...]
SuspendProgram = /power_save/suspend.sh
ResumeProgram = /power_save/resume.sh
ReconfigFlags = KeepPowerSaveSettings

In topology.conf:

SwitchName=switch1 Nodes=node1

The SuspendProgram and ResumeProgram scripts use tuned-adm to change the power 
profile (to powersave and hpc-compute resp.).

When looking at the node itself, after suspend, it shows (trimmed):

NodeName=node1 AvailableFeatures=feature1 ActiveFeatures=feature1 
State=IDLE+POWERED_DOWN PARTITION=part1


I hope someone can shed a light on this issue...
Thanks in advance for any reply!

-- Kind regards

Franky


-- 
slurm-users mailing list -- [email protected]
To unsubscribe send an email to [email protected]

Reply via email to