I’m troubleshooting an issue that causes NHC to fail to offline a bad node. The node offline script uses a formatted "sinfo" call to identify the node status, but that call returns blank output for some reason. Interestingly, sinfo works without custom formatting.
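
For context, the check boils down to something like the sketch below. This is a simplified illustration, not the actual NHC offline script: the function name parse_node_state is made up for this example, and it assumes the input is one line of `sinfo -o '%t %E' -h` output (compact state first, then the reason).

```shell
# Hypothetical helper (not NHC code): extract the node state from one line
# of "sinfo -o '%t %E' -h" output, and flag the blank-output case.
parse_node_state() {
    # Treat empty or whitespace-only output as a failure.
    if [ -z "$(printf '%s' "$1" | tr -d '[:space:]')" ]; then
        echo "unknown"
        return 1
    fi
    # %t (compact state) is the first field; the rest is the %E reason.
    printf '%s\n' "$1" | awk '{ print $1 }'
}

# Usage on a node with Slurm installed:
#   state=$(parse_node_state "$(sinfo -o '%t %E' -hn "$(hostname)")") \
#       || echo "sinfo returned nothing" >&2
```

With the blank output I am seeing, a check like this would end up with "unknown" instead of a real state, which is consistent with the node never being offlined.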
Could this be due to a bug in the current version (17.02.4)? Would someone mind trying the following commands in an older Slurm version to compare the output?

[root@devel-vcomp1 nhc]# sinfo --version
slurm 17.02.4

[root@devel-vcomp1 nhc]# sinfo -o '%t %E' -hn `hostname`
(NOTHING!)

[root@devel-vcomp1 nhc]# sinfo -hn `hostname`
test up infinite 0 n/a
vtest* up infinite 0 n/a
(OK)

Thanks!
-Mehmet

=========================================
Mehmet Belgin, Ph.D.
Scientific Computing Consultant
Partnership for an Advanced Computing Environment (PACE)
Georgia Institute of Technology
258 4th Street NW, Rich Building, #326
Atlanta, GA 30332-0700
Office: (404) 385-0665
