Hi, A few of the node in my cluster are sporadically going into a DRAIN state, and the last message i see in the slurmd log is:
error: write /var/spool/slurmd/cred_state.new error No space left on device However, the node in question has more than enough free space and enough inodes. Has anyone else seen this before? thanks in advance for any help.
