Hello

I installed a new version of slurm, 14.11.3.  It works fine.  However I
noticed that my log file /var/log/slurmctld shows
error: slurm_receive_msg: Incompatible versions of client and server code
This led me to discover that old slurm scontrol, squeue and sacct are still
running on the head node, using
ps -u root -F |grep slurm

I have tried to kill this every which way, but they wont die, they keep
resurrecting with different pid's.  I tried
 /old-slurm-version/bin/scontrol shutdown
but this gives me
slurm_shutdown error: Zero Bytes were transmitted or received

It seems like something is automatically restarting the old slurm.  I am
using Bright Cluster Manager, and I set it so that it does NOT auto-start
or run the slurm daemon, but that did not help.

Can someone help me kill this thing?  It is causing the creation of big log
zip files, and using up cpu capacity on the head node.

Regards
Andrew Petersen

Reply via email to