Public bug reported:

I realized that slurmd always shows as failed on ubuntu with a

"Jan 23 18:03:05 c1-compute-1.wehi.edu.au systemd[1]: Can't open PID
file /var/run/slurm/slurmd.pid (yet?) after start: No such file or
directory"


According to https://bugs.schedmd.com/show_bug.cgi?id=8388#c1 , 

This is happening because we create the PID file slightly after systemd
tries to read it.  Commands where systemd needs to know the PID (eg
systemctl restart slurmd.service) it will re-read the file (which
appears to be getting created properly).  From a functional standpoint,
this error shouldn't have any impact on systemd or slurm.


The solution, is to remove the PIDFile line on slurmd.service:, according  to 
upstream https://bugs.schedmd.com/show_bug.cgi?id=8388#c3 :

The quickest workaround you could use is to just comment out "PIDFile=*"
line in the unit file and do a daemon-reload. instead of reading the pid
file we write out, it will "guess" the main pid (and in my tests does so
correctly).

** Affects: slurm-llnl (Ubuntu)
     Importance: Undecided
         Status: New

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1959309

Title:
  Slurmd will always fail with PIDFile set on systemd

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/slurm-llnl/+bug/1959309/+subscriptions


-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

Reply via email to