That function causes slurmctld to send an RPC to the slurmd daemons on the compute nodes, which forward the RPC to the relevant slurmstepd, which in turn calls a proctrack plugin to signal the appropriate processes. At least that is my recollection.
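
To make the chain concrete, here is a toy model of that hand-off in Python. It is only a sketch of my recollection, not Slurm code: every class and function name in it (ProctrackPlugin, Stepd, Slurmd, controller_suspend) is made up for illustration, and the "plugin" only records what it would signal instead of touching real processes.

```python
import signal
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass
class ProctrackPlugin:
    """Hypothetical stand-in for a proctrack plugin."""
    name: str
    delivered: List[Tuple[int, int]] = field(default_factory=list)

    def signal(self, pids: List[int], signo: int) -> None:
        # A real plugin would act on the tracked processes here;
        # this sketch only records the request.
        for pid in pids:
            self.delivered.append((pid, signo))


@dataclass
class Stepd:
    """Hypothetical stand-in for one slurmstepd (one job step)."""
    job_id: int
    pids: List[int]
    plugin: ProctrackPlugin

    def handle_suspend(self, suspend: bool) -> None:
        # Assumption: suspend maps to SIGSTOP and resume to SIGCONT.
        signo = signal.SIGSTOP if suspend else signal.SIGCONT
        self.plugin.signal(self.pids, signo)


@dataclass
class Slurmd:
    """Hypothetical stand-in for the slurmd on one compute node."""
    node: str
    stepds: Dict[int, Stepd] = field(default_factory=dict)

    def handle_rpc(self, job_id: int, suspend: bool) -> bool:
        # Forward the request to the stepd that owns the job, if any.
        stepd = self.stepds.get(job_id)
        if stepd is None:
            return False
        stepd.handle_suspend(suspend)
        return True


def controller_suspend(nodes: List[Slurmd], job_id: int,
                       suspend: bool = True) -> List[str]:
    """Play the role of slurmctld: send the 'RPC' to every node and
    return the names of the nodes that had a matching stepd."""
    return [n.node for n in nodes if n.handle_rpc(job_id, suspend)]
```

Calling controller_suspend(nodes, job_id) walks the same three hops described above: controller to node daemon, node daemon to step daemon, step daemon to plugin. Which plugin ends up being called is decided by configuration, which is why the connection is not visible at the job_suspend call site.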

Quoting Marcin Stolarek <[email protected]>:

Hi,

I'm trying to understand how the job suspension mechanism works in my
configuration, but I got stuck looking for the definition of the job_suspend
function (used in slurmctld/gang.c).

I suppose that this depends on my proctrack plugin configuration, but I
cannot figure out where this connection (in my case probably with
_slurm_cgroup_suspend) is made in the code.

thanks in advance,
marcin

--
Marcin Stolarek
Interdisciplinary Centre for Mathematical and Computational Modelling (ICM),
University of Warsaw, Poland
