I am having slurmctld is hanging... Dec 25 01:36:20 bud30 slurmctld[6134]: sched: Allocate JobId=87 NodeList=172.16.105.33 #CPUs=2 Dec 25 01:36:20 bud30 slurmctld[6134]: sched: Allocate JobId=88 NodeList=172.16.105.34 #CPUs=2 Dec 25 01:36:20 bud30 slurmctld[6134]: sched: Allocate JobId=89 NodeList=172.16.105.35 #CPUs=2 Dec 25 01:36:31 bud30 slurmctld[6134]: error: epilog_slurmctld job 4 epilog exit status 1:0 Dec 25 01:36:56 bud30 slurmctld[6134]: server_thread_count over limit (256), waiting
and so far, it hasn’t recovered. Stu. -- Dr Stuart Midgley [email protected]
