I ran into an issue where one of my nodes semi-crashed: its root volume was remounted read-only, and backfill scheduling started behaving strangely, only ever trying the highest-priority job. I'm not sure I could reproduce this or give you enough information to figure out what happened. The main problem I had in tracking it down, though, was that the debug output which would have shown jobs being tested against the bad node comes after a return statement that fired with my configuration, so nothing was logged.

The details are:

In plugins/select/cons_res/job_test.c, in _can_job_run_on_node(), the debug block

    if (select_debug_flags & DEBUG_FLAG_CPU_BIND)

comes after the early return

    if (!(cr_type & CR_MEMORY))
        return cpus;

so when memory is not configured as a consumable resource, the function returns before the debug output is ever reached.
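To illustrate the ordering problem (a minimal standalone sketch, not the actual Slurm code; the flag values and function signatures here are invented stand-ins for CR_MEMORY, DEBUG_FLAG_CPU_BIND, and _can_job_run_on_node()):

    #include <stdio.h>
    #include <stdint.h>

    /* Hypothetical stand-ins for the real Slurm flag bits. */
    #define CR_MEMORY            0x0002u
    #define DEBUG_FLAG_CPU_BIND  0x0004u

    /* Current ordering: the early return fires before the debug
     * block, so nothing is printed when CR_MEMORY is not set. */
    static int can_run_current(uint32_t cr_type, uint64_t dbg, int cpus)
    {
        if (!(cr_type & CR_MEMORY))
            return cpus;                 /* debug block never reached */

        if (dbg & DEBUG_FLAG_CPU_BIND)
            printf("testing job on node: %d cpus\n", cpus);
        return cpus;
    }

    /* One possible fix: emit the debug line before any early return. */
    static int can_run_fixed(uint32_t cr_type, uint64_t dbg, int cpus)
    {
        if (dbg & DEBUG_FLAG_CPU_BIND)
            printf("testing job on node: %d cpus\n", cpus);

        if (!(cr_type & CR_MEMORY))
            return cpus;
        return cpus;
    }

    int main(void)
    {
        /* cr_type without CR_MEMORY, debug flag set: only the
         * reordered version produces any output. */
        can_run_current(0, DEBUG_FLAG_CPU_BIND, 4);
        can_run_fixed(0, DEBUG_FLAG_CPU_BIND, 4);
        return 0;
    }

Running this prints the debug line exactly once, from the reordered version, which is the behavior I would have wanted while diagnosing the bad node.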

Thanks,
Phil
