Hi,

I am trying to understand backfill chunking in Maui + Torque environment. From what I understand, the configuration will allow Maui to accumulate resources freed by jobs, for a configured time, before allocating them for other backfill jobs.

In my Maui.cfg, I set up the following:

BFCHUNKDURATION 00:05:00
BFCHUNKSIZE 5

The default walltime for torque queues is set to a large value (several thousand hours). The rest of the Maui configuration is default configuration.

Now, on a 14 node cluster, I did the following:

echo "sleep 3600" | qsub -l nodes=7
echo "sleep 3600" | qsub -l nodes=14 (This got queued, and got a reservation)

echo "sleep 3600" | qsub -l nodes=3 -l walltime=0:30:00
echo "sleep 3600" | qsub -l nodes=4 -l walltime=0:30:00 (Both these jobs ran)

echo "sleep 3600" | qsub -l nodes=7 -l walltime=0:30:00 (This got queued)
echo "sleep 3600" | qsub -l nodes=3 -l walltime=0:30:00 (This also got queued)

Now, when I deleted the running 3 node job using qdel, the last 3 node job immediately ran.

Shouldn't Maui have waited for BFCHUNKDURATION time before running this ? Can you please help me understand whether backfill chunking should have worked here ?

Thanks
hemanth





_______________________________________________
mauiusers mailing list
[email protected]
http://www.supercluster.org/mailman/listinfo/mauiusers

Reply via email to