Thanks for your answers.
The problem is that, right now, making h_rt mandatory is not an option,
so we need to work on the assumption that every job will last to
infinity and beyond.
Right now, the scheduler configuration is:
max_reservation 50
default_duration 24:00:00
During the weekend most of the parallel (and -R y) jobs started
running, but now there is something fishy in my queues:
The first 3 jobs in my waiting queue belong to user1. All 3 jobs request
-pe mpich_round 12, -R y and -l h_vmem=4G (h_vmem is set to consumable =
YES, not JOB). This user already has one such job running, and user1 has
an RQS that limits him to 12 slots in the whole cluster, so the 3
waiting jobs cannot run until the running one finishes.
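For reference, the quota rule looks essentially like this (the rule-set
name below is made up, but the limit is the one just described; qconf
-srqs shows the real one):

{
   name         max_slots_user1
   description  "cap user1 at 12 slots cluster-wide"
   enabled      TRUE
   limit        users user1 to slots=12
}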
This is the current schedule log:
# grep "::::\|RESERVING" schedule | tail -200 | grep "::::\|Q:all" |
tail -37 | sort
::::::::
2734185:1:RESERVING:1381142325:86460:Q:[email protected]:slots:1.000000
2734185:1:RESERVING:1381142325:86460:Q:[email protected]:slots:1.000000
2734185:1:RESERVING:1381142325:86460:Q:[email protected]:slots:1.000000
2734185:1:RESERVING:1381142325:86460:Q:[email protected]:slots:1.000000
2734185:1:RESERVING:1381142325:86460:Q:[email protected]:slots:1.000000
2734185:1:RESERVING:1381142325:86460:Q:[email protected]:slots:1.000000
2734185:1:RESERVING:1381142325:86460:Q:[email protected]:slots:1.000000
2734185:1:RESERVING:1381142325:86460:Q:[email protected]:slots:1.000000
2734185:1:RESERVING:1381142325:86460:Q:[email protected]:slots:1.000000
2734185:1:RESERVING:1381142325:86460:Q:[email protected]:slots:1.000000
2734185:1:RESERVING:1381142325:86460:Q:[email protected]:slots:1.000000
2734185:1:RESERVING:1381142325:86460:Q:[email protected]:slots:1.000000
2734186:1:RESERVING:1381228785:86460:Q:[email protected]:slots:1.000000
2734186:1:RESERVING:1381228785:86460:Q:[email protected]:slots:1.000000
2734186:1:RESERVING:1381228785:86460:Q:[email protected]:slots:1.000000
2734186:1:RESERVING:1381228785:86460:Q:[email protected]:slots:1.000000
2734186:1:RESERVING:1381228785:86460:Q:[email protected]:slots:1.000000
2734186:1:RESERVING:1381228785:86460:Q:[email protected]:slots:1.000000
2734186:1:RESERVING:1381228785:86460:Q:[email protected]:slots:1.000000
2734186:1:RESERVING:1381228785:86460:Q:[email protected]:slots:1.000000
2734186:1:RESERVING:1381228785:86460:Q:[email protected]:slots:1.000000
2734186:1:RESERVING:1381228785:86460:Q:[email protected]:slots:1.000000
2734186:1:RESERVING:1381228785:86460:Q:[email protected]:slots:1.000000
2734186:1:RESERVING:1381228785:86460:Q:[email protected]:slots:1.000000
2734187:1:RESERVING:1381315245:86460:Q:[email protected]:slots:1.000000
2734187:1:RESERVING:1381315245:86460:Q:[email protected]:slots:1.000000
2734187:1:RESERVING:1381315245:86460:Q:[email protected]:slots:1.000000
2734187:1:RESERVING:1381315245:86460:Q:[email protected]:slots:1.000000
2734187:1:RESERVING:1381315245:86460:Q:[email protected]:slots:1.000000
2734187:1:RESERVING:1381315245:86460:Q:[email protected]:slots:1.000000
2734187:1:RESERVING:1381315245:86460:Q:[email protected]:slots:1.000000
2734187:1:RESERVING:1381315245:86460:Q:[email protected]:slots:1.000000
2734187:1:RESERVING:1381315245:86460:Q:[email protected]:slots:1.000000
2734187:1:RESERVING:1381315245:86460:Q:[email protected]:slots:1.000000
2734187:1:RESERVING:1381315245:86460:Q:[email protected]:slots:1.000000
2734187:1:RESERVING:1381315245:86460:Q:[email protected]:slots:1.000000
Right now, the cluster is using 190 of its 320 slots. The schedule log
shows that the 3 waiting jobs from user1 are the only jobs making any
kind of reservation, and that they are reserving a total of 36 slots
(3 x 12). These 3 jobs are effectively blocking 36 already-free slots,
because the RQS doesn't allow user1 to use more than 12 slots at once.
This is not "nice", but I understand that the scheduler has its
limitations and cannot predict the future.
Taking into account the running jobs plus the slots and memory locked by
the reserving jobs, there is a grand total of 226 locked slots, leaving
94 free slots.
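That 36-slot total can be cross-checked straight from the schedule file.
A rough sketch, assuming the colon-separated format above (job id in the
first field, state in the third, utilization in the last) and that the
"::::" lines separate scheduler runs:

# reserved slots per job, counting only the most recent scheduler run
awk -F: '/^::::/ { split("", slots); next }
         $3 == "RESERVING" { slots[$1] += $NF }
         END { for (j in slots) print j, slots[j] }' schedule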
Here comes the problem: even though there are 94 free slots and plenty
of spare memory, NONE of the 4300 waiting jobs is running. There are
nodes with 6 free slots and 59 GB of free RAM, but none of the waiting
jobs gets scheduled. New jobs only start running when one of the 190
slots occupied by running jobs is freed. None of these other waiting
jobs requests -R y, -pe or h_rt.
Additionally, this is creating some odd behaviour. It seems that, on
each scheduler run, the scheduler tries to start jobs in those "blocked
slots" but fails for no apparent reason. Some jobs are even listed as
trying to start twice, yet almost none (generally none at all) actually
gets to run:
# tail -2000 schedule | grep -A 1000 "::::::" | grep "Q:all" | grep
STARTING | sort
2734121:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734122:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734123:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734124:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734125:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734126:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734127:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734128:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734129:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734130:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734131:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734132:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734133:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734134:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734135:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734136:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734137:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734138:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734139:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734140:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734141:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734142:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734143:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734144:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734145:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734146:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734147:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734148:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734149:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734150:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734151:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734152:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734153:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734154:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734155:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734156:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734157:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734158:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734159:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734160:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2734161:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2735158:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2735159:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2735160:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2735161:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2735162:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2735163:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2735164:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2735165:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2735166:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2735167:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2735168:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2735169:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2735170:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2735171:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2735172:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2735173:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2735174:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2735175:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2735176:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2735177:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2735178:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2735179:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2735180:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2735181:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2735182:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2735183:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2735184:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2735185:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2735186:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2735187:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2735188:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2735189:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2735190:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2735191:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2735192:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2735193:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743479:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743480:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743481:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743482:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743483:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743484:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743485:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743486:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743487:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743488:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743489:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743490:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743491:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743492:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743493:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743494:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743495:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743496:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743497:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743498:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743499:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743500:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743501:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743502:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743503:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743504:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743505:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743506:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743507:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743508:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743509:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743510:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743511:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743512:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743513:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743514:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743515:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743516:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743517:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
2743518:1:STARTING:1381144160:86460:Q:[email protected]:slots:1.000000
Even though these jobs appear listed as "STARTING", they are not running
at all; they simply produce a new "STARTING" message on each scheduling
interval.
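One quick way to confirm that these entries never turn into real running
jobs is to cross-check the job ids against qstat. A rough sketch
(assumes bash for the process substitutions and that the first column of
both outputs is the job id):

# job ids the scheduler claims to be STARTING in the recent log window
# but that qstat does not report as running
comm -23 \
  <(tail -2000 schedule | awk -F: '$3 == "STARTING" { print $1 }' | sort -u) \
  <(qstat -s r -u '*' | awk 'NR > 2 { print $1 }' | sort -u)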
Why are the reservations blocking a third of the cluster? It shouldn't
be a plain backfilling issue: they are blocking three times the number
of slots actually reserved. And why can't the "STARTING" jobs run?
Txema
On 07/10/13 09:28, Christian Krause wrote:
Hello,
We solved it by setting `h_rt` to FORCED in the complex list:
#name   shortcut  type  relop  requestable  consumable  default  urgency
#-----------------------------------------------------------------------
h_rt    h_rt      TIME  <=     FORCED       YES         0:0:0    0
And we have a JSV rejecting jobs that don't request it (because they
would otherwise stay pending indefinitely, unless you have a default
duration or use qalter).
You could also use a JSV to enforce that only jobs with large resource
requests (in your case, more than a certain number of slots) are able to
request a reservation, i.e.:
# pseudo JSV code
SLOT_RESERVATION_THRESHOLD=...
if slots < SLOT_RESERVATION_THRESHOLD then
    "disable reservation / reject"
else
    "enable reservation"
fi
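A more concrete version of that idea, as a shell JSV sketch combining
both suggestions (the threshold value is made up; parameter names follow
the shell JSV helpers shipped with Grid Engine, so adapt them to your
setup):

#!/bin/sh
# server JSV sketch: require h_rt, and only let "large" jobs keep -R y

jsv_on_start()
{
   return
}

jsv_on_verify()
{
   # reject jobs that do not request a run time at all
   if [ "$(jsv_sub_get_param l_hard h_rt)" = "" ]; then
      jsv_reject "please submit with -l h_rt=HH:MM:SS"
      return
   fi

   # only parallel jobs asking for at least N slots keep their -R y
   threshold=4                          # made-up threshold
   pe_min=$(jsv_get_param pe_min)       # empty for serial jobs
   if [ "${pe_min:-1}" -lt "$threshold" ] && [ "$(jsv_get_param R)" = "y" ]; then
      jsv_set_param R n                 # drop the reservation request
   fi

   jsv_accept "job accepted"
}

. "${SGE_ROOT}/util/resources/jsv/jsv_include.sh"
jsv_main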
On Fri, Oct 04, 2013 at 04:25:29PM +0200, Txema Heredia wrote:
Hi all,
I have a 27-node cluster. Currently 320 out of 320 slots are filled up,
all of them by jobs requesting a single slot.
At the top of my waiting queue there are 28 different jobs requesting 3
to 12 cores using two different parallel environments. All of these jobs
request -R y, yet they are being ignored and overtaken by the myriad of
1-slot jobs behind them in the waiting queue.
I have enabled the scheduler logging. During the last 4 hours it has
logged 724 new jobs starting, across all 27 nodes. Not a single job on
the system requests -l h_rt, but single-core jobs keep being scheduled
while all the parallel jobs starve.
As far as I understand, backfilling is killing my reservations even
though nobody requests any kind of run time; yet if I set the
"default_duration" to INFINITY, all the RESERVING log messages
disappear.
Additionally, for some odd reason, I only see RESERVING messages for the
jobs requesting a fixed number of slots (-pe whatever N). The jobs
requesting a slot range (-pe threaded 4-10) seem to reserve nothing.
My scheduler configuration is as follows:
# qconf -ssconf
algorithm default
schedule_interval 0:0:5
maxujobs 0
queue_sort_method load
job_load_adjustments np_load_avg=0.50
load_adjustment_decay_time 0:7:30
load_formula np_load_avg
schedd_job_info true
flush_submit_sec 0
flush_finish_sec 0
params MONITOR=1
reprioritize_interval 0:0:0
halftime 168
usage_weight_list cpu=0.187000,mem=0.116000,io=0.697000
compensation_factor 5.000000
weight_user 0.250000
weight_project 0.250000
weight_department 0.250000
weight_job 0.250000
weight_tickets_functional 1000000000
weight_tickets_share 1000000000
share_override_tickets TRUE
share_functional_shares TRUE
max_functional_jobs_to_schedule 200
report_pjob_tickets TRUE
max_pending_tasks_per_job 50
halflife_decay_list none
policy_hierarchy OSF
weight_ticket 0.010000
weight_waiting_time 0.000000
weight_deadline 3600000.000000
weight_urgency 0.100000
weight_priority 1.000000
max_reservation 50
default_duration 24:00:00
I have also tested it with params PROFILE=1 and default_duration set to
INFINITY, but then not a single reservation is logged in
/opt/gridengine/default/common/schedule and new jobs keep starting.
What am I missing? Is it possible to kill the backfilling? Are my
reservations really working?
Thanks in advance,
Txema
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users