On Thu, Feb 08, 2018 at 03:42:03PM -0800, Joshua Baker-LePain wrote:
>  153758 0.51149 tomography USER1       qw    02/08/2018 14:03:05              
>                     192
>  153759 0.00000 qss_svk_ge USER2       qw    02/08/2018 14:15:06              
>                       1 1
>  153760 0.00000 qss_svk_ge USER2       qw    02/08/2018 14:15:06              
>                       1 1
> 
> with more jobs below that, all with 0.0000 priority.  Starting at 14:03:06
> in the messages file, I see this:
> 
> 02/08/2018 14:03:06|worker|wynq1|E|not enough (1) free slots in queue 
> "ondemand.q@cin-id3" for job 153758.1
> 
> And in the schedule file I see this:
> 
> 153758:1:STARTING:1518127386:82860:P:mpi:slots:192.000000
> 153758:1:STARTING:1518127386:82860:H:msg-id19:mem_free:16106127360.000000
> 153758:1:STARTING:1518127386:82860:Q:member.q@msg-id19:slots:15.000000 
> 153758:1:STARTING:1518127386:82860:L:member_queue_limits:/USER1lab////:15.000000
> 153758:1:STARTING:1518127386:82860:H:qb3-id1:mem_free:1073741824.000000
> 153758:1:STARTING:1518127386:82860:Q:ondemand.q@qb3-id1:slots:1.000000 
> 153758:1:STARTING:1518127386:82860:L:ondemand_queue_limits:USER1/////:1.000000
> 153758:1:STARTING:1518127386:82860:H:qb3-id1:mem_free:11811160064.000000
> 153758:1:STARTING:1518127386:82860:Q:long.q@qb3-id1:slots:11.000000
> So why is it trying to give the job slots in ondemand.q?
> 
Has the job in question requested the ondemand queue via -masterq by any 
chance?  I have heard people who should know say that -masterq is somewhat
buggy.  I've never had a problem with -masterq myself but I don't use it much 
and we don't use RQS either.  Possibly the alleged bugginess of -masterq
manifests in the presence of RQS.

Does the pe in question have job_is_first_task set to false?  If so this may be 
a funny with treatment of the MASTER task by RQS.

William

Attachment: signature.asc
Description: PGP signature

_______________________________________________
users mailing list
users@gridengine.org
https://gridengine.org/mailman/listinfo/users

Reply via email to