Fan Dong wrote:
Hope someone can help with this. We submitted hundreds of jobs using something similiar to qsub -pe my_pe 4 my_job.sh. We found that there is always a nodes with 8 slots empty at any time that we checked. A screenshot is pasted here, either comp03 or comp04 is idle while there are bunch of jobs waiting in the queue. Ideally both comp03 and comp04 should have 2 tasks running all the time. Given our setup we expect 6 jobs running simultaneously but there are only 4 jobs instead. These are long-run jobs that each of them may last 4 hours so we checked the queue plenty of times and find this behaviour.

Can someone shed some light on this?

---------------------------------------------------------------------------------
[email protected] BIP 0/4/6 2.45 linux-x64 1344 0.55500 npairs_run anita r 05/27/2013 01:20:46 4 --------------------------------------------------------------------------------- [email protected] BIP 0/4/6 2.71 linux-x64 1343 0.55500 npairs_run anita r 05/27/2013 00:37:16 4 --------------------------------------------------------------------------------- [email protected] BIP 0/8/8 5.46 linux-x64 1345 0.55500 npairs_run anita r 05/27/2013 02:17:01 4 1346 0.55500 npairs_run anita r 05/27/2013 03:11:31 4 --------------------------------------------------------------------------------- [email protected] BIP 0/0/8 0.02 linux-x64
############################################################################
 - PENDING JOBS - PENDING JOBS - PENDING JOBS - PENDING JOBS - PENDING JOBS
############################################################################
1347 0.55500 npairs_run anita qw 05/23/2013 16:37:25 4 1348 0.55500 npairs_run anita qw 05/23/2013 16:37:25 4 1349 0.55500 npairs_run anita qw 05/23/2013 16:37:25 4 1350 0.55500 npairs_run anita qw 05/23/2013 16:37:25 4 1351 0.55500 npairs_run anita qw 05/23/2013 16:37:25 4 1352 0.55500 npairs_run anita qw 05/23/2013 16:37:25 4 1353 0.55500 npairs_run anita qw 05/23/2013 16:37:25 4 1354 0.55500 npairs_run anita qw 05/23/2013 16:37:25 4


On 05/17/2013 08:00 AM, [email protected] wrote:
Send users mailing list submissions to
        [email protected]

To subscribe or unsubscribe via the World Wide Web, visit
        https://gridengine.org/mailman/listinfo/users
or, via email, send a message with subject or body 'help' to
        [email protected]

You can reach the person managing the list at
        [email protected]

When replying, please edit your Subject line so it is more specific
than "Re: Contents of users digest..."


Today's Topics:

   1. Where do the factors for np_load_short come from?
      (Tim Landscheidt)
   2. Re: Where do the factors for np_load_short come   from? (Reuti)
   3. Re: Where do the factors for np_load_short come   from?
      (Tim Landscheidt)
   4. Re: Where do the factors for np_load_short come   from? (Reuti)

Hi.

'qstat -j id-of-pending-job' should telling you why it doesn't run yet.

Best regards.
Robi

_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users

Reply via email to