Re: [gridengine users] qstat strange statistic

2018-04-13 Thread William Hay
On Fri, Apr 13, 2018 at 01:54:14PM +0200, leconte j??r??me wrote:
> Hello,
>  I'm using SGE 8.1.9 under debian Stretch
> 
> ?? I have a strange problem.
> 
> ?? when I use qstat , sometime the stats displayed are wrong. Then, I
> believe that gridengine doesn't work properly.
> 
> I explain what I see:
> 
> On Master : qstat -f?? |grep para.q
> 
> par...@node10.example.org?? BIP 0/0/40 14.45 
> lx-amd64
> par...@node11.example.org?? BIP 0/1/40 57.78 
> lx-amd64
> par...@node12.example.org?? BIP 0/0/40 1.00 
> lx-amd64
> par...@node13.example.org?? BIP 0/100/40 57.81 
> lx-amd64
> par...@node14.example.org?? BIP 0/40/40?? 98.00 
> lx-amd64?? a
> par...@node15.example.org?? BIP 0/0/40 0.00 
> lx-amd64
> 
> 
> But if I'm connect a terminal on node13 and I type "top" the load avg is 0.0
> or something else that is not similar to 57.81
> 
> When I stop gridengine_execd?? and then restart it
> 
> On?? Master, qstat -f gives me the right value.
> 
> 
> I suppose there is a better way to do this , but I can't find it.
> 
> Have you some advices ?
Does your sched_conf set job_load_adjustments to anything interesting?

Is the environment variable SGE_LOAD_AVG set?

Also 0/100/40 is a tad odd as you are apparently using more slots than 
configured.

William


signature.asc
Description: PGP signature
___
users mailing list
users@gridengine.org
https://gridengine.org/mailman/listinfo/users


Re: [gridengine users] qstat strange statistic

2018-04-13 Thread Reuti
Hi,

> Am 13.04.2018 um 13:54 schrieb leconte jérôme 
> :
> 
> Hello,
>I'm using SGE 8.1.9 under debian Stretch
> 
>   I have a strange problem.
> 
>   when I use qstat , sometime the stats displayed are wrong. Then, I believe 
> that gridengine doesn't work properly.
> 
> I explain what I see:
> 
> On Master : qstat -f  |grep para.q
> 
> par...@node10.example.org  BIP   0/0/40 14.45 lx-amd64
> par...@node11.example.org  BIP   0/1/40 57.78 lx-amd64
> par...@node12.example.org  BIP   0/0/40 1.00 lx-amd64
> par...@node13.example.org  BIP   0/100/40   57.81 lx-amd64
> par...@node14.example.org  BIP   0/40/4098.00 lx-amd64  a
> par...@node15.example.org  BIP   0/0/40 0.00 lx-amd64
> 
> 
> But if I'm connect a terminal on node13 and I type "top" the load avg is 0.0 
> or something else that is not similar to 57.81
> 
> When I stop gridengine_execd  and then restart it
> 
> On  Master, qstat -f gives me the right value.
> 
> 
> I suppose there is a better way to do this , but I can't find it.

SGE uses the 5 minute value, i.e. the middle one which is shown after load avg. 
And the value is constantly low on the machine for all three times?

-- Reuti


___
users mailing list
users@gridengine.org
https://gridengine.org/mailman/listinfo/users