.5:
 cat /proc/loadavg
0.04 0.06 0.01 1/185 10512

.6:  cat /proc/loadavg
1.03 1.01 1.00 2/441 19965 

Oops! I think I'm starting to  understand - number of processes on both
machines are the same, but number the threads are different. probably
gmond counts threads, not processes:

.5: ps -ef|wc
     64    
 ps -efm|wc
    187    

.6:
  ps -ef|wc
     62     
  ps -efm|wc
    441   


> -----Original Message-----
> From: Martin Knoblauch [mailto:[EMAIL PROTECTED] 
> Sent: Tuesday, January 16, 2007 11:59 AM
> To: Vitaly Karasik; [EMAIL PROTECTED]; 
> ganglia-general@lists.sourceforge.net
> Subject: RE: [Ganglia-general] Two similar linux hosts 
> provides different metrics
> 
> Vitaly,
> 
>  in this case try to run "gmond" with a debug level higher that 2.
> Maybe this sheds some light on it.
> 
>  Or, you could add debug statements to the proc_run_func and 
> proc_total_func code.
> 
>  But: first of all show us the output of "cat /proc/loadavg" 
> on both nodes.
> 
> cheers
> Martin
> --- Vitaly Karasik <[EMAIL PROTECTED]> wrote:
> 
> > It seems like we have different numbers in gmond:
> > 
> > <HOST NAME="5.5.5.5" IP="5.5.5.5" REPORTED="1168934873" TN="2"
> > TMAX="20"
> > DMAX="0" LOCATION="unspecified" GMOND_STARTED="1166534354"> ......
> > <METRIC NAME="proc_total" VAL="185" TYPE="uint32" UNITS="" TN="229"
> > TMAX="950" DMAX="0" SLOPE="both" SOURCE="gmond"/> ..............
> > <METRIC NAME="proc_run" VAL="0" TYPE="uint32" UNITS="" TN="229"
> > TMAX="950" DMAX="0" SLOPE="both" SOURCE="gmond"/>
> > 
> > 
> > <HOST NAME="5.5.5.6" IP="5.5.5.6" REPORTED="1168934871" TN="3"
> > TMAX="20"
> > DMAX="0" LOCATION="unspecified" GMOND_STARTED="1166534349"> ....
> > <METRIC NAME="proc_run" VAL="15" TYPE="uint32" UNITS="" TN="68"
> > TMAX="950" DMAX="0" SLOPE="both" SOURCE="gmond"/> ....
> > <METRIC NAME="proc_total" VAL="439" TYPE="uint32" UNITS="" TN="68"
> > TMAX="950" DMAX="0" SLOPE="both" SOURCE="gmond"/>
> > 
> > Thanks,
> > Vitaly
> > 
> > > -----Original Message-----
> > > From: Martin Knoblauch [mailto:[EMAIL PROTECTED]
> > > Sent: Monday, January 15, 2007 12:30 PM
> > > To: Vitaly Karasik; ganglia-general@lists.sourceforge.net
> > > Subject: RE: [Ganglia-general] Two similar linux hosts provides 
> > > different metrics
> > > 
> > > Hi Vitaly,
> > > 
> > >  where do you see the invalid numbers:
> > > 
> > > a) in the gmond XML Stream (telnet/nc to the gmond XML port)
> > > b) in the XML Stream from gmetad (telnet/nc to the gmetad 
> XML port)
> > > c) only in the web-frontend
> > > 
> > > Cheers
> > > Martin
> > > --- Vitaly Karasik <[EMAIL PROTECTED]> wrote:
> > > 
> > > > "NON-BUSY" HOST:
> > > > # ps axl|wc
> > > >      61     862    5865
> > > > # uptime
> > > >  08:54:55  up 204 days,  2:00,  1 user,  load average: 0.00,
> > 0.00,
> > > > 0.00
> > > > 
> > > > "BUSY" HOST
> > > >  ]# ps axl|wc
> > > >      62     877    5977
> > > >  ]# uptime
> > > >  08:55:18  up 31 days, 16:30,  1 user,  load average: 0.04,
> > > 0.01, 0.00
> > > >  
> > > > 
> > > > > -----Original Message-----
> > > > > From: Martin Knoblauch [mailto:[EMAIL PROTECTED]
> > > > > Sent: Thursday, January 11, 2007 10:54 AM
> > > > > To: Vitaly Karasik; ganglia-general@lists.sourceforge.net
> > > > > Subject: Re: [Ganglia-general] Two similar linux 
> hosts provides 
> > > > > different metrics
> > > > > 
> > > > > Hi Vitaly,
> > > > > 
> > > > >  what does "ps axl" show on both hosts, as that is basically 
> > > > > what gmond looks at? If it is already different there, the 
> > > > > "problem" is not ganglia related. (OK, I see you already
> > checked
> > > > ...)
> > > > > 
> > > > >  What are the load averages according to "uptime"?
> > > > > 
> > > > > Cheers
> > > > > Martin
> > > > > 
> > > > > 
> > > > > --- Vitaly Karasik <[EMAIL PROTECTED]> wrote:
> > > > > 
> > > > > >   Hi,
> > > > > > 
> > > > > > I have a weird problem - two linux hosts with similar
> > > > configuration
> > > > > > provide very different metrics about  number of running
> > > processes
> > > > -
> > > > > > one shows about 2, and second about 20-40 (I speak about
> > > > > concentrated
> > > > > > "load"
> > > > > > graph at top right.)
> > > > > > proc_total is different too - 171 vs. 350 (BTW,  ps -ef |wc
> > > > > == 61 on
> > > > > > both boxes)
> > > > > > 
> > > > > > Both machines are RHEL3 kernel 2.4.21-37.ELsmp with
> > > > > > ganglia-gmond-3.0.3-1 installed from RPM.
> > > > > > 
> > > > > > Any ideas?
> > > > > > Thanks,
> > > > > > Vitaly
> > > > > > 
> > > > > >  
> > > > > > 
> > > > > >
> > > > > --------------------------------------------------------------
> > > > > -----------
> > > > > > Take Surveys. Earn Cash. Influence the Future of IT Join 
> > > > > > SourceForge.net's Techsay panel and you'll get the chance to
> > > > share
> > > > > > your opinions on IT & business topics through brief surveys
> > > > > - and earn
> > > > > > cash
> > > > > >
> > > > > http://www.techsay.com/default.php?page=join.php&p=sourceforge
> > > > > &CID=DEVDEV
> > > > > > _______________________________________________
> > > > > > Ganglia-general mailing list
> > > > > > Ganglia-general@lists.sourceforge.net
> > > > > > https://lists.sourceforge.net/lists/listinfo/ganglia-general
> > > > > > 
> > > > > > 
> > > > > 
> > > > > 
> > > > > ------------------------------------------------------
> > > > > Martin Knoblauch
> > > > > email: k n o b i AT knobisoft DOT de
> > > > > www:   http://www.knobisoft.de
> > > > > 
> > > > 
> > > > 
> > > 
> > > 
> > > ------------------------------------------------------
> > > Martin Knoblauch
> > > email: k n o b i AT knobisoft DOT de
> > > www:   http://www.knobisoft.de
> > > 
> > 
> > 
> 
> 
> ------------------------------------------------------
> Martin Knoblauch
> email: k n o b i AT knobisoft DOT de
> www:   http://www.knobisoft.de
> 

Reply via email to