Martin,
Thanks a lot - now I understand that the different number of threads
causes this difference.

Vitaly 
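
For anyone finding this thread later, here is a minimal sketch of the
point Martin makes below about the fourth field of /proc/loadavg. It is
not gmond's actual code; the function name parse_loadavg and the dict
keys are made up for illustration, and the sample lines are the ones
quoted further down. As far as I understand proc(5), the number after
the slash counts kernel scheduling entities, which includes threads,
and that would match the "ps -efm" counts.

```python
def parse_loadavg(line):
    """Split one /proc/loadavg line into named fields (hypothetical helper)."""
    load1, load5, load15, entities, last_pid = line.split()
    # The fourth field has the form "running/total".
    running, total = entities.split("/")
    return {
        "load1": float(load1),
        "load5": float(load5),
        "load15": float(load15),
        "running": int(running),
        "total": int(total),
        "last_pid": int(last_pid),
    }


# Sample lines copied from the two hosts in this thread.
samples = {
    ".5": "0.04 0.06 0.01 1/185 10512",
    ".6": "1.03 1.01 1.00 2/441 19965",
}

for host, line in samples.items():
    info = parse_loadavg(line)
    print(host, "running:", info["running"], "total:", info["total"])
```

On a live Linux box the same function can be fed the contents of
/proc/loadavg instead of the sample strings.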

> -----Original Message-----
> From: Martin Knoblauch [mailto:[EMAIL PROTECTED] 
> Sent: Tuesday, January 16, 2007 12:29 PM
> To: Vitaly Karasik; [email protected]
> Subject: RE: [Ganglia-general] Two similar linux hosts 
> provides different metrics
> 
> Vitaly,
> 
>  gmond on Linux just interprets the fourth field of 
> /proc/loadavg. The number in front of the slash is the number 
> of "running processes", the number following the slash is the 
> "total number of processes".
> 
> Cheers
> Martin
>  
> --- Vitaly Karasik <[EMAIL PROTECTED]> wrote:
> 
> > .5:
> >  cat /proc/loadavg
> > 0.04 0.06 0.01 1/185 10512
> > 
> > .6:  cat /proc/loadavg
> > 1.03 1.01 1.00 2/441 19965
> > 
> > Oops! I think I'm starting to understand - the number of processes on
> > both machines is the same, but the number of threads is different.
> > Probably gmond counts threads, not processes:
> > 
> > .5: ps -ef|wc
> >      64    
> >  ps -efm|wc
> >     187    
> > 
> > .6:
> >   ps -ef|wc
> >      62     
> >   ps -efm|wc
> >     441   
> > 
> > 
> > > -----Original Message-----
> > > From: Martin Knoblauch [mailto:[EMAIL PROTECTED]
> > > Sent: Tuesday, January 16, 2007 11:59 AM
> > > To: Vitaly Karasik; [EMAIL PROTECTED]; 
> > > [email protected]
> > > Subject: RE: [Ganglia-general] Two similar linux hosts provides 
> > > different metrics
> > > 
> > > Vitaly,
> > > 
> > > >  in this case try to run "gmond" with a debug level higher than 2.
> > > > Maybe this sheds some light on it.
> > > 
> > >  Or, you could add debug statements to the proc_run_func and 
> > > proc_total_func code.
> > > 
> > >  But: first of all show us the output of "cat /proc/loadavg" 
> > > on both nodes.
> > > 
> > > cheers
> > > Martin
> > > --- Vitaly Karasik <[EMAIL PROTECTED]> wrote:
> > > 
> > > > It seems like we have different numbers in gmond:
> > > > 
> > > > <HOST NAME="5.5.5.5" IP="5.5.5.5" REPORTED="1168934873" TN="2"
> > > > TMAX="20"
> > > > DMAX="0" LOCATION="unspecified" GMOND_STARTED="1166534354">
> > > > ......
> > > > <METRIC NAME="proc_total" VAL="185" TYPE="uint32" UNITS="" TN="229"
> > > > TMAX="950" DMAX="0" SLOPE="both" SOURCE="gmond"/>
> > > > ..............
> > > > <METRIC NAME="proc_run" VAL="0" TYPE="uint32" UNITS="" TN="229"
> > > > TMAX="950" DMAX="0" SLOPE="both" SOURCE="gmond"/>
> > > > 
> > > > 
> > > > <HOST NAME="5.5.5.6" IP="5.5.5.6" REPORTED="1168934871" TN="3"
> > > > TMAX="20"
> > > > DMAX="0" LOCATION="unspecified" GMOND_STARTED="1166534349"> ....
> > > > <METRIC NAME="proc_run" VAL="15" TYPE="uint32" UNITS="" TN="68"
> > > > TMAX="950" DMAX="0" SLOPE="both" SOURCE="gmond"/> ....
> > > > <METRIC NAME="proc_total" VAL="439" TYPE="uint32" UNITS="" TN="68"
> > > > TMAX="950" DMAX="0" SLOPE="both" SOURCE="gmond"/>
> > > > 
> > > > Thanks,
> > > > Vitaly
> > > > 
> > > > > -----Original Message-----
> > > > > From: Martin Knoblauch [mailto:[EMAIL PROTECTED]
> > > > > Sent: Monday, January 15, 2007 12:30 PM
> > > > > To: Vitaly Karasik; [email protected]
> > > > > > Subject: RE: [Ganglia-general] Two similar linux hosts provides
> > > > > > different metrics
> > > > > 
> > > > > Hi Vitaly,
> > > > > 
> > > > >  where do you see the invalid numbers:
> > > > > 
> > > > > a) in the gmond XML Stream (telnet/nc to the gmond XML port)
> > > > > > b) in the XML Stream from gmetad (telnet/nc to the gmetad XML port)
> > > > > c) only in the web-frontend
> > > > > 
> > > > > Cheers
> > > > > Martin
> > > > > --- Vitaly Karasik <[EMAIL PROTECTED]> wrote:
> > > > > 
> > > > > > "NON-BUSY" HOST:
> > > > > > # ps axl|wc
> > > > > >      61     862    5865
> > > > > > # uptime
> > > > > >  08:54:55  up 204 days,  2:00,  1 user,  load average: 0.00,
> > > > 0.00,
> > > > > > 0.00
> > > > > > 
> > > > > > "BUSY" HOST
> > > > > >  ]# ps axl|wc
> > > > > >      62     877    5977
> > > > > >  ]# uptime
> > > > > >  08:55:18  up 31 days, 16:30,  1 user,  load average: 0.04,
> > > > > 0.01, 0.00
> > > > > >  
> > > > > > 
> > > > > > > -----Original Message-----
> > > > > > > From: Martin Knoblauch [mailto:[EMAIL PROTECTED]
> > > > > > > Sent: Thursday, January 11, 2007 10:54 AM
> > > > > > > To: Vitaly Karasik; [email protected]
> > > > > > > > Subject: Re: [Ganglia-general] Two similar linux hosts provides
> > > > > > > > different metrics
> > > > > > > 
> > > > > > > Hi Vitaly,
> > > > > > > 
> > > > > > > >  what does "ps axl" show on both hosts, as that is basically
> > > > > > > > what gmond looks at? If it is already different there, the
> > > > > > > > "problem" is not ganglia related. (OK, I see you already checked
> > > > > > > > ...)
> > > > > > > 
> > > > > > >  What are the load averages according to "uptime"?
> > > > > > > 
> > > > > > > Cheers
> > > > > > > Martin
> > > > > > > 
> > > > > > > 
> > > > > > > --- Vitaly Karasik <[EMAIL PROTECTED]> wrote:
> > > > > > > 
> > > > > > > >   Hi,
> > > > > > > > 
> > > > > > > > > I have a weird problem - two linux hosts with similar configuration
> > > > > > > > > provide very different metrics for the number of running processes -
> > > > > > > > > one shows about 2, and the second about 20-40 (I am talking about the
> > > > > > > > > concentrated "load" graph at the top right.)
> > > > > > > > > proc_total is different too - 171 vs. 350 (BTW, ps -ef |wc == 61 on
> > > > > > > > > both boxes)
> > > > > > > > 
> > > > > > > > Both machines are RHEL3 kernel 2.4.21-37.ELsmp with
> > > > > > > > ganglia-gmond-3.0.3-1 installed from RPM.
> > > > > > > > 
> > > > > > > > Any ideas?
> > > > > > > > Thanks,
> > > > > > > > Vitaly
> > > > > > > > 
> > > > > > > >  
> > > > > > > > 
> > > > > > > >
> > > > > > >
> > --------------------------------------------------------------
> > > > > > > -----------
> > > > > > > > Take Surveys. Earn Cash. Influence the Future 
> of IT Join 
> > > > > > > > SourceForge.net's Techsay panel and you'll get 
> the chance
> > to
> > > > > > share
> > > > > > > > your opinions on IT & business topics through brief
> > surveys
> > > > > > > - and earn
> > > > > > > > cash
> > > > > > > >
> > > > > > >
> > http://www.techsay.com/default.php?page=join.php&p=sourceforge
> > > > > > > &CID=DEVDEV
> > > > > > > > _______________________________________________
> > > > > > > > Ganglia-general mailing list 
> > > > > > > > [email protected]
> > > > > > > >
> > https://lists.sourceforge.net/lists/listinfo/ganglia-general
> > > > > > > > 
> > > > > > > > 
> > > > > > > 
> > > > > > > 
> > > > > > > ------------------------------------------------------
> > > > > > > Martin Knoblauch
> > > > > > > email: k n o b i AT knobisoft DOT de
> > > > > > > www:   http://www.knobisoft.de
