.5: cat /proc/loadavg 0.04 0.06 0.01 1/185 10512 .6: cat /proc/loadavg 1.03 1.01 1.00 2/441 19965
Oops! I think I'm starting to understand - number of processes on both machines are the same, but number the threads are different. probably gmond counts threads, not processes: .5: ps -ef|wc 64 ps -efm|wc 187 .6: ps -ef|wc 62 ps -efm|wc 441 > -----Original Message----- > From: Martin Knoblauch [mailto:[EMAIL PROTECTED] > Sent: Tuesday, January 16, 2007 11:59 AM > To: Vitaly Karasik; [EMAIL PROTECTED]; > ganglia-general@lists.sourceforge.net > Subject: RE: [Ganglia-general] Two similar linux hosts > provides different metrics > > Vitaly, > > in this case try to run "gmond" with a debug level higher that 2. > Maybe this sheds some light on it. > > Or, you could add debug statements to the proc_run_func and > proc_total_func code. > > But: first of all show us the output of "cat /proc/loadavg" > on both nodes. > > cheers > Martin > --- Vitaly Karasik <[EMAIL PROTECTED]> wrote: > > > It seems like we have different numbers in gmond: > > > > <HOST NAME="5.5.5.5" IP="5.5.5.5" REPORTED="1168934873" TN="2" > > TMAX="20" > > DMAX="0" LOCATION="unspecified" GMOND_STARTED="1166534354"> ...... > > <METRIC NAME="proc_total" VAL="185" TYPE="uint32" UNITS="" TN="229" > > TMAX="950" DMAX="0" SLOPE="both" SOURCE="gmond"/> .............. > > <METRIC NAME="proc_run" VAL="0" TYPE="uint32" UNITS="" TN="229" > > TMAX="950" DMAX="0" SLOPE="both" SOURCE="gmond"/> > > > > > > <HOST NAME="5.5.5.6" IP="5.5.5.6" REPORTED="1168934871" TN="3" > > TMAX="20" > > DMAX="0" LOCATION="unspecified" GMOND_STARTED="1166534349"> .... > > <METRIC NAME="proc_run" VAL="15" TYPE="uint32" UNITS="" TN="68" > > TMAX="950" DMAX="0" SLOPE="both" SOURCE="gmond"/> .... > > <METRIC NAME="proc_total" VAL="439" TYPE="uint32" UNITS="" TN="68" > > TMAX="950" DMAX="0" SLOPE="both" SOURCE="gmond"/> > > > > Thanks, > > Vitaly > > > > > -----Original Message----- > > > From: Martin Knoblauch [mailto:[EMAIL PROTECTED] > > > Sent: Monday, January 15, 2007 12:30 PM > > > To: Vitaly Karasik; ganglia-general@lists.sourceforge.net > > > Subject: RE: [Ganglia-general] Two similar linux hosts provides > > > different metrics > > > > > > Hi Vitaly, > > > > > > where do you see the invalid numbers: > > > > > > a) in the gmond XML Stream (telnet/nc to the gmond XML port) > > > b) in the XML Stream from gmetad (telnet/nc to the gmetad > XML port) > > > c) only in the web-frontend > > > > > > Cheers > > > Martin > > > --- Vitaly Karasik <[EMAIL PROTECTED]> wrote: > > > > > > > "NON-BUSY" HOST: > > > > # ps axl|wc > > > > 61 862 5865 > > > > # uptime > > > > 08:54:55 up 204 days, 2:00, 1 user, load average: 0.00, > > 0.00, > > > > 0.00 > > > > > > > > "BUSY" HOST > > > > ]# ps axl|wc > > > > 62 877 5977 > > > > ]# uptime > > > > 08:55:18 up 31 days, 16:30, 1 user, load average: 0.04, > > > 0.01, 0.00 > > > > > > > > > > > > > -----Original Message----- > > > > > From: Martin Knoblauch [mailto:[EMAIL PROTECTED] > > > > > Sent: Thursday, January 11, 2007 10:54 AM > > > > > To: Vitaly Karasik; ganglia-general@lists.sourceforge.net > > > > > Subject: Re: [Ganglia-general] Two similar linux > hosts provides > > > > > different metrics > > > > > > > > > > Hi Vitaly, > > > > > > > > > > what does "ps axl" show on both hosts, as that is basically > > > > > what gmond looks at? If it is already different there, the > > > > > "problem" is not ganglia related. (OK, I see you already > > checked > > > > ...) > > > > > > > > > > What are the load averages according to "uptime"? > > > > > > > > > > Cheers > > > > > Martin > > > > > > > > > > > > > > > --- Vitaly Karasik <[EMAIL PROTECTED]> wrote: > > > > > > > > > > > Hi, > > > > > > > > > > > > I have a weird problem - two linux hosts with similar > > > > configuration > > > > > > provide very different metrics about number of running > > > processes > > > > - > > > > > > one shows about 2, and second about 20-40 (I speak about > > > > > concentrated > > > > > > "load" > > > > > > graph at top right.) > > > > > > proc_total is different too - 171 vs. 350 (BTW, ps -ef |wc > > > > > == 61 on > > > > > > both boxes) > > > > > > > > > > > > Both machines are RHEL3 kernel 2.4.21-37.ELsmp with > > > > > > ganglia-gmond-3.0.3-1 installed from RPM. > > > > > > > > > > > > Any ideas? > > > > > > Thanks, > > > > > > Vitaly > > > > > > > > > > > > > > > > > > > > > > > > > > > > > -------------------------------------------------------------- > > > > > ----------- > > > > > > Take Surveys. Earn Cash. Influence the Future of IT Join > > > > > > SourceForge.net's Techsay panel and you'll get the chance to > > > > share > > > > > > your opinions on IT & business topics through brief surveys > > > > > - and earn > > > > > > cash > > > > > > > > > > > http://www.techsay.com/default.php?page=join.php&p=sourceforge > > > > > &CID=DEVDEV > > > > > > _______________________________________________ > > > > > > Ganglia-general mailing list > > > > > > Ganglia-general@lists.sourceforge.net > > > > > > https://lists.sourceforge.net/lists/listinfo/ganglia-general > > > > > > > > > > > > > > > > > > > > > > > > > > > ------------------------------------------------------ > > > > > Martin Knoblauch > > > > > email: k n o b i AT knobisoft DOT de > > > > > www: http://www.knobisoft.de > > > > > > > > > > > > > > > > > > > > > > ------------------------------------------------------ > > > Martin Knoblauch > > > email: k n o b i AT knobisoft DOT de > > > www: http://www.knobisoft.de > > > > > > > > > > ------------------------------------------------------ > Martin Knoblauch > email: k n o b i AT knobisoft DOT de > www: http://www.knobisoft.de >