
Re: [Ganglia-general] Two similar linux hosts provides different metrics

2007-01-16 Thread Vitaly Karasik

It seems like we have different numbers in gmond:

<HOST NAME="5.5.5.5" IP="5.5.5.5" REPORTED="1168934873" TN="2" TMAX="20"
 DMAX="0" LOCATION="unspecified" GMOND_STARTED="1166534354">
..
<METRIC NAME="proc_total" VAL="185" TYPE="uint32" UNITS="" TN="229"
 TMAX="950" DMAX="0" SLOPE="both" SOURCE="gmond"/>
..
<METRIC NAME="proc_run" VAL="0" TYPE="uint32" UNITS="" TN="229"
 TMAX="950" DMAX="0" SLOPE="both" SOURCE="gmond"/>


<HOST NAME="5.5.5.6" IP="5.5.5.6" REPORTED="1168934871" TN="3" TMAX="20"
 DMAX="0" LOCATION="unspecified" GMOND_STARTED="1166534349">

<METRIC NAME="proc_run" VAL="15" TYPE="uint32" UNITS="" TN="68"
 TMAX="950" DMAX="0" SLOPE="both" SOURCE="gmond"/>

<METRIC NAME="proc_total" VAL="439" TYPE="uint32" UNITS="" TN="68"
 TMAX="950" DMAX="0" SLOPE="both" SOURCE="gmond"/>

Thanks,
Vitaly


Re: [Ganglia-general] Two similar linux hosts provides different metrics

2007-01-16 Thread Martin Knoblauch

Vitaly,

In this case, try running gmond with a debug level higher than 2.
Maybe that will shed some light on it.

Or you could add debug statements to the proc_run_func and
proc_total_func code.

But first of all, show us the output of cat /proc/loadavg on both
nodes.

cheers
Martin

--
Martin Knoblauch
email: k n o b i AT knobisoft DOT de
www:   http://www.knobisoft.de

Re: [Ganglia-general] Two similar linux hosts provides different metrics

2007-01-16 Thread Vitaly Karasik

.5:
 cat /proc/loadavg
0.04 0.06 0.01 1/185 10512

.6:  cat /proc/loadavg
1.03 1.01 1.00 2/441 19965 

Oops! I think I'm starting to understand: the number of processes on both
machines is the same, but the number of threads is different. gmond
probably counts threads, not processes:

.5: ps -ef|wc
 64
 ps -efm|wc
187

.6:
  ps -ef|wc
 62 
  ps -efm|wc
441   
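
The process-versus-thread gap seen above with ps -ef and ps -efm can also be
checked directly from /proc. The following is a minimal Python sketch, not
anything gmond itself does. It assumes a kernel whose /proc/<pid>/status
files carry a "Threads:" line, as current Linux kernels do; that may not hold
on the 2.4.21 kernel discussed in this thread. The function name
count_processes_and_threads is hypothetical.

```python
import os


def count_processes_and_threads(proc_root="/proc"):
    """Return (processes, threads) found under proc_root.

    A process is any all-digit directory. Its thread count is taken from
    the "Threads:" line of <pid>/status (assumed to exist; see lead-in).
    """
    processes = 0
    threads = 0
    for entry in os.listdir(proc_root):
        if not entry.isdigit():
            continue  # skip non-process entries such as "sys" or "loadavg"
        try:
            with open(os.path.join(proc_root, entry, "status")) as status:
                fields = dict(
                    line.split(":", 1) for line in status if ":" in line
                )
        except OSError:
            continue  # the process exited while we were scanning
        processes += 1
        # Fall back to 1 when the kernel does not report a Threads: line.
        threads += int(fields.get("Threads", "1").strip())
    return processes, threads


# Usage on a Linux host:
#   procs, thr = count_processes_and_threads()
#   print("processes:", procs, "threads:", thr)
```

If the two numbers differ widely on one host but not on the other, that would
match the pattern reported here: similar process counts, very different
thread counts.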



Re: [Ganglia-general] Two similar linux hosts provides different metrics

2007-01-16 Thread Martin Knoblauch

Vitaly,

gmond on Linux just interprets the fourth field of /proc/loadavg. The
number in front of the slash is the number of running processes, the
number following the slash is the total number of processes.

Cheers
Martin
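
The field layout described above can be illustrated with a short Python
sketch. This is only an illustration of how the fourth field splits; it is
not gmond's actual code, and the names LoadAvg and parse_loadavg are
hypothetical. The two sample lines are the ones posted in this thread.

```python
from typing import NamedTuple


class LoadAvg(NamedTuple):
    load1: float
    load5: float
    load15: float
    running: int   # number in front of the slash
    total: int     # number following the slash
    last_pid: int


def parse_loadavg(line: str) -> LoadAvg:
    """Split one /proc/loadavg line into its five whitespace-separated fields."""
    load1, load5, load15, entities, last_pid = line.split()
    running, total = entities.split("/")
    return LoadAvg(float(load1), float(load5), float(load15),
                   int(running), int(total), int(last_pid))


# Lines posted in this thread for hosts .5 and .6
for host, line in ((".5", "0.04 0.06 0.01 1/185 10512"),
                   (".6", "1.03 1.01 1.00 2/441 19965")):
    parsed = parse_loadavg(line)
    print(host, "running:", parsed.running, "total:", parsed.total)
```

For host .5 the total after the slash (185) equals the proc_total value
reported by gmond in the XML posted earlier.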
 

Re: [Ganglia-general] Two similar linux hosts provides different metrics

2007-01-16 Thread Vitaly Karasik

Martin,
Thanks a lot - now I understand that the different number of threads
causes this difference.

Vitaly 


Re: [Ganglia-general] Two similar linux hosts provides different metrics

2007-01-15 Thread Martin Knoblauch

Hi Vitaly,

Where do you see the invalid numbers?

a) in the gmond XML Stream (telnet/nc to the gmond XML port)
b) in the XML Stream from gmetad (telnet/nc to the gmetad XML port)
c) only in the web-frontend

Cheers
Martin
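
Checks (a) and (b) above can be scripted instead of using telnet or nc. The
following is a minimal Python sketch under stated assumptions: the port
numbers are assumptions (8649 is commonly the gmond default and 8651 the
gmetad default, but check your own configuration), and the function names
fetch_xml and proc_metrics are hypothetical.

```python
import socket
import xml.etree.ElementTree as ET


def fetch_xml(host, port=8649, timeout=5.0):
    """Read everything the daemon sends until it closes the connection."""
    chunks = []
    with socket.create_connection((host, port), timeout=timeout) as conn:
        while True:
            data = conn.recv(65536)
            if not data:
                break
            chunks.append(data)
    return b"".join(chunks)


def proc_metrics(xml_bytes, names=("proc_run", "proc_total")):
    """Map each HOST name to the requested integer metrics."""
    root = ET.fromstring(xml_bytes)
    result = {}
    for host in root.iter("HOST"):
        result[host.get("NAME")] = {
            metric.get("NAME"): int(metric.get("VAL"))
            for metric in host.iter("METRIC")
            if metric.get("NAME") in names
        }
    return result


# Example (needs a reachable daemon; "localhost" is a placeholder):
#   print(proc_metrics(fetch_xml("localhost")))          # gmond
#   print(proc_metrics(fetch_xml("localhost", 8651)))    # gmetad
```

Comparing the gmond and gmetad results for the same host shows whether the
numbers already differ at the source or only further down the chain.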

[Ganglia-general] Two similar linux hosts provides different metrics

2007-01-11 Thread Vitaly Karasik

  Hi,

I have a weird problem: two Linux hosts with similar configurations
report very different numbers of running processes. One shows about 2
and the other about 20-40 (I am talking about the concentrated load
graph at the top right).
proc_total differs too: 171 vs. 350 (BTW, ps -ef | wc reports 61 on
both boxes).

Both machines are RHEL3 kernel 2.4.21-37.ELsmp with
ganglia-gmond-3.0.3-1 installed from RPM.

Any ideas?
Thanks,
Vitaly

Re: [Ganglia-general] Two similar linux hosts provides different metrics

2007-01-11 Thread Martin Knoblauch

Hi Vitaly,

What does ps axl show on both hosts, as that is basically what gmond
looks at? If it is already different there, the problem is not
Ganglia-related. (OK, I see you already checked ...)

What are the load averages according to uptime?

Cheers
Martin



Re: [Ganglia-general] Two similar linux hosts provides different metrics

2007-01-11 Thread Vitaly Karasik

NON-BUSY HOST:
# ps axl|wc
     61     862    5865
# uptime
 08:54:55  up 204 days,  2:00,  1 user,  load average: 0.00, 0.00, 0.00

BUSY HOST:
# ps axl|wc
     62     877    5977
# uptime
 08:55:18  up 31 days, 16:30,  1 user,  load average: 0.04, 0.01, 0.00
 

