Re: [Ganglia-general] Two similar linux hosts provides different metrics
It seems like we have different numbers in gmond:

HOST NAME=5.5.5.5 IP=5.5.5.5 REPORTED=1168934873 TN=2 TMAX=20 DMAX=0 LOCATION=unspecified GMOND_STARTED=1166534354
..
METRIC NAME=proc_total VAL=185 TYPE=uint32 UNITS= TN=229 TMAX=950 DMAX=0 SLOPE=both SOURCE=gmond/
..
METRIC NAME=proc_run VAL=0 TYPE=uint32 UNITS= TN=229 TMAX=950 DMAX=0 SLOPE=both SOURCE=gmond/

HOST NAME=5.5.5.6 IP=5.5.5.6 REPORTED=1168934871 TN=3 TMAX=20 DMAX=0 LOCATION=unspecified GMOND_STARTED=1166534349
METRIC NAME=proc_run VAL=15 TYPE=uint32 UNITS= TN=68 TMAX=950 DMAX=0 SLOPE=both SOURCE=gmond/
METRIC NAME=proc_total VAL=439 TYPE=uint32 UNITS= TN=68 TMAX=950 DMAX=0 SLOPE=both SOURCE=gmond/

Thanks,
Vitaly

___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
Re: [Ganglia-general] Two similar linux hosts provides different metrics
Vitaly, in this case try running gmond with a debug level higher than 2. Maybe this sheds some light on it. Or you could add debug statements to the proc_run_func and proc_total_func code. But first of all, show us the output of cat /proc/loadavg on both nodes. Cheers Martin

-- Martin Knoblauch email: k n o b i AT knobisoft DOT de www: http://www.knobisoft.de
Re: [Ganglia-general] Two similar linux hosts provides different metrics
.5: cat /proc/loadavg
0.04 0.06 0.01 1/185 10512

.6: cat /proc/loadavg
1.03 1.01 1.00 2/441 19965

Oops! I think I'm starting to understand: the number of processes on both machines is the same, but the number of threads is different. Probably gmond counts threads, not processes:

.5: ps -ef|wc 64
    ps -efm|wc 187
.6: ps -ef|wc 62
    ps -efm|wc 441
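The process/thread comparison above can also be made directly from /proc. This is only a rough sketch: it assumes a kernel that exposes a /proc/<pid>/task directory per process, which may not be the case on the 2.4 kernel discussed in this thread, and the function name count_processes_and_threads is made up for illustration.

```python
import os


def count_processes_and_threads(proc_root="/proc"):
    """Return (processes, threads) as seen under proc_root.

    Assumption: each numeric directory is one process, and each entry
    in its task/ subdirectory is one thread of that process.
    """
    processes = 0
    threads = 0
    for entry in os.listdir(proc_root):
        if not entry.isdigit():
            continue  # skip non-PID entries such as "self" or "loadavg"
        processes += 1
        task_dir = os.path.join(proc_root, entry, "task")
        try:
            threads += len(os.listdir(task_dir))
        except OSError:
            # No task/ directory (older kernel) or the process just exited:
            # count the process itself as a single thread.
            threads += 1
    return processes, threads


if __name__ == "__main__":
    procs, thrs = count_processes_and_threads()
    print("processes=%d threads=%d" % (procs, thrs))
```

If the two numbers differ widely on one host and not on the other, that would match the ps -ef versus ps -efm difference shown above.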
Re: [Ganglia-general] Two similar linux hosts provides different metrics
Vitaly, gmond on Linux just interprets the fourth field of /proc/loadavg. The number in front of the slash is the number of running processes; the number following the slash is the total number of processes. Cheers Martin
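To make the fourth-field explanation concrete, here is a minimal sketch of splitting a /proc/loadavg line. It is not gmond's actual C code (proc_run_func and proc_total_func may adjust the values they report); the function name parse_loadavg and the dictionary keys are chosen for illustration only. Note that the Linux proc(5) manual page describes the two numbers as counts of kernel scheduling entities, so threads are included, which fits the numbers seen in this thread.

```python
def parse_loadavg(line):
    """Split one /proc/loadavg line into its parts.

    Layout: three load averages, then "running/total", then the most
    recently created PID.
    """
    fields = line.split()
    running, total = fields[3].split("/")
    return {
        "load_one": float(fields[0]),
        "load_five": float(fields[1]),
        "load_fifteen": float(fields[2]),
        "proc_run": int(running),    # number before the slash
        "proc_total": int(total),    # number after the slash
    }


if __name__ == "__main__":
    # The two lines posted in this thread.
    for host, line in ((".5", "0.04 0.06 0.01 1/185 10512"),
                       (".6", "1.03 1.01 1.00 2/441 19965")):
        parsed = parse_loadavg(line)
        print(host, parsed["proc_run"], parsed["proc_total"])
```

With the lines from this thread, the values after the slash (185 and 441) are close to the proc_total values gmond reported and to the ps -efm counts, not to the ps -ef counts.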
Re: [Ganglia-general] Two similar linux hosts provides different metrics
Martin, thanks a lot - now I understand that the different number of threads causes this difference. Vitaly
Re: [Ganglia-general] Two similar linux hosts provides different metrics
Hi Vitaly,

where do you see the invalid numbers:

a) in the gmond XML stream (telnet/nc to the gmond XML port)
b) in the XML stream from gmetad (telnet/nc to the gmetad XML port)
c) only in the web frontend

Cheers
Martin
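Checks a) and b) can be scripted instead of done by hand with telnet/nc. The sketch below assumes the usual default ports (8649 for gmond, 8651 for gmetad) and the usual HOST/METRIC element layout of the XML stream; adjust both if your configuration differs. The function names fetch_xml and metrics_by_host are made up for illustration.

```python
import socket
import xml.etree.ElementTree as ET


def fetch_xml(host, port=8649, timeout=5.0):
    """Read the whole XML dump that the daemon sends on connect."""
    chunks = []
    with socket.create_connection((host, port), timeout=timeout) as sock:
        while True:
            data = sock.recv(4096)
            if not data:
                break  # the daemon closes the connection when done
            chunks.append(data)
    return b"".join(chunks)


def metrics_by_host(xml_bytes, names=("proc_run", "proc_total")):
    """Return {host name: {metric name: value string}} for chosen metrics."""
    root = ET.fromstring(xml_bytes)
    result = {}
    for host in root.iter("HOST"):
        values = {}
        for metric in host.iter("METRIC"):
            if metric.get("NAME") in names:
                values[metric.get("NAME")] = metric.get("VAL")
        result[host.get("NAME")] = values
    return result


if __name__ == "__main__":
    # "localhost" is a placeholder; point it at a node running gmond.
    print(metrics_by_host(fetch_xml("localhost")))
```

Running it once against the gmond port and once against the gmetad port (port=8651) shows at which stage the numbers start to differ.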
[Ganglia-general] Two similar linux hosts provides different metrics
Hi, I have a weird problem: two Linux hosts with similar configurations report very different values for the number of running processes. One shows about 2, and the other about 20-40 (I mean the concentrated load graph at the top right). proc_total is different too: 171 vs. 350 (BTW, ps -ef |wc == 61 on both boxes). Both machines are RHEL3, kernel 2.4.21-37.ELsmp, with ganglia-gmond-3.0.3-1 installed from RPM. Any ideas? Thanks, Vitaly
Re: [Ganglia-general] Two similar linux hosts provides different metrics
Hi Vitaly, what does ps axl show on both hosts, as that is basically what gmond looks at? If it is already different there, the problem is not Ganglia-related. (OK, I see you already checked ...) What are the load averages according to uptime? Cheers Martin
Re: [Ganglia-general] Two similar linux hosts provides different metrics
NON-BUSY HOST:
# ps axl|wc
61 8625865
# uptime
08:54:55 up 204 days, 2:00, 1 user, load average: 0.00, 0.00, 0.00

BUSY HOST:
# ps axl|wc
62 8775977
# uptime
08:55:18 up 31 days, 16:30, 1 user, load average: 0.04, 0.01, 0.00