Re: [Ganglia-general] Ganglia Installation Issues

2010-12-06 Thread Mike
Hi Antonio,

  Thanks much for your response. I now ran /usr/sbin/update-rc.d -f gmond 
defaults and
/usr/sbin/update-rc.d -f gmetad defaults, which initially gave me error: 
update-rc.d: /etc/init.d/gmond: file does not exist. The init script was in 
/etc/rc.d/init.d/. Then I copied it to /etc/init.d/gmond. Now the update-rc.d 
went fine .

I cannot view the web interface when I point to http://ip_address/ganglia/
and I get The server at ip_address is taking too long to respond.

Here are some relevant information:

A)  I start gmond by /usr/sbin/gmond when I do a telnet EC2Ip_address 8649,
     it gives
HOST NAME=EC2Ip_address IP=10.251.86.192 REPORTED=1291666741 TN=30 
TMAX=20 DMAX=0 LOCATION=EC2Ip_address GMOND_STARTED=129121

B)  /usr/sbin/gmond -d 10 gives me this: 

Got a heartbeat message 1291666768

    metric 'cpu_user' being collected now
    metric 'cpu_user' has value_threshold 1.00
    metric 'cpu_system' being collected now
    metric 'cpu_system' has value_threshold 1.00
    metric 'cpu_idle' being collected now
    metric 'cpu_idle' has value_threshold 5.00
    metric 'cpu_nice' being collected now
    metric 'cpu_nice' has value_threshold 1.00
    metric 'cpu_aidle' being collected now
    metric 'cpu_aidle' has value_threshold 5.00
    metric 'cpu_wio' being collected now
    metric 'cpu_wio' has value_threshold 1.00
    metric 'load_one' being collected now
    metric 'load_one' has value_threshold 1.00
    metric 'load_five' being collected now
    metric 'load_five' has value_threshold 1.00
    metric 'load_fifteen' being collected now
    metric 'load_fifteen' has value_threshold 1.00
    sent message 'heartbeat' of length 56 with 0 errors
Processing a metric value message from EC2_IP
Got a heartbeat message 1291667489
and goes on

C)  /usr/sbin/gmetad -d 10 gives me this

Going to run as user nobody
Sources are ...
Source: [MyCluster, step 15] has 1 sources
    10.251.86.192
xml listening on port 8651
interactive xml listening on port 8652
cleanup thread has been started
Data thread 1147169104 is monitoring [MyCluster] data source
    10.251.86.192
[MyCluster] is a 2.5 or later data stream
hash_create size = 1024
hash-size is 1031
hash_create size = 50
hash-size is 53
hash_create size = 50
hash-size is 53
[MyCluster] is a 2.5 or later data stream
[MyCluster] is a 2.5 or later data stream
[MyCluster] is a 2.5 or later data stream
[MyCluster] is a 2.5 or later data stream
[MyCluster] is a 2.5 or later data stream
[MyCluster] is a 2.5 or later data stream
 
 ...etc

D)   Here are the relevant parts in my /etc/ganglia/gmond.conf

cluster {
  name = MyCluster
  owner = myclusterowner
  latlong = unspecified
  url = unspecified
}
host {
  location = IP_of_EC2
}

udp_send_channel {
  mcast_join = IP_of_EC2
  port = 8666
  ttl = 1
}
udp_recv_channel {
    port = 8666
    family = inet4
}

And gmetad.conf has

data_source MyCluster ipaddress:8649

Any help on this would be highly appreciated!.

Thanks,
Mike



--- On Sun, 12/5/10, Antonio Óscar Balmaseda antonio.o.balmas...@gmail.com 
wrote:

From: Antonio Óscar Balmaseda antonio.o.balmas...@gmail.com
Subject: Re: [Ganglia-general] Ganglia Installation Issues
To: Mike nano_kol...@yahoo.com
Cc: Ganglia ganglia-general@lists.sourceforge.net
Date: Sunday, December 5, 2010, 9:57 AM


Hey, Mike,

2010/12/5 Mike nano_kol...@yahoo.com


Hi all,

   I am trying to get Ganglia run on Ubuntu instance. I built the version 3.1.7 
from source. I compiled the source and libs were installed in 
/etc/ganglia/lib64/ganglia/


I used the command: ./configure --prefix=/etc/ganglia --with-gmetad 
--sysconfdir=/etc/ganglia  make  make install , and everything went fine.
Ichanged the GMOND in gmond/gmond.init to GMOND=/etc/ganglia/sbin/gmond, and 
changed GMETAD in gmetad/gmetad.init to GMETAD=/etc/ganglia/sbin/gmetad



1. I have copied the gmond/gmond.init from the build directory to 
/etc/rc.d/init.d/gmond and when I start gmond using the command
/etc/rc.d/init.d/gmond start
I get the following error.
.: 9: Can't open /etc/rc.d/init.d/functions


Also I copied gmetad/gmetad.init to /etc/rc.d/init.d/gmetad and starting it 
also fails with the above error.
 What is expected for /etc/rc.d/init.d/functions ? by these scripts.

When I try something like gmond -d 1 to start the gmond in foreground it 
gives a message that :
    [PYTHON] Can't open the python module path 
/etc/ganglia/lib64/ganglia/python_modules. Module python_module failed to 
initialize.




In this case, you have to put gmond and gmetad in the startup applications. You 
can do this with:

$ sudo updatedb-rc.d -f gmond defaults
$ sudo updatedb-rc.d -f gmetad defaults



The other thing can be solved creating this directory or comment the line that 
searches it in /etc/ganglia/gmond.conf. I'm not sure if it's necessary but the 
owner of this folder is 'nobody' in my 

Re: [Ganglia-general] Ganglia Installation Issues

2010-12-06 Thread Antonio Óscar Balmaseda
2010/12/6 Mike nano_kol...@yahoo.com

 Hi Antonio,

   Thanks much for your response. I now ran /usr/sbin/update-rc.d -f gmond
 defaults and
 /usr/sbin/update-rc.d -f gmetad defaults, which initially gave me error:
 update-rc.d: /etc/init.d/gmond: file does not exist. The init script was in
 /etc/rc.d/init.d/. Then I copied it to /etc/init.d/gmond. Now the
 update-rc.d went fine .

 I cannot view the web interface when I point to http://ip_address/ganglia/
 and I get The server at ip_address is taking too long to respond.

 Here are some relevant information:

 A)  I start gmond by /usr/sbin/gmond when I do a telnet EC2Ip_address8649,
  it gives
 HOST NAME=EC2Ip_address IP=10.251.86.192 REPORTED=1291666741 TN=30
 TMAX=20 DMAX=0 LOCATION=EC2Ip_address GMOND_STARTED=129121

 B)  /usr/sbin/gmond -d 10 gives me this:

 Got a heartbeat message 1291666768

 metric 'cpu_user' being collected now
 metric 'cpu_user' has value_threshold 1.00
 metric 'cpu_system' being collected now
 metric 'cpu_system' has value_threshold 1.00
 metric 'cpu_idle' being collected now
 metric 'cpu_idle' has value_threshold 5.00
 metric 'cpu_nice' being collected now
 metric 'cpu_nice' has value_threshold 1.00
 metric 'cpu_aidle' being collected now
 metric 'cpu_aidle' has value_threshold 5.00
 metric 'cpu_wio' being collected now
 metric 'cpu_wio' has value_threshold 1.00
 metric 'load_one' being collected now
 metric 'load_one' has value_threshold 1.00
 metric 'load_five' being collected now
 metric 'load_five' has value_threshold 1.00
 metric 'load_fifteen' being collected now
 metric 'load_fifteen' has value_threshold 1.00
 sent message 'heartbeat' of length 56 with 0 errors
 Processing a metric value message from EC2_IP
 Got a heartbeat message 1291667489
 and goes on

 C)  /usr/sbin/gmetad -d 10 gives me this

 Going to run as user nobody
 Sources are ...
 Source: [MyCluster, step 15] has 1 sources
 10.251.86.192
 xml listening on port 8651
 interactive xml listening on port 8652
 cleanup thread has been started
 Data thread 1147169104 is monitoring [MyCluster] data source
 10.251.86.192
 [MyCluster] is a 2.5 or later data stream
 hash_create size = 1024
 hash-size is 1031
 hash_create size = 50
 hash-size is 53
 hash_create size = 50
 hash-size is 53
 [MyCluster] is a 2.5 or later data stream
 [MyCluster] is a 2.5 or later data stream
 [MyCluster] is a 2.5 or later data stream
 [MyCluster] is a 2.5 or later data stream
 [MyCluster] is a 2.5 or later data stream
 [MyCluster] is a 2.5 or later data stream

  ...etc

 D)   Here are the relevant parts in my /etc/ganglia/gmond.conf

 cluster {
   name = MyCluster
   owner = myclusterowner

   latlong = unspecified
   url = unspecified
 }
 host {
   location = IP_of_EC2
 }

 udp_send_channel {
   mcast_join = IP_of_EC2
   port = 8666

   ttl = 1
 }
 udp_recv_channel {
 port = 8666
 family = inet4
 }

 And gmetad.conf has

 data_source MyCluster ipaddress:8649

 Any help on this would be highly appreciated!.

 Thanks,
 Mike


How is it going?

That's weird. Did you copy the files ganglia-X.YY/web in /var/www? Because
it seems that gmond  gmetad are working fine...

Antonio.
--
What happens now with your Lotus Notes apps - do you make another costly 
upgrade, or settle for being marooned without product support? Time to move
off Lotus Notes and onto the cloud with Force.com, apps are easier to build,
use, and manage than apps on traditional platforms. Sign up for the Lotus 
Notes Migration Kit to learn more. http://p.sf.net/sfu/salesforce-d2d___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


[Ganglia-general] Writing metrics

2010-12-06 Thread Antonio Óscar Balmaseda
Hi, everyone,

I have a strange problem. I'm writing a new metric for ganglia, in python,
in order to measure some data of the apache log. I'm pretty sure that the
code is correct but the thing is that it's absolutely impossible open any
file.

When I try do it, the system returns:

Traceback (most recent call last):
  File /usr/lib/ganglia/python_modules/test.py, line 41, in
Apache_Response
fileHand = open('/var/log/apache2/access.log','r')
IOError: [Errno 13] Permission denied: '/var/log/apache2/access.log'

I've checked the permissions, tested with other files in other directories
and done everything that I could think. Anyone has a clue?

Thanks a lot. Regards,
Antonio.
--
What happens now with your Lotus Notes apps - do you make another costly 
upgrade, or settle for being marooned without product support? Time to move
off Lotus Notes and onto the cloud with Force.com, apps are easier to build,
use, and manage than apps on traditional platforms. Sign up for the Lotus 
Notes Migration Kit to learn more. http://p.sf.net/sfu/salesforce-d2d___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


Re: [Ganglia-general] Writing metrics

2010-12-06 Thread Alex Dean

On Dec 6, 2010, at 3:16 PM, Antonio Óscar Balmaseda wrote:

 Hi, everyone,
 
 I have a strange problem. I'm writing a new metric for ganglia, in python, in 
 order to measure some data of the apache log. I'm pretty sure that the code 
 is correct but the thing is that it's absolutely impossible open any file.
 
 When I try do it, the system returns:
 
 Traceback (most recent call last):
   File /usr/lib/ganglia/python_modules/test.py, line 41, in Apache_Response
 fileHand = open('/var/log/apache2/access.log','r')
 IOError: [Errno 13] Permission denied: '/var/log/apache2/access.log'
 
 I've checked the permissions, tested with other files in other directories 
 and done everything that I could think. Anyone has a clue?

Are you sure you've checked as the gmond user (nobody,etc) in addition to your 
normal user account ?

alex
--
What happens now with your Lotus Notes apps - do you make another costly 
upgrade, or settle for being marooned without product support? Time to move
off Lotus Notes and onto the cloud with Force.com, apps are easier to build,
use, and manage than apps on traditional platforms. Sign up for the Lotus 
Notes Migration Kit to learn more. http://p.sf.net/sfu/salesforce-d2d
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


Re: [Ganglia-general] Ganglia Installation Issues

2010-12-06 Thread Mike
Yes, I have the web folder copied to /var/www/ganglia
Do we have to keep  in gmond.conf,
tcp_accept_channel {
   port = 8649
 }  

Because trying to start gmond with this included in the conf gave me an error 
Unable to create tcp_accept_channel. So I removed this from gmond.conf

Thanks,
Mike

--- On Mon, 12/6/10, Antonio Óscar Balmaseda antonio.o.balmas...@gmail.com 
wrote:

From: Antonio Óscar Balmaseda antonio.o.balmas...@gmail.com
Subject: Re: [Ganglia-general] Ganglia Installation Issues
To: Mike nano_kol...@yahoo.com
Cc: Ganglia ganglia-general@lists.sourceforge.net
Date: Monday, December 6, 2010, 9:07 PM



2010/12/6 Mike nano_kol...@yahoo.com


Hi Antonio,

  Thanks much for your response. I now ran /usr/sbin/update-rc.d -f gmond 
defaults and
/usr/sbin/update-rc.d -f gmetad defaults, which initially gave me error: 
update-rc.d: /etc/init.d/gmond: file does not exist. The init script was in 
/etc/rc.d/init.d/. Then I copied it to /etc/init.d/gmond. Now the update-rc.d 
went fine .



I cannot view the web interface when I point to http://ip_address/ganglia/
and I get The server at ip_address is taking too long to respond.

Here are some relevant information:



A)  I start gmond by /usr/sbin/gmond when I do a telnet EC2Ip_address 8649,
     it gives
HOST NAME=EC2Ip_address IP=10.251.86.192 REPORTED=1291666741 TN=30 
TMAX=20 DMAX=0 LOCATION=EC2Ip_address GMOND_STARTED=129121



B)  /usr/sbin/gmond -d 10 gives me this: 

Got a heartbeat message 1291666768


    metric 'cpu_user' being collected now

    metric 'cpu_user' has value_threshold 1.00
    metric 'cpu_system' being collected now


    metric 'cpu_system' has value_threshold 1.00
    metric 'cpu_idle' being collected now


    metric 'cpu_idle' has value_threshold 5.00
    metric 'cpu_nice' being collected now


    metric 'cpu_nice' has value_threshold 1.00
    metric 'cpu_aidle' being collected now


    metric 'cpu_aidle' has value_threshold 5.00
    metric 'cpu_wio' being collected now


    metric 'cpu_wio' has value_threshold 1.00
    metric 'load_one' being collected now


    metric 'load_one' has value_threshold 1.00
    metric 'load_five' being collected now


    metric 'load_five' has value_threshold 1.00
    metric 'load_fifteen' being collected now


    metric 'load_fifteen' has value_threshold 1.00
    sent message 'heartbeat' of length 56 with 0 errors


Processing a metric value message from EC2_IP
Got a heartbeat message 1291667489
and goes on


C)  /usr/sbin/gmetad -d 10 gives me this


Going to run as user nobody
Sources are ...
Source:
 [MyCluster, step 15] has 1 sources
    10.251.86.192
xml listening on port 8651


interactive xml listening on port 8652
cleanup thread has been started
Data thread 1147169104 is monitoring [MyCluster] data source


    10.251.86.192
[MyCluster] is a 2.5 or later data stream
hash_create size = 1024


hash-size is 1031
hash_create size = 50
hash-size is 53


hash_create size = 50
hash-size is 53
[MyCluster] is a 2.5 or later data stream


[MyCluster] is a 2.5 or later data stream
[MyCluster] is a 2.5 or later data stream


[MyCluster] is a 2.5 or later data stream
[MyCluster] is a 2.5 or later data stream


[MyCluster] is a 2.5 or later data stream
 
 ...etc

D)   Here are the relevant parts in my /etc/ganglia/gmond.conf

cluster {


  name = MyCluster
  owner = myclusterowner


  latlong = unspecified
  url = unspecified
}


host {
  location = IP_of_EC2
}



udp_send_channel {
  mcast_join = IP_of_EC2

  port = 8666

  ttl = 1
}
udp_recv_channel {


    port = 8666
    family = inet4
}



And gmetad.conf has

data_source MyCluster ipaddress:8649

Any help on this would be highly appreciated!.

Thanks,
Mike



How is it going?

That's weird. Did you copy the files ganglia-X.YY/web in /var/www? Because it 
seems that gmond  gmetad are working fine...

Antonio.




  --
What happens now with your Lotus Notes apps - do you make another costly 
upgrade, or settle for being marooned without product support? Time to move
off Lotus Notes and onto the cloud with Force.com, apps are easier to build,
use, and manage than apps on traditional platforms. Sign up for the Lotus 
Notes Migration Kit to learn more. http://p.sf.net/sfu/salesforce-d2d___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


Re: [Ganglia-general] Writing metrics

2010-12-06 Thread David Birdsong
If you put that open inside of metric_init, start gmond as root, then
the filehandle will be created before gmond drops it's privileges to
the user configured.

On Mon, Dec 6, 2010 at 4:16 PM, Antonio Óscar Balmaseda
antonio.o.balmas...@gmail.com wrote:
 Hi, everyone,

 I have a strange problem. I'm writing a new metric for ganglia, in python,
 in order to measure some data of the apache log. I'm pretty sure that the
 code is correct but the thing is that it's absolutely impossible open any
 file.

 When I try do it, the system returns:

 Traceback (most recent call last):
   File /usr/lib/ganglia/python_modules/test.py, line 41, in
 Apache_Response
     fileHand = open('/var/log/apache2/access.log','r')
 IOError: [Errno 13] Permission denied: '/var/log/apache2/access.log'

 I've checked the permissions, tested with other files in other directories
 and done everything that I could think. Anyone has a clue?

 Thanks a lot. Regards,
 Antonio.

 --
 What happens now with your Lotus Notes apps - do you make another costly
 upgrade, or settle for being marooned without product support? Time to move
 off Lotus Notes and onto the cloud with Force.com, apps are easier to build,
 use, and manage than apps on traditional platforms. Sign up for the Lotus
 Notes Migration Kit to learn more. http://p.sf.net/sfu/salesforce-d2d
 ___
 Ganglia-general mailing list
 Ganglia-general@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/ganglia-general



--
What happens now with your Lotus Notes apps - do you make another costly 
upgrade, or settle for being marooned without product support? Time to move
off Lotus Notes and onto the cloud with Force.com, apps are easier to build,
use, and manage than apps on traditional platforms. Sign up for the Lotus 
Notes Migration Kit to learn more. http://p.sf.net/sfu/salesforce-d2d
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general