Re: [Ganglia-general] Ganglia Installation Issues
Hi Antonio, Thanks much for your response. I now ran /usr/sbin/update-rc.d -f gmond defaults and /usr/sbin/update-rc.d -f gmetad defaults, which initially gave me error: update-rc.d: /etc/init.d/gmond: file does not exist. The init script was in /etc/rc.d/init.d/. Then I copied it to /etc/init.d/gmond. Now the update-rc.d went fine . I cannot view the web interface when I point to http://ip_address/ganglia/ and I get The server at ip_address is taking too long to respond. Here are some relevant information: A) I start gmond by /usr/sbin/gmond when I do a telnet EC2Ip_address 8649, it gives HOST NAME=EC2Ip_address IP=10.251.86.192 REPORTED=1291666741 TN=30 TMAX=20 DMAX=0 LOCATION=EC2Ip_address GMOND_STARTED=129121 B) /usr/sbin/gmond -d 10 gives me this: Got a heartbeat message 1291666768 metric 'cpu_user' being collected now metric 'cpu_user' has value_threshold 1.00 metric 'cpu_system' being collected now metric 'cpu_system' has value_threshold 1.00 metric 'cpu_idle' being collected now metric 'cpu_idle' has value_threshold 5.00 metric 'cpu_nice' being collected now metric 'cpu_nice' has value_threshold 1.00 metric 'cpu_aidle' being collected now metric 'cpu_aidle' has value_threshold 5.00 metric 'cpu_wio' being collected now metric 'cpu_wio' has value_threshold 1.00 metric 'load_one' being collected now metric 'load_one' has value_threshold 1.00 metric 'load_five' being collected now metric 'load_five' has value_threshold 1.00 metric 'load_fifteen' being collected now metric 'load_fifteen' has value_threshold 1.00 sent message 'heartbeat' of length 56 with 0 errors Processing a metric value message from EC2_IP Got a heartbeat message 1291667489 and goes on C) /usr/sbin/gmetad -d 10 gives me this Going to run as user nobody Sources are ... Source: [MyCluster, step 15] has 1 sources 10.251.86.192 xml listening on port 8651 interactive xml listening on port 8652 cleanup thread has been started Data thread 1147169104 is monitoring [MyCluster] data source 10.251.86.192 [MyCluster] is a 2.5 or later data stream hash_create size = 1024 hash-size is 1031 hash_create size = 50 hash-size is 53 hash_create size = 50 hash-size is 53 [MyCluster] is a 2.5 or later data stream [MyCluster] is a 2.5 or later data stream [MyCluster] is a 2.5 or later data stream [MyCluster] is a 2.5 or later data stream [MyCluster] is a 2.5 or later data stream [MyCluster] is a 2.5 or later data stream ...etc D) Here are the relevant parts in my /etc/ganglia/gmond.conf cluster { name = MyCluster owner = myclusterowner latlong = unspecified url = unspecified } host { location = IP_of_EC2 } udp_send_channel { mcast_join = IP_of_EC2 port = 8666 ttl = 1 } udp_recv_channel { port = 8666 family = inet4 } And gmetad.conf has data_source MyCluster ipaddress:8649 Any help on this would be highly appreciated!. Thanks, Mike --- On Sun, 12/5/10, Antonio Óscar Balmaseda antonio.o.balmas...@gmail.com wrote: From: Antonio Óscar Balmaseda antonio.o.balmas...@gmail.com Subject: Re: [Ganglia-general] Ganglia Installation Issues To: Mike nano_kol...@yahoo.com Cc: Ganglia ganglia-general@lists.sourceforge.net Date: Sunday, December 5, 2010, 9:57 AM Hey, Mike, 2010/12/5 Mike nano_kol...@yahoo.com Hi all, I am trying to get Ganglia run on Ubuntu instance. I built the version 3.1.7 from source. I compiled the source and libs were installed in /etc/ganglia/lib64/ganglia/ I used the command: ./configure --prefix=/etc/ganglia --with-gmetad --sysconfdir=/etc/ganglia make make install , and everything went fine. Ichanged the GMOND in gmond/gmond.init to GMOND=/etc/ganglia/sbin/gmond, and changed GMETAD in gmetad/gmetad.init to GMETAD=/etc/ganglia/sbin/gmetad 1. I have copied the gmond/gmond.init from the build directory to /etc/rc.d/init.d/gmond and when I start gmond using the command /etc/rc.d/init.d/gmond start I get the following error. .: 9: Can't open /etc/rc.d/init.d/functions Also I copied gmetad/gmetad.init to /etc/rc.d/init.d/gmetad and starting it also fails with the above error. What is expected for /etc/rc.d/init.d/functions ? by these scripts. When I try something like gmond -d 1 to start the gmond in foreground it gives a message that : [PYTHON] Can't open the python module path /etc/ganglia/lib64/ganglia/python_modules. Module python_module failed to initialize. In this case, you have to put gmond and gmetad in the startup applications. You can do this with: $ sudo updatedb-rc.d -f gmond defaults $ sudo updatedb-rc.d -f gmetad defaults The other thing can be solved creating this directory or comment the line that searches it in /etc/ganglia/gmond.conf. I'm not sure if it's necessary but the owner of this folder is 'nobody' in my
Re: [Ganglia-general] Ganglia Installation Issues
2010/12/6 Mike nano_kol...@yahoo.com Hi Antonio, Thanks much for your response. I now ran /usr/sbin/update-rc.d -f gmond defaults and /usr/sbin/update-rc.d -f gmetad defaults, which initially gave me error: update-rc.d: /etc/init.d/gmond: file does not exist. The init script was in /etc/rc.d/init.d/. Then I copied it to /etc/init.d/gmond. Now the update-rc.d went fine . I cannot view the web interface when I point to http://ip_address/ganglia/ and I get The server at ip_address is taking too long to respond. Here are some relevant information: A) I start gmond by /usr/sbin/gmond when I do a telnet EC2Ip_address8649, it gives HOST NAME=EC2Ip_address IP=10.251.86.192 REPORTED=1291666741 TN=30 TMAX=20 DMAX=0 LOCATION=EC2Ip_address GMOND_STARTED=129121 B) /usr/sbin/gmond -d 10 gives me this: Got a heartbeat message 1291666768 metric 'cpu_user' being collected now metric 'cpu_user' has value_threshold 1.00 metric 'cpu_system' being collected now metric 'cpu_system' has value_threshold 1.00 metric 'cpu_idle' being collected now metric 'cpu_idle' has value_threshold 5.00 metric 'cpu_nice' being collected now metric 'cpu_nice' has value_threshold 1.00 metric 'cpu_aidle' being collected now metric 'cpu_aidle' has value_threshold 5.00 metric 'cpu_wio' being collected now metric 'cpu_wio' has value_threshold 1.00 metric 'load_one' being collected now metric 'load_one' has value_threshold 1.00 metric 'load_five' being collected now metric 'load_five' has value_threshold 1.00 metric 'load_fifteen' being collected now metric 'load_fifteen' has value_threshold 1.00 sent message 'heartbeat' of length 56 with 0 errors Processing a metric value message from EC2_IP Got a heartbeat message 1291667489 and goes on C) /usr/sbin/gmetad -d 10 gives me this Going to run as user nobody Sources are ... Source: [MyCluster, step 15] has 1 sources 10.251.86.192 xml listening on port 8651 interactive xml listening on port 8652 cleanup thread has been started Data thread 1147169104 is monitoring [MyCluster] data source 10.251.86.192 [MyCluster] is a 2.5 or later data stream hash_create size = 1024 hash-size is 1031 hash_create size = 50 hash-size is 53 hash_create size = 50 hash-size is 53 [MyCluster] is a 2.5 or later data stream [MyCluster] is a 2.5 or later data stream [MyCluster] is a 2.5 or later data stream [MyCluster] is a 2.5 or later data stream [MyCluster] is a 2.5 or later data stream [MyCluster] is a 2.5 or later data stream ...etc D) Here are the relevant parts in my /etc/ganglia/gmond.conf cluster { name = MyCluster owner = myclusterowner latlong = unspecified url = unspecified } host { location = IP_of_EC2 } udp_send_channel { mcast_join = IP_of_EC2 port = 8666 ttl = 1 } udp_recv_channel { port = 8666 family = inet4 } And gmetad.conf has data_source MyCluster ipaddress:8649 Any help on this would be highly appreciated!. Thanks, Mike How is it going? That's weird. Did you copy the files ganglia-X.YY/web in /var/www? Because it seems that gmond gmetad are working fine... Antonio. -- What happens now with your Lotus Notes apps - do you make another costly upgrade, or settle for being marooned without product support? Time to move off Lotus Notes and onto the cloud with Force.com, apps are easier to build, use, and manage than apps on traditional platforms. Sign up for the Lotus Notes Migration Kit to learn more. http://p.sf.net/sfu/salesforce-d2d___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
[Ganglia-general] Writing metrics
Hi, everyone, I have a strange problem. I'm writing a new metric for ganglia, in python, in order to measure some data of the apache log. I'm pretty sure that the code is correct but the thing is that it's absolutely impossible open any file. When I try do it, the system returns: Traceback (most recent call last): File /usr/lib/ganglia/python_modules/test.py, line 41, in Apache_Response fileHand = open('/var/log/apache2/access.log','r') IOError: [Errno 13] Permission denied: '/var/log/apache2/access.log' I've checked the permissions, tested with other files in other directories and done everything that I could think. Anyone has a clue? Thanks a lot. Regards, Antonio. -- What happens now with your Lotus Notes apps - do you make another costly upgrade, or settle for being marooned without product support? Time to move off Lotus Notes and onto the cloud with Force.com, apps are easier to build, use, and manage than apps on traditional platforms. Sign up for the Lotus Notes Migration Kit to learn more. http://p.sf.net/sfu/salesforce-d2d___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
Re: [Ganglia-general] Writing metrics
On Dec 6, 2010, at 3:16 PM, Antonio Óscar Balmaseda wrote: Hi, everyone, I have a strange problem. I'm writing a new metric for ganglia, in python, in order to measure some data of the apache log. I'm pretty sure that the code is correct but the thing is that it's absolutely impossible open any file. When I try do it, the system returns: Traceback (most recent call last): File /usr/lib/ganglia/python_modules/test.py, line 41, in Apache_Response fileHand = open('/var/log/apache2/access.log','r') IOError: [Errno 13] Permission denied: '/var/log/apache2/access.log' I've checked the permissions, tested with other files in other directories and done everything that I could think. Anyone has a clue? Are you sure you've checked as the gmond user (nobody,etc) in addition to your normal user account ? alex -- What happens now with your Lotus Notes apps - do you make another costly upgrade, or settle for being marooned without product support? Time to move off Lotus Notes and onto the cloud with Force.com, apps are easier to build, use, and manage than apps on traditional platforms. Sign up for the Lotus Notes Migration Kit to learn more. http://p.sf.net/sfu/salesforce-d2d ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
Re: [Ganglia-general] Ganglia Installation Issues
Yes, I have the web folder copied to /var/www/ganglia Do we have to keep in gmond.conf, tcp_accept_channel { port = 8649 } Because trying to start gmond with this included in the conf gave me an error Unable to create tcp_accept_channel. So I removed this from gmond.conf Thanks, Mike --- On Mon, 12/6/10, Antonio Óscar Balmaseda antonio.o.balmas...@gmail.com wrote: From: Antonio Óscar Balmaseda antonio.o.balmas...@gmail.com Subject: Re: [Ganglia-general] Ganglia Installation Issues To: Mike nano_kol...@yahoo.com Cc: Ganglia ganglia-general@lists.sourceforge.net Date: Monday, December 6, 2010, 9:07 PM 2010/12/6 Mike nano_kol...@yahoo.com Hi Antonio, Thanks much for your response. I now ran /usr/sbin/update-rc.d -f gmond defaults and /usr/sbin/update-rc.d -f gmetad defaults, which initially gave me error: update-rc.d: /etc/init.d/gmond: file does not exist. The init script was in /etc/rc.d/init.d/. Then I copied it to /etc/init.d/gmond. Now the update-rc.d went fine . I cannot view the web interface when I point to http://ip_address/ganglia/ and I get The server at ip_address is taking too long to respond. Here are some relevant information: A) I start gmond by /usr/sbin/gmond when I do a telnet EC2Ip_address 8649, it gives HOST NAME=EC2Ip_address IP=10.251.86.192 REPORTED=1291666741 TN=30 TMAX=20 DMAX=0 LOCATION=EC2Ip_address GMOND_STARTED=129121 B) /usr/sbin/gmond -d 10 gives me this: Got a heartbeat message 1291666768 metric 'cpu_user' being collected now metric 'cpu_user' has value_threshold 1.00 metric 'cpu_system' being collected now metric 'cpu_system' has value_threshold 1.00 metric 'cpu_idle' being collected now metric 'cpu_idle' has value_threshold 5.00 metric 'cpu_nice' being collected now metric 'cpu_nice' has value_threshold 1.00 metric 'cpu_aidle' being collected now metric 'cpu_aidle' has value_threshold 5.00 metric 'cpu_wio' being collected now metric 'cpu_wio' has value_threshold 1.00 metric 'load_one' being collected now metric 'load_one' has value_threshold 1.00 metric 'load_five' being collected now metric 'load_five' has value_threshold 1.00 metric 'load_fifteen' being collected now metric 'load_fifteen' has value_threshold 1.00 sent message 'heartbeat' of length 56 with 0 errors Processing a metric value message from EC2_IP Got a heartbeat message 1291667489 and goes on C) /usr/sbin/gmetad -d 10 gives me this Going to run as user nobody Sources are ... Source: [MyCluster, step 15] has 1 sources 10.251.86.192 xml listening on port 8651 interactive xml listening on port 8652 cleanup thread has been started Data thread 1147169104 is monitoring [MyCluster] data source 10.251.86.192 [MyCluster] is a 2.5 or later data stream hash_create size = 1024 hash-size is 1031 hash_create size = 50 hash-size is 53 hash_create size = 50 hash-size is 53 [MyCluster] is a 2.5 or later data stream [MyCluster] is a 2.5 or later data stream [MyCluster] is a 2.5 or later data stream [MyCluster] is a 2.5 or later data stream [MyCluster] is a 2.5 or later data stream [MyCluster] is a 2.5 or later data stream ...etc D) Here are the relevant parts in my /etc/ganglia/gmond.conf cluster { name = MyCluster owner = myclusterowner latlong = unspecified url = unspecified } host { location = IP_of_EC2 } udp_send_channel { mcast_join = IP_of_EC2 port = 8666 ttl = 1 } udp_recv_channel { port = 8666 family = inet4 } And gmetad.conf has data_source MyCluster ipaddress:8649 Any help on this would be highly appreciated!. Thanks, Mike How is it going? That's weird. Did you copy the files ganglia-X.YY/web in /var/www? Because it seems that gmond gmetad are working fine... Antonio. -- What happens now with your Lotus Notes apps - do you make another costly upgrade, or settle for being marooned without product support? Time to move off Lotus Notes and onto the cloud with Force.com, apps are easier to build, use, and manage than apps on traditional platforms. Sign up for the Lotus Notes Migration Kit to learn more. http://p.sf.net/sfu/salesforce-d2d___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
Re: [Ganglia-general] Writing metrics
If you put that open inside of metric_init, start gmond as root, then the filehandle will be created before gmond drops it's privileges to the user configured. On Mon, Dec 6, 2010 at 4:16 PM, Antonio Óscar Balmaseda antonio.o.balmas...@gmail.com wrote: Hi, everyone, I have a strange problem. I'm writing a new metric for ganglia, in python, in order to measure some data of the apache log. I'm pretty sure that the code is correct but the thing is that it's absolutely impossible open any file. When I try do it, the system returns: Traceback (most recent call last): File /usr/lib/ganglia/python_modules/test.py, line 41, in Apache_Response fileHand = open('/var/log/apache2/access.log','r') IOError: [Errno 13] Permission denied: '/var/log/apache2/access.log' I've checked the permissions, tested with other files in other directories and done everything that I could think. Anyone has a clue? Thanks a lot. Regards, Antonio. -- What happens now with your Lotus Notes apps - do you make another costly upgrade, or settle for being marooned without product support? Time to move off Lotus Notes and onto the cloud with Force.com, apps are easier to build, use, and manage than apps on traditional platforms. Sign up for the Lotus Notes Migration Kit to learn more. http://p.sf.net/sfu/salesforce-d2d ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general -- What happens now with your Lotus Notes apps - do you make another costly upgrade, or settle for being marooned without product support? Time to move off Lotus Notes and onto the cloud with Force.com, apps are easier to build, use, and manage than apps on traditional platforms. Sign up for the Lotus Notes Migration Kit to learn more. http://p.sf.net/sfu/salesforce-d2d ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general