Hi Antonio,

  Thanks much for your response. I now ran /usr/sbin/update-rc.d -f gmond 
defaults and
/usr/sbin/update-rc.d -f gmetad defaults, which initially gave me error: 
update-rc.d: /etc/init.d/gmond: file does not exist. The init script was in 
/etc/rc.d/init.d/. Then I copied it to /etc/init.d/gmond. Now the update-rc.d 
went fine .

I cannot view the web interface when I point to http://ip_address/ganglia/
and I get The server at ip_address is taking too long to respond.

Here are some relevant information:

A)  I start gmond by "/usr/sbin/gmond" when I do a telnet EC2Ip_address 8649,
     it gives
<HOST NAME=EC2Ip_address IP="10.251.86.192" REPORTED="1291666741" TN="30" 
TMAX="20" DMAX="0" LOCATION=EC2Ip_address GMOND_STARTED="1291666621">

B)  /usr/sbin/gmond -d 10 gives me this: 

Got a heartbeat message 1291666768

        metric 'cpu_user' being collected now
        metric 'cpu_user' has value_threshold 1.000000
        metric 'cpu_system' being collected now
        metric 'cpu_system' has value_threshold 1.000000
        metric 'cpu_idle' being collected now
        metric 'cpu_idle' has value_threshold 5.000000
        metric 'cpu_nice' being collected now
        metric 'cpu_nice' has value_threshold 1.000000
        metric 'cpu_aidle' being collected now
        metric 'cpu_aidle' has value_threshold 5.000000
        metric 'cpu_wio' being collected now
        metric 'cpu_wio' has value_threshold 1.000000
        metric 'load_one' being collected now
        metric 'load_one' has value_threshold 1.000000
        metric 'load_five' being collected now
        metric 'load_five' has value_threshold 1.000000
        metric 'load_fifteen' being collected now
        metric 'load_fifteen' has value_threshold 1.000000
        sent message 'heartbeat' of length 56 with 0 errors
Processing a metric value message from EC2_IP
Got a heartbeat message 1291667489
....and goes on

C)  /usr/sbin/gmetad -d 10 gives me this

Going to run as user nobody
Sources are ...
Source: [MyCluster, step 15] has 1 sources
        10.251.86.192
xml listening on port 8651
interactive xml listening on port 8652
cleanup thread has been started
Data thread 1147169104 is monitoring [MyCluster] data source
        10.251.86.192
[MyCluster] is a 2.5 or later data stream
hash_create size = 1024
hash->size is 1031
hash_create size = 50
hash->size is 53
hash_create size = 50
hash->size is 53
[MyCluster] is a 2.5 or later data stream
[MyCluster] is a 2.5 or later data stream
[MyCluster] is a 2.5 or later data stream
[MyCluster] is a 2.5 or later data stream
[MyCluster] is a 2.5 or later data stream
[MyCluster] is a 2.5 or later data stream
 
 ...etc

D)   Here are the relevant parts in my /etc/ganglia/gmond.conf

cluster {
  name = "MyCluster"
  owner = "myclusterowner"
  latlong = "unspecified"
  url = "unspecified"
}
host {
  location = "IP_of_EC2"
}

udp_send_channel {
  mcast_join = IP_of_EC2
  port = 8666
  ttl = 1
}
udp_recv_channel {
    port = 8666
    family = inet4
}

And gmetad.conf has

data_source "MyCluster" ipaddress:8649

Any help on this would be highly appreciated!.

Thanks,
Mike



--- On Sun, 12/5/10, Antonio Óscar Balmaseda <[email protected]> 
wrote:

From: Antonio Óscar Balmaseda <[email protected]>
Subject: Re: [Ganglia-general] Ganglia Installation Issues
To: "Mike" <[email protected]>
Cc: "Ganglia" <[email protected]>
Date: Sunday, December 5, 2010, 9:57 AM


Hey, Mike,

2010/12/5 Mike <[email protected]>


Hi all,

   I am trying to get Ganglia run on Ubuntu instance. I built the version 3.1.7 
from source. I compiled the source and libs were installed in 
/etc/ganglia/lib64/ganglia/


I used the command: ./configure --prefix=/etc/ganglia --with-gmetad 
--sysconfdir=/etc/ganglia && make && make install , and everything went fine.
Ichanged the GMOND in gmond/gmond.init to GMOND=/etc/ganglia/sbin/gmond, and 
changed GMETAD in gmetad/gmetad.init to GMETAD=/etc/ganglia/sbin/gmetad



1. I have copied the gmond/gmond.init from the build directory to 
/etc/rc.d/init.d/gmond and when I start gmond using the command
/etc/rc.d/init.d/gmond start
I get the following error.
.: 9: Can't open /etc/rc.d/init.d/functions


Also I copied gmetad/gmetad.init to /etc/rc.d/init.d/gmetad and starting it 
also fails with the above error.
 What is expected for /etc/rc.d/init.d/functions ? by these scripts.

When I try something like "gmond -d 1" to start the gmond in foreground it 
gives a message that :
    [PYTHON] Can't open the python module path 
/etc/ganglia/lib64/ganglia/python_modules. Module python_module failed to 
initialize.




In this case, you have to put gmond and gmetad in the startup applications. You 
can do this with:

$ sudo updatedb-rc.d -f gmond defaults
$ sudo updatedb-rc.d -f gmetad defaults



The other thing can be solved creating this directory or comment the line that 
searches it in /etc/ganglia/gmond.conf. I'm not sure if it's necessary but the 
owner of this folder is 'nobody' in my system. I recommend this order:



$ sudo mkdir -p /etc/ganglia/lib64/ganglia/python_modules

 

2. Also I am running an EC2 instance, so while making the changes in the conf 
files,
     In the gmetad.conf I made foll changes:
     a. data_source "MyCluster" 'internalIP of the instance' ( or  should I add 
external IP of the instance?)


      b. What should I set for User gmetad will setuid to (defaults to 
"nobody"). My rrd directory is at /var/lib/ganglia/rrds and is owned by 
"root".So    
            should I set the user here as "root"?



     In
 the gmond.conf I have the foll:
    cluster {
          name = "MyCluster"
         owner = "MyOwner"
          latlong = "unspecified"
          url = "unspecified"}


     host {  location = "unspecified"}    (again what should go in here???) 
     udp_send_channel {
          mcast_join = 239.2.11.71
          port = 8649
          ttl = 1}
    udp_recv_channel {


          mcast_join = 239.2.11.71
          port =
 8649
          bind = 239.2.11.71}
    tcp_accept_channel {
       port = 8649}

All other conf paremeters are unchanged.
Please help me with this.

To your question a, I'm not sure but I don't think so. Nevertheless, the 
question b is clear. You have to make owner to the user 'nobody'. If you don't, 
ganglia can't write in that directory.






Thanks,
Mike

Ganglia-general mailing list

[email protected]

https://lists.sourceforge.net/lists/listinfo/ganglia-general



Regards,
Antonio.




      
------------------------------------------------------------------------------
What happens now with your Lotus Notes apps - do you make another costly 
upgrade, or settle for being marooned without product support? Time to move
off Lotus Notes and onto the cloud with Force.com, apps are easier to build,
use, and manage than apps on traditional platforms. Sign up for the Lotus 
Notes Migration Kit to learn more. http://p.sf.net/sfu/salesforce-d2d
_______________________________________________
Ganglia-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/ganglia-general

Reply via email to