Hi Antonio,

   I could finally see the Ganglia up running in the webinterface. I restarted 
everything and now its fine.
Thanks a lot for your help. 
Now I am looking for monitoring hadoop using Ganglia. I added the metrics 
properties to the hadoop-metrics properties. Is there something else I have to 
do to see the hadoop metrics in ganglia?

Thanks,
Mike

--- On Tue, 12/7/10, Antonio Óscar Balmaseda <[email protected]> 
wrote:

From: Antonio Óscar Balmaseda <[email protected]>
Subject: Re: [Ganglia-general] Ganglia Installation Issues
To: "Mike" <[email protected]>
Cc: "Ganglia" <[email protected]>
Date: Tuesday, December 7, 2010, 11:37 AM



2010/12/6 Mike <[email protected]>


Yes, I have the web folder copied to /var/www/ganglia
Do we have to keep  in gmond.conf,
tcp_accept_channel {


   port = 8649
 }  

Because trying to start gmond with this included in the conf gave me an error 
"Unable to create tcp_accept_channel". So I removed this from gmond.conf



Thanks,
Mike

--- On Mon, 12/6/10, Antonio Óscar Balmaseda <[email protected]> 
wrote:



From: Antonio Óscar Balmaseda <[email protected]>


Subject: Re: [Ganglia-general] Ganglia Installation Issues
To: "Mike" <[email protected]>
Cc: "Ganglia" <[email protected]>


Date: Monday,
 December 6, 2010, 9:07 PM



2010/12/6 Mike <[email protected]>




Hi Antonio,

  Thanks much for your response. I now ran /usr/sbin/update-rc.d -f gmond 
defaults and
/usr/sbin/update-rc.d -f gmetad defaults, which initially gave me error: 
update-rc.d: /etc/init.d/gmond: file does not exist. The init script was in 
/etc/rc.d/init.d/. Then I copied it to /etc/init.d/gmond. Now the update-rc.d 
went fine .





I cannot view the web interface when I point to http://ip_address/ganglia/
and I get The server at ip_address is taking too long to respond.



Here are some relevant information:



A)  I start gmond by "/usr/sbin/gmond" when I do a telnet EC2Ip_address 8649,
     it gives
<HOST NAME=EC2Ip_address IP="10.251.86.192" REPORTED="1291666741" TN="30" 
TMAX="20" DMAX="0" LOCATION=EC2Ip_address GMOND_STARTED="1291666621">





B)  /usr/sbin/gmond -d 10 gives me this: 

Got a heartbeat message 1291666768




        metric 'cpu_user' being collected now

        metric 'cpu_user' has value_threshold 1.000000
        metric 'cpu_system' being collected now




        metric 'cpu_system' has value_threshold 1.000000
        metric 'cpu_idle' being collected now




        metric 'cpu_idle' has value_threshold 5.000000
        metric 'cpu_nice' being collected now




        metric 'cpu_nice' has value_threshold 1.000000
        metric 'cpu_aidle' being collected now




        metric 'cpu_aidle' has value_threshold 5.000000
        metric 'cpu_wio' being collected now




        metric 'cpu_wio' has value_threshold 1.000000
        metric 'load_one' being collected now




        metric 'load_one' has value_threshold 1.000000
        metric 'load_five' being collected now




        metric 'load_five' has value_threshold 1.000000
        metric 'load_fifteen' being collected now




        metric 'load_fifteen' has value_threshold 1.000000
        sent message 'heartbeat' of length 56 with 0 errors




Processing a metric value message from EC2_IP
Got a heartbeat message 1291667489
....and goes on




C)  /usr/sbin/gmetad -d 10 gives me this


Going to run as user nobody
Sources are ...
Source:
 [MyCluster, step 15] has 1 sources
        10.251.86.192
xml listening on port 8651




interactive xml listening on port 8652
cleanup thread has been started
Data thread 1147169104 is monitoring [MyCluster] data source




        10.251.86.192
[MyCluster] is a 2.5 or later data stream
hash_create size = 1024




hash->size is 1031
hash_create size = 50
hash->size is 53




hash_create size = 50
hash->size is 53
[MyCluster] is a 2.5 or later data stream




[MyCluster] is a 2.5 or later data stream
[MyCluster] is a 2.5 or later data stream




[MyCluster] is a 2.5 or later data stream
[MyCluster] is a 2.5 or later data stream




[MyCluster] is a 2.5 or later data stream
 
 ...etc

D)   Here are the relevant parts in my /etc/ganglia/gmond.conf

cluster {




  name = "MyCluster"
  owner = "myclusterowner"


  latlong = "unspecified"
  url = "unspecified"
}




host {
  location = "IP_of_EC2"
}





udp_send_channel {
  mcast_join = IP_of_EC2



  port = 8666

  ttl = 1
}
udp_recv_channel {




    port = 8666
    family = inet4
}





And gmetad.conf has

data_source "MyCluster" ipaddress:8649

Any help on this would be highly appreciated!.

Thanks,
Mike






      
If I'm not wrong, this error is showed when gmond is already running. You have 
to keep this lines to get working the system. Try readd the lines, stopping 
gmond and restarting, it must work fine.



One question:  you can see the ganglia website in any case, isn't?

Regards,
Antonio.




      
------------------------------------------------------------------------------
What happens now with your Lotus Notes apps - do you make another costly 
upgrade, or settle for being marooned without product support? Time to move
off Lotus Notes and onto the cloud with Force.com, apps are easier to build,
use, and manage than apps on traditional platforms. Sign up for the Lotus 
Notes Migration Kit to learn more. http://p.sf.net/sfu/salesforce-d2d
_______________________________________________
Ganglia-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/ganglia-general

Reply via email to