Hi Antonio,
I could finally see the Ganglia up running in the webinterface. I restarted
everything and now its fine.
Thanks a lot for your help.
Now I am looking for monitoring hadoop using Ganglia. I added the metrics
properties to the hadoop-metrics properties. Is there something else I have to
do to see the hadoop metrics in ganglia?
Thanks,
Mike
--- On Tue, 12/7/10, Antonio Óscar Balmaseda <[email protected]>
wrote:
From: Antonio Óscar Balmaseda <[email protected]>
Subject: Re: [Ganglia-general] Ganglia Installation Issues
To: "Mike" <[email protected]>
Cc: "Ganglia" <[email protected]>
Date: Tuesday, December 7, 2010, 11:37 AM
2010/12/6 Mike <[email protected]>
Yes, I have the web folder copied to /var/www/ganglia
Do we have to keep in gmond.conf,
tcp_accept_channel {
port = 8649
}
Because trying to start gmond with this included in the conf gave me an error
"Unable to create tcp_accept_channel". So I removed this from gmond.conf
Thanks,
Mike
--- On Mon, 12/6/10, Antonio Óscar Balmaseda <[email protected]>
wrote:
From: Antonio Óscar Balmaseda <[email protected]>
Subject: Re: [Ganglia-general] Ganglia Installation Issues
To: "Mike" <[email protected]>
Cc: "Ganglia" <[email protected]>
Date: Monday,
December 6, 2010, 9:07 PM
2010/12/6 Mike <[email protected]>
Hi Antonio,
Thanks much for your response. I now ran /usr/sbin/update-rc.d -f gmond
defaults and
/usr/sbin/update-rc.d -f gmetad defaults, which initially gave me error:
update-rc.d: /etc/init.d/gmond: file does not exist. The init script was in
/etc/rc.d/init.d/. Then I copied it to /etc/init.d/gmond. Now the update-rc.d
went fine .
I cannot view the web interface when I point to http://ip_address/ganglia/
and I get The server at ip_address is taking too long to respond.
Here are some relevant information:
A) I start gmond by "/usr/sbin/gmond" when I do a telnet EC2Ip_address 8649,
it gives
<HOST NAME=EC2Ip_address IP="10.251.86.192" REPORTED="1291666741" TN="30"
TMAX="20" DMAX="0" LOCATION=EC2Ip_address GMOND_STARTED="1291666621">
B) /usr/sbin/gmond -d 10 gives me this:
Got a heartbeat message 1291666768
metric 'cpu_user' being collected now
metric 'cpu_user' has value_threshold 1.000000
metric 'cpu_system' being collected now
metric 'cpu_system' has value_threshold 1.000000
metric 'cpu_idle' being collected now
metric 'cpu_idle' has value_threshold 5.000000
metric 'cpu_nice' being collected now
metric 'cpu_nice' has value_threshold 1.000000
metric 'cpu_aidle' being collected now
metric 'cpu_aidle' has value_threshold 5.000000
metric 'cpu_wio' being collected now
metric 'cpu_wio' has value_threshold 1.000000
metric 'load_one' being collected now
metric 'load_one' has value_threshold 1.000000
metric 'load_five' being collected now
metric 'load_five' has value_threshold 1.000000
metric 'load_fifteen' being collected now
metric 'load_fifteen' has value_threshold 1.000000
sent message 'heartbeat' of length 56 with 0 errors
Processing a metric value message from EC2_IP
Got a heartbeat message 1291667489
....and goes on
C) /usr/sbin/gmetad -d 10 gives me this
Going to run as user nobody
Sources are ...
Source:
[MyCluster, step 15] has 1 sources
10.251.86.192
xml listening on port 8651
interactive xml listening on port 8652
cleanup thread has been started
Data thread 1147169104 is monitoring [MyCluster] data source
10.251.86.192
[MyCluster] is a 2.5 or later data stream
hash_create size = 1024
hash->size is 1031
hash_create size = 50
hash->size is 53
hash_create size = 50
hash->size is 53
[MyCluster] is a 2.5 or later data stream
[MyCluster] is a 2.5 or later data stream
[MyCluster] is a 2.5 or later data stream
[MyCluster] is a 2.5 or later data stream
[MyCluster] is a 2.5 or later data stream
[MyCluster] is a 2.5 or later data stream
...etc
D) Here are the relevant parts in my /etc/ganglia/gmond.conf
cluster {
name = "MyCluster"
owner = "myclusterowner"
latlong = "unspecified"
url = "unspecified"
}
host {
location = "IP_of_EC2"
}
udp_send_channel {
mcast_join = IP_of_EC2
port = 8666
ttl = 1
}
udp_recv_channel {
port = 8666
family = inet4
}
And gmetad.conf has
data_source "MyCluster" ipaddress:8649
Any help on this would be highly appreciated!.
Thanks,
Mike
If I'm not wrong, this error is showed when gmond is already running. You have
to keep this lines to get working the system. Try readd the lines, stopping
gmond and restarting, it must work fine.
One question: you can see the ganglia website in any case, isn't?
Regards,
Antonio.
------------------------------------------------------------------------------
What happens now with your Lotus Notes apps - do you make another costly
upgrade, or settle for being marooned without product support? Time to move
off Lotus Notes and onto the cloud with Force.com, apps are easier to build,
use, and manage than apps on traditional platforms. Sign up for the Lotus
Notes Migration Kit to learn more. http://p.sf.net/sfu/salesforce-d2d
_______________________________________________
Ganglia-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/ganglia-general