Hello,I have installed Ganglia 3.7.1 on a Dell 720 cluster running CentOS 6.4.
The Ganglia web has been running OK .
The remote nodes appeared to be down on the Ganglia web page (they were
actually up). I restarted the gmond on the remote nodes, then the Ganglia web
page showed these remote nodes were up, but exactly after 2 or 3 minutes, the
Ganglia web page said these nodes were down again.
I am using following versions:Ganglia Web Frontend version 3.7.0 .
Ganglia Web Backend (gmetad) version 3.7.2 .
Images created with RRDtool version 1.5.3.
Powered by Dwoo 1.1.1.Configuration:Gmond/* This configuration is as close to
2.5.x default behavior as possible
The values closely match ./gmond/metric.h definitions in 2.5.x */
globals {
daemonize = yes
setuid = yes
user = ganglia
debug_level = 0
max_udp_msg_len = 1472
mute = no
deaf = no
allow_extra_data = yes
host_dmax = 86400 /*secs. Expires (removes from web interface) hosts in 1 day
*/
host_tmax = 20 /*secs */
cleanup_threshold = 300 /*secs */
gexec = no
# By default gmond will use reverse DNS resolution when displaying your
hostname
# Uncommeting following value will override that value.
# override_hostname = mywebserver.domain.com
# If you are not using multicast this value should be set to something other
than 0.
# Otherwise if you restart aggregator gmond you will get empty graphs. 60
seconds is reasonable
send_metadata_interval = 0 /*secs */}/*
* The cluster attributes specified will be used as part of the CLUSTER
* tag that will wrap all hosts collected by this instance.
*/
cluster {
name = Production
owner = unspecified
latlong = unspecified
url = unspecified
}/* The host section describes attributes of the host, like the location */
host {
location = unspecified
}/* Feel free to specify as many udp_send_channels as you like. Gmond
used to only support having a single channel */
udp_send_channel {
#bind_hostname = yes # Highly recommended, soon to be default.
# This option tells gmond to use a source address
# that resolves to the machine's hostname. Without
# this, the metrics may appear to come from any
# interface and the DNS names associated with
# those IPs will be used to create the RRDs. mcast_join
= 239.2.11.71
port = 8649
ttl = 1
}/* You can specify as many udp_recv_channels as you like as well. */
udp_recv_channel {
mcast_join = 239.2.11.71
port = 8649
bind = 239.2.11.71
retry_bind = true
# Size of the UDP buffer. If you are handling lots of metrics you really
# should bump it up to e.g. 10MB or even higher.
# buffer = 10485760
}/* You can specify as many tcp_accept_channels as you like to share
an xml description of the state of the cluster */
tcp_accept_channel {
port = 8649
# If you want to gzip XML output
gzip_output = no
}/* Channel to receive sFlow datagrams */
#udp_recv_channel {
# port = 6343
#}gmetad.confdata_source Production hostname
setuid_username nobody
case_sensitive_hostnames 0
CLUSTER INFORMATION
Name: unspecified
Hosts: 2
Gexec Hosts: 0
Dead Hosts: 53
There are no Errors in either gmond or gmetad shown at debug level .Anyone know
what was the problem? Thanks a lot for your help.
Thanks,
Harinath
--
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general