Hello,
I am trying to configure Ganglia monitoring system for our cluster. Its
seems to be working all the port and everything look fine, Here are my
configuration files
**************************************************
*Gmetad.conf*
I added these two lines in this file
Server1 is running both gmond and gmetad services
data_source "Our Cluster Name" Server1:8649 Server2:8649 server3:8649
server4:8649 Server5:8649
gridname "Our Grid Name"
*Gmond.conf *File which is same on all the nodes
/* This configuration is as close to 2.5.x default behavior as possible
The values closely match ./gmond/metric.h definitions in 2.5.x */
globals {
setuid = yes
user = nobody
# debug_level = 10
cleanup_threshold = 300 /*secs */
}
/* If a cluster attribute is specified, then all gmond hosts are wrapped
inside
* of a <CLUSTER> tag. If you do not specify a cluster tag, then all
<HOSTS> will
* NOT be wrapped inside of a <CLUSTER> tag. */
cluster {
name = "Our Cluster Name"
owner = "Our Grid Name"
}
/* Feel free to specify as many udp_send_channels as you like. Gmond
used to only support having a single channel */
udp_send_channel {
# mcast_join = 239.2.11.71
* host = server1*
port = 8649
}
/* You can specify as many udp_recv_channels as you like as well. */
udp_recv_channel {
port = 8649
acl {
default = "allow"
access {
ip = 192.168.240.0
mask = 32
action = "allow"
}
}
}
/* You can specify as many tcp_accept_channels as you like to share
an xml description of the state of the cluster */
tcp_accept_channel {
port = 8649
acl{
default="allow"
access {
ip= 192.168.240.0
mask = 32
action = "allow"
}
}
}
/* The old internal 2.5.x metric array has been replaced by the following
collection_group directives. What follows is the default behavior for
collecting and sending metrics that is as close to 2.5.x behavior as
possible. */
/* This collection group will cause a heartbeat (or beacon) to be sent every
20 seconds. In the heartbeat is the GMOND_STARTED data which expresses
the age of the running gmond. */
collection_group {
collect_once = yes
time_threshold = 20
metric {
name = "heartbeat"
}
}
/* This collection group will send general info about this host every
1200 secs.
This information doesn't change between reboots and is only collected
once. */
collection_group {
collect_once = yes
time_threshold = 1200
metric {
name = "cpu_num"
.... Default Collection Group, I did not make any change in that.
***************************************************************************
Started gmetad and gmond, Everything seems fine so far, it created rrds
in /var/ling/ganglia/rrd/our cluster name/server* directory.
Now when I go on the web to see the graphs, all the graphs looks same. I
click on different links, but I get the same graph every time. pie chart
does not show up, Can some body suggest me what I might be missing. I
dont see, CPU, Memory and Network usage graphs. DO I have to do anything
in web-front-end configuration. See the image below
I am using These packages..
ganglia-gmond-3.0.1-1
ganglia-monitor-core-lib-2.5.7-1
ganglia-gmetad-3.0.1-1
ganglia-web-3.0.1-1
2.6.11 Kernel.
I can provide more information if that can help you understanding the
problem.
Thanks you so much,