I've been trying to get Ganglia 3.0.2 installed and working, and after going around in circles far too many times, I've concluded that I don't really understand the larger picture of how Ganglia works.

Seems there are three pieces to this:

* gmond - daemon that runs on each host, collects information on localhost

* gmetad - "Federation in Ganglia is achieved using a tree of point-to-point connections amongst representative cluster nodes to aggregate the state of multiple clusters." Ah - collects data from all the other nodes? Can it be that important, since by default it is not even built?

* web app - Some PHP code that sends a request to a host/socket and displays the output. Implies to me that the target software must have access to some place where all the data is saved.


What's going on here?

gmond collects stats for localhost. Are these saved somewhere? Apparently not, since I see no reference to a path. Are they sent somewhere? Well, gmond.conf provides udp_send_channel, so that suggests the data may be sent elsewhere through it. You can apparently name a specific host/port or broadcast the data around. This seems reasonable.

But gmond.conf provides udp_recv_channel too. What the heck can this be for? Just as confusing to me is tcp_accept_channel. So maybe gmond CAN save data. Now I'm back to my original confusion.

Equally confusing is the collection of ports (I may have these wrong; I've been fiddling with this so much that I may have lost the defaults):

webapp - 8652 (how to ask for all the collected data?)
gmond  - 8649 (send, receive, accept - only send makes sense to me)
gmetad - 8655, 8651 are mentioned near data_source
gmetad - 8651 (answer requests for XML) What XML? From whom, and for what?
gmetad - 8652 (answer queries for XML) Same questions.
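
One way to see what each of these ports actually hands back is to connect and dump whatever arrives. The following is only a minimal sketch: it assumes the daemon writes its data to any TCP client that connects and then closes the connection, which may not hold for every port (one that waits for a request before answering will simply time out here). The names dump_port and describe are made up for this illustration and are not part of Ganglia.

```python
import socket


def dump_port(host, port, timeout=5.0):
    """Connect to host:port and return everything sent before the peer closes.

    Assumption: the daemon starts sending as soon as a client connects.
    """
    chunks = []
    with socket.create_connection((host, port), timeout=timeout) as sock:
        while True:
            data = sock.recv(4096)
            if not data:  # empty read means the peer closed the connection
                break
            chunks.append(data)
    return b"".join(chunks).decode("utf-8", errors="replace")


def describe(host, port, timeout=5.0):
    """Return a one-line summary of what the port sent back, if anything."""
    try:
        text = dump_port(host, port, timeout)
    except OSError as exc:  # refused connection, timeout, unreachable host
        return f"port {port}: no answer ({exc})"
    return f"port {port}: {len(text)} characters, starting with {text[:60]!r}"
```

Running something like print(describe("localhost", 8649)) on a cluster node, and again for 8651 and 8652, should at least show which ports answer and whether the reply looks like XML.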


I've read everything I can find on the web site, but I cannot get my poor head around what's going on, so I can't figure out why Ganglia doesn't work for me. Apparently the mail archives are broken, so I've not been able to search past emails.

One issue I'm sure must be a factor is that my cluster machines are on an internal private network and the web access for the summary must come from outside that network. That's why I've been focusing on how the data is saved.

If anyone can give me a clearer picture of the data flow, I'd appreciate it. TIA


--
=============================================================
Terry Gliedt     [EMAIL PROTECTED]       http://www.hps.com/~tpg/
Biostatistics, Univ of Michigan  Personal Email:  [EMAIL PROTECTED]
