Man...
I'm noticing tons of bugs in Ganglia... I'll log all my troubles in a
bit but wanted to ask this question first.
I had two nodes, both on an internal network and both getting their
hostnames from their /etc/hosts files.
One box had the /etc/hosts entry for the node and the other did NOT.
So the dataset referred to the node by its 10.0.0.12 IP, but the source
gmond node knew it as db3.foo.com.
gmetad didn't appear to have the data stored correctly, and all the RRD
graphs were blank.
The only way I could fix it was to sync up both /etc/hosts files between
gmond nodes, restart them, then restart the gmetad node.
THEN it worked. But the problem was that in the meantime I was getting
really confusing behavior without any debug info.
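A mismatch like this can be spotted before restarting anything by comparing the two hosts files directly. The following is only a rough sketch in Python, not anything Ganglia ships: `parse_hosts` and `diff_hosts` are hypothetical helper names, and the sample file contents are made up to mirror the db3.foo.com / 10.0.0.12 case above.

```python
def parse_hosts(text):
    """Map each hostname/alias in /etc/hosts-style text to its IP."""
    mapping = {}
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments and blank lines
        if not line:
            continue
        fields = line.split()
        ip, names = fields[0], fields[1:]
        for name in names:
            # Keep the first mapping seen for a given name.
            mapping.setdefault(name.lower(), ip)
    return mapping


def diff_hosts(text_a, text_b):
    """Return {name: (ip_in_a, ip_in_b)} for names whose mapping differs.

    A value of None means the name is missing from that file.
    """
    a, b = parse_hosts(text_a), parse_hosts(text_b)
    return {
        name: (a.get(name), b.get(name))
        for name in sorted(set(a) | set(b))
        if a.get(name) != b.get(name)
    }


# Made-up contents: box A knows db3.foo.com, box B does not.
box_a = "127.0.0.1 localhost\n10.0.0.12 db3.foo.com db3  # database node\n"
box_b = "127.0.0.1 localhost\n"

for name, (ip_a, ip_b) in diff_hosts(box_a, box_b).items():
    print(f"{name}: box A -> {ip_a}, box B -> {ip_b}")
```

In practice the two strings would come from reading /etc/hosts on each gmond box; an empty result means both boxes agree on every name.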
Does Ganglia internally keep track of nodes via their gethostbyname()
resolution, or does it use IPs? The problem, I think, is that it doesn't
use a canonical name internally, so it can end up with an incorrect dataset.
I think the dataset should internally just use the IP, with the hostname
only reflected in the UI.
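To make the suggestion concrete, here is a toy sketch of what "key on IP, show the hostname only in the UI" could look like. This is NOT how Ganglia stores data (that is exactly what I'm asking about); `NodeStore` and its methods are hypothetical names, and the metric names and values are invented for illustration.

```python
import ipaddress


class NodeStore:
    """Toy metric store keyed by IP; hostnames are display labels only."""

    def __init__(self):
        self._metrics = {}  # canonical ip string -> {metric name: value}
        self._labels = {}   # canonical ip string -> last reported hostname

    @staticmethod
    def _key(ip):
        # Normalize the address so every reporter lands on the same key.
        return str(ipaddress.ip_address(ip))

    def record(self, ip, metric, value, hostname=None):
        key = self._key(ip)
        self._metrics.setdefault(key, {})[metric] = value
        if hostname:
            self._labels[key] = hostname

    def display_name(self, ip):
        # Fall back to the IP itself when no hostname is known.
        key = self._key(ip)
        return self._labels.get(key, key)

    def metrics(self, ip):
        return dict(self._metrics.get(self._key(ip), {}))


store = NodeStore()
# One report arrives with no hostname (no /etc/hosts entry on that box)...
store.record("10.0.0.12", "load_one", 0.42)
# ...and another arrives with the name the node knows itself by.
store.record("10.0.0.12", "cpu_user", 3.0, hostname="db3.foo.com")

print(store.display_name("10.0.0.12"), store.metrics("10.0.0.12"))
```

Both reports end up under the same key, so a missing hosts entry only changes the label shown, not where the data is filed.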
Thoughts?
Kevin
--
Kevin A. Burton, Location - San Francisco, CA
AIM/YIM - sfburtonator, Web - http://peerfear.org/
GPG fingerprint: 5FB2 F3E2 760E 70A8 6174 D393 E84D 8D04 99F1 4412