Man...

I'm noticing tons of bugs in Ganglia... I'll log all my troubles in a bit but wanted to ask this question first.

I had two nodes... both on an internal network and both having hostnames via their /etc/hosts files.

One didn't have an /etc/hosts entry on one box and the other DID.

So dataset had assertions around a 10.0.0.12 IP but the source gmon node knew it by db3.foo.com.

The gmetad didn't appear to have the data correctly and all the rrd graph were blank.

The only way I could fix it was to sync up both /etc/hosts files between gmond nodes, restart them, then restart the gmetad node.

THEN it worked. But the problem was that in the mean time I was getting really confusing behavior without any debug info.

Does ganglia internally keep track of nodes via their gethostbyname() resolution or does it use IPs? The problem I think is that it doesn't use a canonical name internally so it can get an incorrect dataset.

I think the dataset internally should just use IP with the hostname only reflected in the UI.
Thoughts?

Kevin

--

Use Rojo (RSS/Atom aggregator). Visit http://rojo.com. Ask me for an invite! Also see irc.freenode.net #rojo if you want to chat.

Rojo is Hiring! - http://www.rojonetworks.com/JobsAtRojo.html

If you're interested in RSS, Weblogs, Social Networking, etc... then you should work for Rojo! If you recommend someone and we hire them you'll get a free iPod! Kevin A. Burton, Location - San Francisco, CA
      AIM/YIM - sfburtonator,  Web - http://peerfear.org/
GPG fingerprint: 5FB2 F3E2 760E 70A8 6174 D393 E84D 8D04 99F1 4412


Reply via email to