Here's the configuration:

One CISCO gigabit Ethernet switch. Plugged into this switch is zappa.georgetown.edu, which is the master node of a 16 node OSCAR Beowulf cluster, hendrix.arc.georgetown.edu, which is a stand-alone computer (that also answers to time.arc.georgetown.edu), and john.arc.georgetown.edu, paul.arc.georgetown.edu, george.arc.georgetown.edu and ringo.arc.georgetown.edu, which are all stand-alone nodes that happen to share a filesystem and use PVM, but they are NOT a classic Beowulf cluster. I have other machines elsewhere on campus all running gmonds.

Here's the problem: hendrix is my only gmetad installation (you can see it at www.guppi.georgetown.edu), and it is supposed to report on all clusters and all stand-alone machines on my 'grid.' Clusters report their Master node on the top-level, and you can tunnel down to see the compute nodes. This has been working just fine up until now.

When I tried to add john, paul, george and ringo, and have them send their data to hendrix to be displayed, each of them will speak for themselves, as well as for each other, AND hendrix (or time)!! For example, john (which is a dual-processor machine) will report that it has 5 nodes and 10 processors, and if you tunnel down on john you'll find john (again), george, paul, and ringo, as well as either hendrix or time. Additionally, hendrix will not report for itself. I've tried making each of these machines 'deaf' in the config file, but then they disappear entirely.

What am I doing wrong? Things have been going so well with ganglia here up until now, but I don't see anything in the docs to help me. How can I make each of these machines ignore the others and just report their local information to hendrix?? At first I thought it was an issue with sharing the switch, but if it were zappa would be involved. Then I thought it was a PVM thing, but then why is hendrix involved???

Arnie Miles
Georgetown University
[EMAIL PROTECTED]


Reply via email to