On Tue, Aug 26, 2008 at 09:56:21PM -0400, Ofer Inbar wrote:
> 
> I see some discussion of moving away from using hostnames as directory
> names to store RRDs in.

I think that is a misunderstanding from the proposed enhancement of making
ganglia a little more resilient to DNS failures or mismatched multihoming
interfaces and that was the original driver behind the original wishlist.

> I hope very much that doesn't happen!

Agree as the current KISS scheme is scalable (to a certain extent) easy
to manage and intuitive and a key differentiator with other solutions
that weren't as lucky as us to have Matt doing the original design.

If it ever happens it should be most likely considered only as an option
while the current scheme will be supported forever.

The possibility to implement a different scheme than the current one that
uses an ondisk tree of clusters/nodes is available (even if probably not yet
complete and missing any needed frontend logic to support it) from our
development version (trunk or 3.2.x) through the python gmetad rewrite and
its modular architecture.

> It is very very useful to be able to ls the rrd directories to see what
> hosts gmetad sees, which ones it thinks are in which cluster; to run finds
> or ls's to see the datestamps on RRD files and immediately connect them to
> hosts; and to be able to rename hosts and rename their corresponding
> directories easily.

Agree and from what I'd seen scales well to even thousands of nodes if enough
IO is provided either through very fast disks, multiple mount points, RAIDs,
SAN or even hacky alternatives like a file based loopback device or ramfs.

Some people might find it more useful (even if it will be prohibitively more
expensive) to store the metrics in some sort of DBMS and use an SQL interface
instead of simple filesystem commands, or will have problems in their setup
that prevents them from having a unique FQDN generating conflicts and so this
is just an option for them.

Don't worry that the core principles for ganglia of Simplicity, Efficiency
and Correctness are still valid; after all this all started as a way to
monitor HPC clusters in a global scale and we are still aiming for world
domination last time I checked ;).

Carlo

-------------------------------------------------------------------------
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK & win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100&url=/
_______________________________________________
Ganglia-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/ganglia-general

Reply via email to