A quick query looking for experience of ganglia on lsf clusters.
I have being using mrtg to this point on a mixed Solaris + Linux cluster.
From an lsf perspective, the users see the cluster as either;
- entire cluster
- solaris only
- linux only
Until recently, we were using standard mrtg, without rrdtool. This
caused a problem with
averaging of cpu usage across the cluster, as mrtg really was not up to
the task ( as far as I can see ).
I am currently redoing this, and the first cut at it is to update mrtg,
implement rrd backend and
generate stats on the fly with greater intelligence.
Now I am wondering if this is simply the wrong tool for the task, and I
should be looking at something like ganglia instead,
A couple of things.
1. Can I collect both linux and solaris node data and present them to a
single linux front end.
2. Can I generate data on sub clusters, ie the linux and solaris
specific views, as well as generating
overall view across all machines.
Thanks for any insight, experience relating to this.
-Bob
The information contained in this e-mail and in any attachments is confidential
and is designated solely for the attention of the intended recipient(s). If you
are not an intended recipient, you must not use, disclose, copy, distribute or
retain this e-mail or any part thereof. If you have received this e-mail in
error, please notify the sender by return e-mail and delete all copies of this
e-mail from your computer system(s).
Please direct any additional queries to: [EMAIL PROTECTED]
Thank You.