https://bugzilla.wikimedia.org/show_bug.cgi?id=48338

--- Comment #1 from Tim Landscheidt <[email protected]> ---
1. Apparently, the Puppet modules are structured the other way round: A module
typically has a ::monitoring class that adds the metric gathering to the node.
At Tools, the proper class would probably be gridengine::master::monitoring, to
be deployed exactly once per SGE cluster.

2. No report has to be defined on the Ganglia side at all.  If one feeds it
data, it will make sense of it on its own.

3. As a test, I have set up ~scfc/bin/sge_jobs.pl to be run on tools-login
every fifteen minutes.  It gathers information on pending, running and error
jobs, and submits it to Ganglia.  The graphs can be found at
http://ganglia.wmflabs.org -> tools -> tools-login ->
sge_pending/sge_running/sge_error.  I intend to leave it running for a few days
before puppetizing.
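
For illustration, the approach in item 3 could be sketched roughly as follows.
This is not the actual sge_jobs.pl (which is Perl and not shown here), only a
minimal Python sketch under several assumptions: that "qstat -u '*'" prints
the job state in the fifth column, that any state containing "E" counts as an
error job, that states containing "q" or "w" are pending, and that the
function names are made up for this example. The metric names are the ones
mentioned above; gmetric is Ganglia's command-line tool for submitting
metrics.

```python
import subprocess

METRICS = ("sge_pending", "sge_running", "sge_error")


def classify(state):
    """Map an SGE state string to a metric name (assumed mapping)."""
    if "E" in state:                    # e.g. "Eqw"
        return "sge_error"
    if "q" in state or "w" in state:    # e.g. "qw", "hqw"
        return "sge_pending"
    if "r" in state or "t" in state:    # running or transferring
        return "sge_running"
    return None


def count_jobs(qstat_output):
    """Count jobs per metric from the text printed by qstat."""
    counts = dict.fromkeys(METRICS, 0)
    for line in qstat_output.splitlines():
        fields = line.split()
        # Skip the header, the separator line and anything unexpected.
        if len(fields) < 5 or not fields[0].isdigit():
            continue
        metric = classify(fields[4])
        if metric is not None:
            counts[metric] += 1
    return counts


def gmetric_command(name, value):
    """Build the gmetric call that submits one value to Ganglia."""
    return ["gmetric", "--name", name, "--value", str(value),
            "--type", "uint32", "--units", "jobs"]


def main():
    # List the jobs of all users, then submit one metric per category.
    result = subprocess.run(["qstat", "-u", "*"], capture_output=True,
                            text=True, check=True)
    for name, value in count_jobs(result.stdout).items():
        subprocess.run(gmetric_command(name, value), check=True)


if __name__ == "__main__":
    main()
```

Run from cron every fifteen minutes (a "*/15 * * * *" schedule), such a
script would feed the three graphs without any definition on the Ganglia
side, as per item 2.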

