https://bugzilla.wikimedia.org/show_bug.cgi?id=51497

--- Comment #1 from Antoine "hashar" Musso <[email protected]> ---
A breakdown of the useful monitoring systems:

Icinga
======

The puppet manifests already define Icinga checks for a lot of service, that is
done via the global define monitor_service.  As an example, Varnish instances
are blessed with:

    monitor_service { "varnish http ${title}":
        description => "Varnish HTTP ${title}",
        check_command => "check_http_generic!varnishcheck!${port}"

    }


Which adds the monitoring on icinga.wikimedia.org.

We could get ops involved in setting up the labs instance for beta and do the
configuration hack that would prevent paging but drop emails|messages instead.


Ganglia
=======

All labs instances are automatically added in a Ganglia instance:

http://ganglia.wmflabs.org/latest/?r=hour&s=by+name&c=deployment-prep&tab=m

That seems to cover our needs.

Graphite
========

That would be very nice to have, specially the profiling bits.  That project
does not have any documentation beside the puppet manifests though.  Probably
lower priority compared to Icinga.

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
_______________________________________________
Wikibugs-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to