On Tue, Mar 18, 2008 at 08:44:13PM -0400, Javier Frias wrote:
> So i guess searching the lists *well* should have been my first recourse...
> 
> http://lists.danga.com/pipermail/mogilefs/2007-June/001043.html
> 
> mentions that the stats command is pretty heavy.. I'd assume this is
> still the case.. so... the follow up question would be,
> 
> how does everyone here monitor mogilefs?
> 
> I have db monitors, and port monitors, and a simple transaction
> monitor is in the works ( write a file, delete a file ), but it'd be
> nice to map growth across domains/classes.
I used to graph this in Munin (run the query every 5 minutes, store in
RRD):
SELECT COUNT(fid), dmid, classid FROM file GROUP BY dmid, classid;
(which the dmid/classid cached).
So all domain/classes on the same graph.
I stopped doing it as due to the differences in magnitude of numbers and
growth rate, I needed to have multiple separate graphs, and I just
couldn't be bothered.

Instead, I do use the per-database graphs that Munin has for Postgresql.
postgres_block_read_
postgres_commits_
postgres_queries_
postgres_space_

This actually raises one interesting bit that I'm not sure if anybody
else has seen. Approximately once a day, Mogile is doing a SELECT query
that returns a massive 50-80k rows, while the normal 5-minute average is
~500.

Beyond that performance monitoring of PostGres, I do have a lot of stuff
watched via Nagios - daemon running and port connection tests for
all the mogilefsd nodes (3), and all the mogstored nodes (8), and the
haproxy nodes (local on each web client).

*haproxy: None of the MogileFS client code does load-balancing/failover
between MogileFS instances very well, so we use haproxy on
loopback on each of our web nodes. If you want to just contact the
Mogile system, instead of looking for a mogilefsd instance that is up,
you just hit localhost:7001, and it directs you to one that IS actually
up. haproxy keeps state of ones that are up, so it works well. Doing it
on loopback cuts down on any latency and failure issue we might have if
we were to have it on a standalone system.

-- 
Robin Hugh Johnson
Gentoo Linux Developer & Infra Guy
E-Mail     : [EMAIL PROTECTED]
GnuPG FP   : 11AC BA4F 4778 E3F6 E4ED  F38E B27B 944E 3488 4E85

Attachment: pgpdwuHgHjBV8.pgp
Description: PGP signature

Reply via email to