On Tue, Mar 18, 2008 at 08:44:13PM -0400, Javier Frias wrote: > So i guess searching the lists *well* should have been my first recourse... > > http://lists.danga.com/pipermail/mogilefs/2007-June/001043.html > > mentions that the stats command is pretty heavy.. I'd assume this is > still the case.. so... the follow up question would be, > > how does everyone here monitor mogilefs? > > I have db monitors, and port monitors, and a simple transaction > monitor is in the works ( write a file, delete a file ), but it'd be > nice to map growth across domains/classes. I used to graph this in Munin (run the query every 5 minutes, store in RRD): SELECT COUNT(fid), dmid, classid FROM file GROUP BY dmid, classid; (which the dmid/classid cached). So all domain/classes on the same graph. I stopped doing it as due to the differences in magnitude of numbers and growth rate, I needed to have multiple separate graphs, and I just couldn't be bothered.
Instead, I do use the per-database graphs that Munin has for Postgresql. postgres_block_read_ postgres_commits_ postgres_queries_ postgres_space_ This actually raises one interesting bit that I'm not sure if anybody else has seen. Approximately once a day, Mogile is doing a SELECT query that returns a massive 50-80k rows, while the normal 5-minute average is ~500. Beyond that performance monitoring of PostGres, I do have a lot of stuff watched via Nagios - daemon running and port connection tests for all the mogilefsd nodes (3), and all the mogstored nodes (8), and the haproxy nodes (local on each web client). *haproxy: None of the MogileFS client code does load-balancing/failover between MogileFS instances very well, so we use haproxy on loopback on each of our web nodes. If you want to just contact the Mogile system, instead of looking for a mogilefsd instance that is up, you just hit localhost:7001, and it directs you to one that IS actually up. haproxy keeps state of ones that are up, so it works well. Doing it on loopback cuts down on any latency and failure issue we might have if we were to have it on a standalone system. -- Robin Hugh Johnson Gentoo Linux Developer & Infra Guy E-Mail : [EMAIL PROTECTED] GnuPG FP : 11AC BA4F 4778 E3F6 E4ED F38E B27B 944E 3488 4E85
pgpdwuHgHjBV8.pgp
Description: PGP signature
