We use LMT as well as Ganglia + collectl; Nagios for system health, hardware health, and cluster health (crm_mon -s); and Splunk for monitoring and reviewing log messages.
Erik

On Thu, Sep 30, 2010 at 7:36 AM, Temple Jason <[email protected]> wrote:
> We use ganglia with collectl. These versions are the only ones I could find
> to work in this way:
>
> Sep 30 13:35 [r...@wn125:~]# rpm -qa |grep collectl
> collectl-3.4.2-5
> Sep 30 13:35 [r...@wn125:~]# rpm -qa |grep ganglia
> ganglia-gmond-3.1.7-1
>
> We are quite happy with it.
>
> Thanks,
>
> Jason
>
> -----Original Message-----
> From: [email protected]
> [mailto:[email protected]] On Behalf Of Andreas Davour
> Sent: Thursday, 30 September 2010 11:47
> To: [email protected]
> Subject: [Lustre-discuss] How do you monitor your lustre?
>
>
> I ask because the lmt project seems to be quite moribund. Anyone else out there
> doing something?
>
> /andreas
> --
> Systems Engineer
> PDC Center for High Performance Computing
> CSC School of Computer Science and Communication
> KTH Royal Institute of Technology
> SE-100 44 Stockholm, Sweden
> Phone: 087906658
> "A satellite, an earring, and a dust bunny are what made America great!"
> _______________________________________________
> Lustre-discuss mailing list
> [email protected]
> http://lists.lustre.org/mailman/listinfo/lustre-discuss
