----- Original Message ----
> From: Scott Dworkis <[email protected]>
> To: Martin Knoblauch <[email protected]>
> Cc: [email protected]
> Sent: Wed, February 17, 2010 8:32:32 PM
> Subject: Re: [Ganglia-general] gmond memory leaks
>
> 3.1.2 on gentoo (that solaris must be a sourceforge ad?). i have zero
> experience with valgrind... i'll have a look but a smidge of guidance
> would be appreciated. :)
>

Just get valgrind and run the leaking "gmond" under its control. "gmond"
should be configured not to run in the background. After some time, interrupt
it and you will get a report of valgrind's findings. For example, a simple
program leaking 8x1MB will produce:

[mknob...@l6g0223j ~]$ valgrind ./memeat
==13647== Memcheck, a memory error detector.
==13647== Copyright (C) 2002-2006, and GNU GPL'd, by Julian Seward et al.
==13647== Using LibVEX rev 1658, a library for dynamic binary translation.
==13647== Copyright (C) 2004-2006, and GNU GPL'd, by OpenWorks LLP.
==13647== Using valgrind-3.2.1, a dynamic binary instrumentation framework.
==13647== Copyright (C) 2000-2006, and GNU GPL'd, by Julian Seward et al.
==13647== For more details, rerun with: -v
==13647==
^C
==13647==
==13647== ERROR SUMMARY: 0 errors from 0 contexts (suppressed: 5 from 1)
==13647== malloc/free: in use at exit: 8,000,000 bytes in 8 blocks.
==13647== malloc/free: 8 allocs, 0 frees, 8,000,000 bytes allocated.
==13647== For counts of detected errors, rerun with: -v
==13647== searching for pointers to 8 not-freed blocks.
==13647== checked 66,440 bytes.
==13647==
==13647== LEAK SUMMARY:
==13647==    definitely lost: 8,000,000 bytes in 8 blocks.
==13647==      possibly lost: 0 bytes in 0 blocks.
==13647==    still reachable: 0 bytes in 0 blocks.
==13647==         suppressed: 0 bytes in 0 blocks.
==13647== Use --leak-check=full to see details of leaked memory.

If you use "--leak-check=full", it will tell you where the leaking memory
was allocated. For that, "gmond" needs to be compiled with debug info (-g).

A few questions.

- What is your setup? I assume quite a few hosts collecting metrics
  (collectors) and one aggregating the results.
- Which of the "gmond"s leak? The "collectors", the "aggregator" or both?

Cheers
Martin

> yeah 150k metrics is a lot... i have an interest in scaling this thing.
> i'll post another thread bout things i've done to scale so far that seem
> to be working well.
>
> On Wed, 17 Feb 2010, Martin Knoblauch wrote:
>
> > Hi Scott,
> >
> > which version of Ganglia and which operating environment do you have
> > (guessing Solaris from your signature :-)? Any chance that you could run
> > valgrind or equivalent on your setup? 10GB/day is a lot, as is 150k
> > metrics.
> >
> > Cheers
> > Martin
> > ------------------------------------------------------
> > Martin Knoblauch
> > email: k n o b i AT knobisoft DOT de
> > www: http://www.knobisoft.de
> >
> > ----- Original Message ----
> >> From: Scott Dworkis
> >> To: [email protected]
> >> Sent: Wed, February 17, 2010 3:08:26 AM
> >> Subject: [Ganglia-general] gmond memory leaks
> >>
> >> (sorry if this is a repost... i tried previously without having first
> >> subscribed to the list, and fear i got lost somewhere along the moderation
> >> path)
> >>
> >> hi all - i am seeing gmond leak about 10GB/day on about 150k metrics
> >> collected. it seemed like things worsened when i added dmax to all my
> >> custom metrics, but maybe it was always bad. is this a known issue?
> >>
> >> sorry if it is already known... i couldn't see that there was a good way
> >> to search the forums or if there is a bug tracker to search.
> >>
> >> -scott
> >>
> >> ------------------------------------------------------------------------------
> >> SOLARIS 10 is the OS for Data Centers - provides features such as DTrace,
> >> Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW
> >> http://p.sf.net/sfu/solaris-dev2dev
> >> _______________________________________________
> >> Ganglia-general mailing list
> >> [email protected]
> >> https://lists.sourceforge.net/lists/listinfo/ganglia-general

