On Sun, 2007-06-17 at 15:23 -0400, Patterson, Ron (NIH/NLM/NCBI) [C]
wrote:

> For example, if I use the following:
> 
> /usr/bin/gmetric --conf=/etc/gmond.conf -nmystat -v1.0 -tfloat -uunits
> 
> Run every minute from all 200 hosts in my cluster, I will see gmond
> start growing in size almost immediately. 

Have you tried looking at the metrics stored in memory on a single host?
You can just run something like "telnet localhost 8649" to get a dump of
all the XML data.  The reason I suggest this is to get an idea as to
what is causing the memory consumption.  If you find that the size of
the metric data stays roughly the same, then that would seem to point to
a memory leak issue.

If you found that the size of the metric data kept growing (and hence
was responsible for using up the memory), then maybe something weird is
happening and gmond is somehow treating each new incoming data point as
a new metric.  Then it would keep accumulating values in memory instead
of overwriting the previous value.

Of course, your example above seems pretty straight forward so I would
be surprised if the latter were true.  I'm betting that your diagnosis
of a memory leak is more likely correct.  This simple test would at
least rule out the second possibility.

-- 
Rick Mohr
Systems Developer
Ohio Supercomputer Center


-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
Ganglia-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/ganglia-general

Reply via email to