Hi Doug:

On Thu, Dec 17, 2009 at 10:58 AM, Douglas Wade Needham
<[email protected]> wrote:

> When we have a program connect across our 1Gbps network connection to
> this gmetad, we end up with very gappy data, if the hosts don't just
> get marked as down and the RRDs stop updating.  I have already started
> pressuring those who would approve moving our RRDs to a memory fs,
> but in the meantime... :(

Could you please elaborate on what this "program" is doing when it
connects to gmetad?

While I haven't run Ganglia under Xen instances, if I were to make a
guess, this is probably an I/O related issue.  Is there any chance you
can run the gmetad instance on a bare metal box and see if your
situation improves?  64 hosts x 40 metrics can be easily handled by a
typical server.  It is usually when you get into the high hundreds and
beyond that people usually need to implement the tmpfs workaround.

Another thing you could try is rrdcached which is available in new
versions of RRDtool.

Regarding the patch, if you are to make one, please do so against
trunk as all code contribution needs to go there, and eventually
backported to our branches.

Good luck with troubleshooting.

Cheers,

Bernard

------------------------------------------------------------------------------
This SF.Net email is sponsored by the Verizon Developer Community
Take advantage of Verizon's best-in-class app development support
A streamlined, 14 day to market process makes app distribution fast and easy
Join now and get one step closer to millions of Verizon customers
http://p.sf.net/sfu/verizon-dev2dev 
_______________________________________________
Ganglia-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/ganglia-general

Reply via email to