Re: [Ganglia-general] gmetad xml output is incomplete sometimes

2010-06-28 Thread Miguel A.
Hi Bernard.

Right now I'm only monitoring 5 hosts.
- 2 of the 5 are switches with only 3 metrics each. I publish those with
  3 gmetric calls every minute.
- 3 of the 5 are hosts with the default metrics and default time values.

The problem appears in both cases: switches and hosts.

Looking at gmetad in debug mode, I noticed 3 events (updating, writing,
clearing). Maybe those events are related to my problem (perhaps the
clearing event).

Thanks,
Miguel.

On Fri, 25-06-2010 at 10:57 -0700, Bernard Li wrote:
 Hi Miguel:
 
 How many hosts and metrics are you monitoring with your gmetad?
 
 Cheers,
 
 Bernard
 
 2010/6/25 Miguel A. miguelangel.d...@ciemat.es:
  Hi.
 
  I'm getting the XML output from gmetad and saving it to a file.
  Sometimes the XML output contains more machines than at other times.
  For example, at 2 p.m. the XML output is
  <grid>
   <cluster1>
    <host 1>
    <host 2>
    <host 3>
   </cluster1>
  </grid>
 
  One minute later, the XML output is (for example)
  <grid>
   <cluster1>
    <host 1>
   </cluster1>
  </grid>
 
  But another minute later, the XML output is (for example)
  <grid>
   <cluster1>
    <host 1>
    <host 2>
    <host 3>
   </cluster1>
  </grid>
 
  I have checked that the hosts were running and that they were OK. I think
  gmetad only shows updated data, but I'm not sure. Do you know why gmetad
  occasionally shows only part of the data instead of all of it?
 
  Regards
  Miguel.
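
The symptom described above (hosts disappearing from one saved snapshot
and reappearing in the next) can be checked mechanically. The following is
a minimal sketch, not part of the original thread. It assumes the saved
files follow the usual gmetad layout of CLUSTER elements containing HOST
elements, each with a NAME attribute; the function names are hypothetical.

```python
import xml.etree.ElementTree as ET


def hosts_by_cluster(xml_text):
    """Map each CLUSTER name to the set of HOST names in one XML snapshot."""
    root = ET.fromstring(xml_text)
    return {
        cluster.get("NAME"): {host.get("NAME") for host in cluster.iter("HOST")}
        for cluster in root.iter("CLUSTER")
    }


def missing_hosts(previous, current):
    """Return hosts seen in the previous snapshot but absent from the current one."""
    missing = {}
    for cluster, hosts in previous.items():
        gone = hosts - current.get(cluster, set())
        if gone:
            missing[cluster] = sorted(gone)
    return missing
```

Running missing_hosts() on each consecutive pair of snapshots gives a
timestamped list of the hosts that dropped out, which can then be lined up
against the gmetad debug output.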
 
  
  
 
 
 
 
 

--
This SF.net email is sponsored by Sprint
What will you do first with EVO, the first 4G phone?
Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


Re: [Ganglia-general] gmetad xml output is incomplete sometimes

2010-06-28 Thread Martin Knoblauch
Hi Miguel,

 just to rule that out: check the data_source lines in your gmetad.conf to make 
sure that gmetad is not querying its own XML port. That could result in 
incomplete/broken XML. And yes, we have seen it before :-)

 Cheers
Martin
--
Martin Knoblauch
email: k n o b i AT knobisoft DOT de
www:   http://www.knobisoft.de
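
The check suggested above can be scripted. This is a minimal sketch under
stated assumptions, not an official Ganglia tool: it assumes the usual
data_source syntax (a quoted name, an optional polling interval, then
host[:port] entries with 8649 as the default port) and that gmetad's
xml_port is left at its default of 8651. The function names and the
LOCAL_NAMES set are hypothetical; add the gmetad machine's own hostname and
addresses to that set for a real check. IPv6 addresses are not handled.

```python
import shlex

# Names treated as "this machine" (assumption: extend with the real hostname).
LOCAL_NAMES = {"localhost", "127.0.0.1"}


def parse_data_sources(conf_text):
    """Yield (name, [(host, port), ...]) for each data_source line."""
    for line in conf_text.splitlines():
        tokens = shlex.split(line, comments=True)  # drops '#' comments
        if len(tokens) < 3 or tokens[0] != "data_source":
            continue
        name, rest = tokens[1], tokens[2:]
        if rest[0].isdigit():  # optional polling interval in seconds
            rest = rest[1:]
        sources = []
        for address in rest:
            host, _, port = address.partition(":")
            sources.append((host, int(port) if port else 8649))
        yield name, sources


def self_queries(conf_text, xml_port=8651):
    """Return (name, host, port) entries that point at gmetad's own XML port."""
    return [
        (name, host, port)
        for name, sources in parse_data_sources(conf_text)
        for host, port in sources
        if host in LOCAL_NAMES and port == xml_port
    ]
```

An empty list from self_queries(open("/etc/ganglia/gmetad.conf").read())
would rule this cause out, at least for the names listed in LOCAL_NAMES.
The configuration path is also an assumption and varies by installation.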



[Ganglia-general] Regarding change in Default Polling time

2010-06-28 Thread Anand, Abhishek A
Hello Everyone ,

I have installed Ganglia on a 4-node test cluster. I want to change the
regular polling interval from 15 to 300 seconds, so I made this change in
gmetad.conf:

data_source my cluster 300 localhost

(This is the only change I made to the configuration.)

After doing that, my graphs look broken. Can anyone tell me whether I need
to make any additional configuration changes?

I will be deploying this on a 400-node performance cluster, and I would also
like to know what changes I can make to minimize Ganglia's impact on the
cluster.

Kind Regards,

Abhishek Anand
Software  Services Group/DRD
I N T E L Corp
DP3-307-H7
2800 Center Dr
DuPont WA 98327
Email: abhishek.a.an...@intel.com
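
One thing that may be worth checking, offered only as a possible
explanation and not confirmed anywhere in this thread: RRDtool marks a data
source as unknown whenever the gap between two updates exceeds that data
source's heartbeat. If the existing RRD files were created while polling
every 15 seconds, their heartbeat may be shorter than the new 300-second
interval. The sketch below only illustrates that rule. The function name is
hypothetical, and the heartbeat values in the example are placeholders; the
real step and minimal_heartbeat can be read with `rrdtool info` on one of
the RRD files.

```python
def check_poll_interval(poll_interval_s, rrd_heartbeat_s):
    """Compare a polling interval with an RRD data source heartbeat.

    RRDtool treats a value as unknown when the time between two updates
    is longer than the heartbeat, which shows up as gaps in the graphs.
    """
    if poll_interval_s <= rrd_heartbeat_s:
        return "ok"
    return "gaps"


# Placeholder numbers, not measured values from any real installation.
print(check_poll_interval(15, 120))   # polling well inside the heartbeat
print(check_poll_interval(300, 120))  # polling slower than the heartbeat
```

If the check reports gaps for the real values, the existing RRD files would
need a larger heartbeat (or to be recreated) before the slower polling
interval can produce continuous graphs.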



Re: [Ganglia-general] 3.1.7 installation from tar notes - Ubuntu - and a repeatable Segfault

2010-06-28 Thread Bernard Li
Hi Whit:

On Sat, Jun 19, 2010 at 10:38 AM, Whit Blauvelt w...@transpect.com wrote:

 Let me first off say I hugely appreciate Ganglia. In the old spirit of
 reporting glitches so others may benefit, I'd like to add this brief update
 to my earlier notes on installing on CentOS. This time it's a
 build-from-source install on Ubuntu 10.04 LTS server.

Thanks, and we appreciate your comments.

   cp mans/* /usr/share/man/man1
   cp gmond/gmond.conf.5 /usr/share/man/man5

This is being looked at -- by default manpages should be installed,
see this bug:

http://bugzilla.ganglia.info/cgi-bin/bugzilla/show_bug.cgi?id=24

   cp gmond/gmond.init /etc/init.d/gmond

This is a bit tougher, since we'll need to write detection code to
figure out what OS you're running.  Remember Ganglia is supported on
multiple platforms.
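
As a rough illustration of what such detection code might look like, here
is a minimal sketch. It is not the approach the Ganglia build system
actually uses. It assumes that the presence of /etc/debian_version or
/etc/redhat-release is a good enough hint of the distribution family; the
function name and the returned labels are hypothetical.

```python
import os

# Marker files conventionally present on each distribution family.
MARKERS = [
    ("/etc/debian_version", "debian"),  # Debian, Ubuntu
    ("/etc/redhat-release", "redhat"),  # RHEL, CentOS, Fedora
]


def detect_family(exists=os.path.exists):
    """Return a distribution family label, or "unknown" if no marker is found.

    The exists parameter is injectable so the logic can be tested without
    touching the real filesystem.
    """
    for path, family in MARKERS:
        if exists(path):
            return family
    return "unknown"
```

An install step could use the returned label to decide which init script
template to copy, and fall back to installing none when the result is
"unknown".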

   mkdir /etc/ganglia
   cd gmond/modules
   cp -a conf.d /etc/ganglia
   rm /etc/ganglia/conf.d/example.conf

Again, you may not always want to copy all the configuration files as
not all modules are supported on all platforms.  Perhaps a separate
`make install_gmond_modules` can be created for users who would want
to install them.

   cd /etc/ganglia
   gmond -t > gmond.conf

This is also being looked at.

 3. If you'd like to see gmond segfault, after the cp -a conf.d /etc/ganglia
   step, but before the gmond -t > gmond.conf step, issue a
   gmond -m. Now, since gmond -t merely makes explicit the implicit gmond
   configuration settings, why should it segfault just if the module
   conf.d files are there when the gmond.conf isn't yet??

Please see this bug report:

http://bugzilla.ganglia.info/cgi-bin/bugzilla/show_bug.cgi?id=259

 4. Create an Ubuntu-style /etc/init.d/gmond file. This is left as something
   of an exercise for the reader. However, at the end here you'll find a
   stripped-down version that works for me - but beware it's inelegant and
   not rigorously tested.

You can probably just get the official one from Ubuntu/Debian.

 When I posted my prior notes to the CentOS list I got seriously flamed for
 building from tar rather than running through rpmbuild. I hope building from
 tar isn't effectively deprecated, since in my experience it's always been
 the best way to be sure of getting the most recent version of a critical
 daemon right. My rule of thumb is to stay with distros for anything they've
 done well and are current enough with, and build-by-hand anything where
 their compile options or less-than-current version doesn't fit local needs.
 That's generally at most a few daemons on any particular system. I always
 leave the libraries and so on stock distro. The storm from some of my peers,
 who now regard this as an improper method, surprised me.

Building from source is definitely not deprecated, since all packages
are based on it.  However, the manual operations that you have
outlined above are usually taken care of by the packager, and thus are
hidden from end users on platforms where a binary package is
available.

I understand that you're trying to install Ganglia on CentOS and
Ubuntu systems.  For CentOS, I would suggest that you build the RPMs
from the tarball.  The spec file is kept fairly up to date and you
don't need to go through any of the manual processes you have
mentioned.  Additionally the package can then be tracked via RPM which
is probably what you want to do on a RPM-based system anyway.

For Ubuntu -- I don't see official packages for 3.1.7 but they are
available from Debian sid, so perhaps you can use those:

http://packages.debian.org/sid/ganglia-monitor

 So does my experience count as finding a bug which should be reported in the
 3.1.7 makefile (not to mention that segfault), or is building from tar
 something the Ganglia maintainers feel should be discouraged? For most major
 projects it remains the most-supported installation path, as it has long
 been.

Please feel free to file bugs that you believe I have not addressed in
this email.  We will continue to improve the building from source
experience.

Thanks,

Bernard



[Ganglia-general] Overlay deploy timeline on Ganglia graphs

2010-06-28 Thread Vladimir Vuksan
Thought some people may be interested :-)

http://vuksan.com/blog/2010/06/28/overlay-deploy-timeline-on-your-ganglia-graphs/

You should be able to overlay any type of a change event.

Vladimir