Re: [Ganglia-general] gmetad xml output is incomplete sometimes
Hi Bernard. Now, I'm only monitoring 5 host. -2/5 are switches and only have 3 metrics. To do that I'm using 3 gmetric call every minute. -3/5 are hosts with the default metrics and default time values. The problem appears in both cases: switches and hosts. Seeing debug mode of gmetad, I noticed 3 events (updating, writing, clearing). Maybe those events are relationed with my problem (perhaps clearing event). Thanks, Miguel. El vie, 25-06-2010 a las 10:57 -0700, Bernard Li escribió: Hi Miguel: How many hosts and metrics are you monitoring with your gmetad? Cheers, Bernard 2010/6/25 Miguel A. miguelangel.d...@ciemat.es: Hi. I'm getting the XML output from gmetad and saving it in a file. Sometimes, the output XML has more machine than others. For example, At 2 p.m the xml output is grid cluster1 host 1 host 2 host 3 /cluster1 /grid And one minute later, the xml output is (for example) grid cluster1 host 1 /cluster1 /grid But other minute later, the xml output is (for example) grid cluster1 host 1 host 2 host 3 /cluster1 /grid I have revised that hosts were running and they were ok. I think gmetad only shows updated data, but I'm not sure. Do you know why gmetad occassionally shows some piece of data and not all of them? Regards Miguel. Confidencialidad: Este mensaje y sus ficheros adjuntos se dirige exclusivamente a su destinatario y puede contener información privilegiada o confidencial. Si no es vd. el destinatario indicado, queda notificado de que la utilización, divulgación y/o copia sin autorización está prohibida en virtud de la legislación vigente. Si ha recibido este mensaje por error, le rogamos que nos lo comunique inmediatamente respondiendo al mensaje y proceda a su destrucción. Disclaimer: This message and its attached files is intended exclusively for its recipients and may contain confidential information. If you received this e-mail in error you are hereby notified that any dissemination, copy or disclosure of this communication is strictly prohibited and may be unlawful. In this case, please notify us by a reply and delete this email and its contents immediately. -- ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general -- This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
Re: [Ganglia-general] gmetad xml output is incomplete sometimes
Hi Miguel, just to rule that out: check the data_source lines in your gmetad.conf to make sure that gmetad is not querying its own XML port. That could result in incomplete/broken XML. And yes, we have seen it before :-) Cheers Martin -- Martin Knoblauch email: k n o b i AT knobisoft DOT de www: http://www.knobisoft.de - Original Message From: Miguel A. Díaz Corchero miguelangel.d...@ciemat.es To: Bernard Li bern...@vanhpc.org Cc: ganglia-general@lists.sourceforge.net ganglia-general@lists.sourceforge.net Sent: Mon, June 28, 2010 8:27:50 AM Subject: Re: [Ganglia-general] gmetad xml output is incomplete sometimes Hi Bernard. Now, I'm only monitoring 5 host. -2/5 are switches and only have 3 metrics. To do that I'm using 3 gmetric call every minute. -3/5 are hosts with the default metrics and default time values. The problem appears in both cases: switches and hosts. Seeing debug mode of gmetad, I noticed 3 events (updating, writing, clearing). Maybe those events are relationed with my problem (perhaps clearing event). Thanks, Miguel. El vie, 25-06-2010 a las 10:57 -0700, Bernard Li escribió: Hi Miguel: How many hosts and metrics are you monitoring with your gmetad? Cheers, Bernard 2010/6/25 Miguel A. ymailto=mailto:miguelangel.d...@ciemat.es; href=mailto:miguelangel.d...@ciemat.es;miguelangel.d...@ciemat.es: Hi. I'm getting the XML output from gmetad and saving it in a file. Sometimes, the output XML has more machine than others. For example, At 2 p.m the xml output is grid cluster1 host 1 host 2 host 3 /cluster1 /grid And one minute later, the xml output is (for example) grid cluster1 host 1 /cluster1 /grid But other minute later, the xml output is (for example) grid cluster1 host 1 host 2 host 3 /cluster1 /grid I have revised that hosts were running and they were ok. I think gmetad only shows updated data, but I'm not sure. Do you know why gmetad occassionally shows some piece of data and not all of them? Regards Miguel. Confidencialidad: Este mensaje y sus ficheros adjuntos se dirige exclusivamente a su destinatario y puede contener información privilegiada o confidencial. Si no es vd. el destinatario indicado, queda notificado de que la utilización, divulgación y/o copia sin autorización está prohibida en virtud de la legislación vigente. Si ha recibido este mensaje por error, le rogamos que nos lo comunique inmediatamente respondiendo al mensaje y proceda a su destrucción. Disclaimer: This message and its attached files is intended exclusively for its recipients and may contain confidential information. If you received this e-mail in error you are hereby notified that any dissemination, copy or disclosure of this communication is strictly prohibited and may be unlawful. In this case, please notify us by a reply and delete this email and its contents immediately. -- ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: href=http://p.sf.net/sfu/thinkgeek-promo; target=_blank http://p.sf.net/sfu/thinkgeek-promo ___ Ganglia-general mailing list ymailto=mailto:Ganglia-general@lists.sourceforge.net; href=mailto:Ganglia-general@lists.sourceforge.net;Ganglia-general@lists.sourceforge.net target=_blank https://lists.sourceforge.net/lists/listinfo/ganglia-general -- This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- href=http://p.sf.net/sfu/sprint-com-first; target=_blank http://p.sf.net/sfu/sprint-com-first ___ Ganglia-general mailing list href=mailto:Ganglia-general@lists.sourceforge.net;Ganglia-general@lists.sourceforge.net href=https://lists.sourceforge.net/lists/listinfo/ganglia-general; target=_blank https://lists.sourceforge.net/lists/listinfo/ganglia-general -- This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net
[Ganglia-general] Regarding change in Default Polling time
Hello Everyone , I have installed on a 4 node test cluster. I wish to change (from 15 to 300 second) the regular polling time. Once I make that change in gmetad.conf data_source my cluster 300 localhost (This is the only change I made in configuration ) After doing that my graphs are kind of broken. Please can anyone help me if I need to make any additional configuration change. I will be implementing this on 400 node performance cluster and what changes I can make to minimize ganglia impact on Cluster. Kind Regards, Abhishek Anand Software Services Group/DRD I N T E L Corp DP3-307-H7 2800 Center Dr DuPont WA 98327 Email: abhishek.a.an...@intel.com -- This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
Re: [Ganglia-general] 3.1.7 installation from tar notes - Ubuntu - and a repeatable Segfault
Hi Whit: On Sat, Jun 19, 2010 at 10:38 AM, Whit Blauvelt w...@transpect.com wrote: Let me first off say I hugely appreciate Ganglia. In the old spirit of reporting glitches so others may benefit, I'd like to add this brief update to my earlier notes on installing on CentOS. This time it's a build-from-source install on Ubuntu 10.04 LTS server. Thanks, and we appreciate your comments. cp mans/* /usr/share/man/man1 cp gmond/gmond.conf.5 /usr/share/man/man5 This is being looked at -- by default manpages should be installed, see this bug: http://bugzilla.ganglia.info/cgi-bin/bugzilla/show_bug.cgi?id=24 cp gmond/gmond.init /etc/init.d/gmond This is a bit tougher, since we'll need to write detection code to figure out what OS you're running. Remember Ganglia is supported on multiple platforms. mkdir /etc/ganglia cd gmond/modules cp -a conf.d /etc/ganglia rm /etc/ganglia/conf.d/example.conf Again, you may not always want to copy all the configuration files as not all modules are supported on all platforms. Perhaps a separate `make install_gmond_modules` can be created for users who would want to install them. cd /etc/ganglia gmond -t gmond.conf This is also being looked at. 3. If you'd like to see gmond segfault, after the cp -a conf.d etc/ganglia /step, but before the gmond -t gmond.conf step, issue a gmond -m. Now, since gmond -t merely makes explicit the implicit gmond configuration settings, why should it segfault just if the module conf.d files are there when the gmond.conf isn't yet?? Please see this bug report: http://bugzilla.ganglia.info/cgi-bin/bugzilla/show_bug.cgi?id=259 4. Create an Ubuntu-style /etc/init.d/gmond file. This is left as something of an exercise for the reader. However, at then end here you'll find a stripped-down version that works for me - but beware it's inelegant and not rigorously tested. You can probably just get the official one from Ubuntu/Debian. When I posted my prior notes to the CentOS list I got seriously flamed for building from tar rather than running through rpmbuild. I hope building from tar isn't effectively depricated, since in my experience it's always been the best way to be sure of getting the most recent version of a critical daemon right. My rule of thumb is to stay with distros for anything they've done well and are current enough with, and build-by-hand anything where their compile options or less-than-current version doesn't fit local needs. That's generally at most a few daemons on any particular system. I always leave the libraries and so on stock distro. The storm from some of my peers, who now regard this as an inproper method, surprised me. Building from source is definitely not deprecated, since all packages are based on it. However, the manual operations that you have outlined above are usually taken care of by the packager, and thus are hidden from end users on platforms where a binary package is available. I understand that you're trying to install Ganglia on CentOS and Ubuntu systems. For CentOS, I would suggest that you build the RPMs from the tarball. The spec file is kept fairly up to date and you don't need to go through any of the manual processes you have mentioned. Additionally the package can then be tracked via RPM which is probably what you want to do on a RPM-based system anyway. For Ubuntu -- I don't see official packages for 3.1.7 but they are available from Debian sid, so perhaps you can use those: http://packages.debian.org/sid/ganglia-monitor So does my experience count as finding a bug which should be reported in the 3.1.7 makefile (not to mention that segfault), or is building from tar something the Ganglia maintainers feel should be discouraged? For most major projects it remains the most-supported installation path, as it has long been. Please feel free to file bugs that you believe I have not addressed in this email. We will continue to improve the building from source experience. Thanks, Bernard -- This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
[Ganglia-general] Overlay deploy timeline on Ganglia graphs
Thought some people may be interested :-) http://vuksan.com/blog/2010/06/28/overlay-deploy-timeline-on-your-ganglia-graphs/ You should be able to overlay any type of a change event. Vladimir -- This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general