[Ganglia-general] Running Ganglia, rrdcached on Alma 9 or Rocky 9?
Hi All, Have anyone succeed running Ganglia on a recent Alma or Rocky 9 with system and rrdcached working together? Trying to update our systems monitoring but it is somehow tricky. Any advice much appreciated! Thanks! -- Grigory Shamov Site Lead / HPC Specialist University of Manitoba and DRI Alliance Canada ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
Re: [Ganglia-general] How to check Ganglia Version
On our system: rpm -qa | grep ganglia libganglia-3.7.2-2.el7.grex.x86_64 ganglia-gmond-3.7.2-2.el7.grex.x86_64 ganglia-gmond-modules-python-3.7.2-2.el7.grex.x86_6 So, the current version is 3.7.2 ; which is fairly old, but we still use it and it does the job. -- Grigory Shamov From: Beyer, Gregory L Sent: Tuesday, September 1, 2020 10:52:01 AM To: Daniel Rich; ganglia-general@lists.sourceforge.net Subject: Re: [Ganglia-general] How to check Ganglia Version Caution: This message was sent from outside the University of Manitoba. Thank you, Rich, that command worked. As for whether Ganglia is developed any more, I found this on its githup page: Version 3.7.0 (Release date 2014-02-28) I’d conclude that something with no releases in over six years is pretty dead. Further, since 2016, activity on the mailing list has plunged to almost nothing, another sign of RIP, . __ Gregory Beyer Sr. Systems Support Engineer Partnership for Advanced Computing Environment (PACE) Georgia Institute of Technology gbey...@gatech.edu<mailto:gbey...@gatech.edu> From: Daniel Rich Date: Friday, August 28, 2020 at 2:36 PM To: "ganglia-general@lists.sourceforge.net" , "Beyer, Gregory L" Subject: Re: [Ganglia-general] How to check Ganglia Version Looks like the last release was in 2016, not 10 years ago. But there hasn’t been any update at all on their sourceforge page since 2018. http://ganglia.sourceforge.net To your original question, you should be able to query gmond for the version, for example I just ran this: > gmond -V gmond 3.6.0 Dan Rich <mailto:dr...@employees.org> http://www.employees.org/~drich/ "Step up to red alert!" "Are you sure, sir? It means changing the bulb in the sign..." - Red Dwarf (BBC) On Aug 28, 2020, 09:48 -0700, Beyer, Gregory L , wrote: Wow, so it’s not being developed or maintained any more. OK, thanks. __ Gregory Beyer Sr. Systems Support Engineer Partnership for Advanced Computing Environment (PACE) Georgia Institute of Technology gbey...@gatech.edu<mailto:gbey...@gatech.edu> From: Grigory Shamov Date: Thursday, August 27, 2020 at 3:58 PM To: "Beyer, Gregory L" , "ganglia-general@lists.sourceforge.net" Subject: Re: How to check Ganglia Version Ganglia did not update in 10 years or so, so likely you are on the latest version. Grigory Shamov From: Beyer, Gregory L Sent: Thursday, August 27, 2020 2:08:21 PM To: ganglia-general@lists.sourceforge.net Subject: [Ganglia-general] How to check Ganglia Version Caution: This message was sent from outside the University of Manitoba. I’ve inherited a Ganglia installation. How do I check what version I have installed? Thanks, __ Gregory Beyer Sr. Systems Support Engineer Partnership for Advanced Computing Environment (PACE) Georgia Institute of Technology gbey...@gatech.edu<mailto:gbey...@gatech.edu> ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
Re: [Ganglia-general] How to check Ganglia Version
Thanks, that makes a huge difference. Grigory Shamov From: Daniel Rich Sent: Friday, August 28, 2020 1:36:18 PM To: ganglia-general@lists.sourceforge.net; Beyer, Gregory L Subject: Re: [Ganglia-general] How to check Ganglia Version Caution: This message was sent from outside the University of Manitoba. Looks like the last release was in 2016, not 10 years ago. But there hasn’t been any update at all on their sourceforge page since 2018. http://ganglia.sourceforge.net To your original question, you should be able to query gmond for the version, for example I just ran this: > gmond -V gmond 3.6.0 Dan Rich <mailto:dr...@employees.org> http://www.employees.org/~drich/ "Step up to red alert!" "Are you sure, sir? It means changing the bulb in the sign..." - Red Dwarf (BBC) On Aug 28, 2020, 09:48 -0700, Beyer, Gregory L , wrote: Wow, so it’s not being developed or maintained any more. OK, thanks. __ Gregory Beyer Sr. Systems Support Engineer Partnership for Advanced Computing Environment (PACE) Georgia Institute of Technology gbey...@gatech.edu<mailto:gbey...@gatech.edu> From: Grigory Shamov Date: Thursday, August 27, 2020 at 3:58 PM To: "Beyer, Gregory L" , "ganglia-general@lists.sourceforge.net" Subject: Re: How to check Ganglia Version Ganglia did not update in 10 years or so, so likely you are on the latest version. Grigory Shamov From: Beyer, Gregory L Sent: Thursday, August 27, 2020 2:08:21 PM To: ganglia-general@lists.sourceforge.net Subject: [Ganglia-general] How to check Ganglia Version Caution: This message was sent from outside the University of Manitoba. I’ve inherited a Ganglia installation. How do I check what version I have installed? Thanks, __ Gregory Beyer Sr. Systems Support Engineer Partnership for Advanced Computing Environment (PACE) Georgia Institute of Technology gbey...@gatech.edu<mailto:gbey...@gatech.edu> ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
Re: [Ganglia-general] How to check Ganglia Version
Ganglia did not update in 10 years or so, so likely you are on the latest version. Grigory Shamov From: Beyer, Gregory L Sent: Thursday, August 27, 2020 2:08:21 PM To: ganglia-general@lists.sourceforge.net Subject: [Ganglia-general] How to check Ganglia Version Caution: This message was sent from outside the University of Manitoba. I’ve inherited a Ganglia installation. How do I check what version I have installed? Thanks, __ Gregory Beyer Sr. Systems Support Engineer Partnership for Advanced Computing Environment (PACE) Georgia Institute of Technology gbey...@gatech.edu<mailto:gbey...@gatech.edu> ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
[Ganglia-general] gmond forwarding from one internal network to another?
Hi All, I think this topic was discussed here; but somehow I am stuck with it again. I have a unicast gmond setup on our cluster. There are compute nodes on 192.168, login nodes on 10.x and 192.168 and the Ganglia server on 10.x only. I want to forward metrics from 192.168 to the server, preferably without creating custom routing schemes as it might affect other stuff on the cluster. So I have following setup: 1) Compute node's gmond are deaf and send metrics over 192.168 network to three of login nodes gmond's. So I have three send channels like this: udp_send_channel { bind_hostname = yes host = 192.168.x1.y1 port = 8649 ttl = 3 } 2) Login nodes receive and send metrics. I want them to receive computes' metrics over their 192.168 interface and pass metrics to 10.0. network, where my gmond/gmetad/gweb resides. So their send channel is: udp_send_channel { bind_hostname = yes host = 10.my-gmetad-ip port = 8649 ttl = 2 } and they supposed to listen to anything? udp_recv_channel { port = 8649 # Size of the UDP buffer. If you are handling lots of metrics you really # should bump it up to e.g. 10MB or even higher. buffer = 20971520 } 3) and the gmond of 10.my-gmetad-ip has similar udp_recv_channel config. Somehow it doesn't work though; telnet to the TCP port on the server, 10.my-gmetad-ip shows that it gives only the metrics from the login nodes coming from their 10.x interfaces. The login nodes seem to have the metrics from compute nodes' 192.168.x interface, as telnet shows, but do not pass these further, somehow. I check the metrics as follows: telnet 10.0.0.24 8649 | grep HOST Could you please suggest, what am I doing wrong in this setup? Is gmond supposed to forward like this at all? Thank you very much in advance! -- Grigory Shamov Westgrid/ComputeCanada Site Lead University of Manitoba E2-588 EITC Building, (204) 474-9625 -- Site24x7 APM Insight: Get Deep Visibility into Application Performance APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month Monitor end-to-end web transactions and take corrective actions now Troubleshoot faster and improve end-user experience. Signup Now! http://pubads.g.doubleclick.net/gampad/clk?id=272487151=/4140 ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
Re: [Ganglia-general] rrdcached /gmetad permission problems after update.
Hi Vladimir, Thanks a lot! The TCP method seems to be working for me. -- Grigory Shamov Westgrid/ComputeCanada Site Lead University of Manitoba E2-588 EITC Building, (204) 474-9625 From: Vladimir Vuksan <vli...@veus.hr<mailto:vli...@veus.hr>> Date: Tuesday, 23 February, 2016 10:27 AM To: Grigory Shamov <grigory.sha...@umanitoba.ca<mailto:grigory.sha...@umanitoba.ca>>, Adrian Sevcenco <adrian.sevce...@cern.ch<mailto:adrian.sevce...@cern.ch>>, "ganglia-general@lists.sourceforge.net<mailto:ganglia-general@lists.sourceforge.net>" <ganglia-general@lists.sourceforge.net<mailto:ganglia-general@lists.sourceforge.net>> Subject: Re: [Ganglia-general] rrdcached /gmetad permission problems after update. I don't think it depends on a version. I assume anything 1.4.x will work. Vladimir 02/23/2016 u 10:50 AM, Grigory Shamov je napisao/la: Dear Adrian, Vladimir, Thank you very much for the answers! Does it depend on rrdtools/rrdcached version? Which one are you using ? -- Grigory Shamov Westgrid/ComputeCanada Site Lead University of Manitoba E2-588 EITC Building, (204) 474-9625 From: Vladimir Vuksan <vli...@veus.hr<mailto:vli...@veus.hr>> Date: Tuesday, 23 February, 2016 8:57 AM To: Adrian Sevcenco <adrian.sevce...@cern.ch<mailto:adrian.sevce...@cern.ch>>, "ganglia-general@lists.sourceforge.net<mailto:ganglia-general@lists.sourceforge.net>" <ganglia-general@lists.sourceforge.net<mailto:ganglia-general@lists.sourceforge.net>>, Grigory Shamov <grigory.sha...@umanitoba.ca<mailto:grigory.sha...@umanitoba.ca>> Subject: Re: [Ganglia-general] rrdcached /gmetad permission problems after update. Another thing to try is to switch over to using TCP for rrdcached connections since that avoids contention on the rrdcached socket and should avoid some of the permissions issues. For example I am using following options OPTS=" -t 60 -w 180 -z 180 -F -s ganglia -m 664 -l 127.0.0.1:9998 -s ganglia -m 777 -P FLUSH,STATS,HELP -l unix:/tmp/rrdcached.limited.sock -b /var/lib/ganglia/rrds -B -p /var/lib/ganglia/rrdcached.pid In the gmetad.conf you can then add rrdcached_address 127.0.0.1:9998 Vladimir 02/23/2016 u 09:34 AM, Adrian Sevcenco je napisao/la: On 02/22/2016 10:32 PM, Grigory Shamov wrote: Hi All, I have updated Gmond/Gmetad to 3.7.2 on our Ganglia server that uses also RRDCached. It used to work, and configuration didn't change, but bow metrics do not get into the graphs anymore. In the logs there is a lot of messages about permissions, and a new kind of message about imuxsock thing: Feb 22 13:18:40 host /usr/sbin/gmetad[3554]: RRD_update (/var/lib/ganglia/rrds/Grex/__SummaryInfo__/rx_bytes_ib0.rrd): rrdcached: Permission denied. Feb 22 13:18:40 host rsyslogd-2177: imuxsock begins to drop messages from pid 3554 due to rate-limiting Does anyone know how to fix it? Thank you very much in advance! I had the same problem (and partially i still have one) .. So : 1. that "drop messages" message is because of the write errors .. you can ignore that 2. the problem is with rrdcached .. i tried with making part of group ganglia, change ownership of rrds to ganglia:rrdcached but had the same errors ... in the end i had to make the rrds dir 777 and i get rid of errors 3. ganglia web does not function with the rrdcached limited socket so i had to use the same full socket that gmetad use ... i hope that nothing bad will happen but i have no other choice... HTH, Adrian -- Site24x7 APM Insight: Get Deep Visibility into Application Performance APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month Monitor end-to-end web transactions and take corrective actions now Troubleshoot faster and improve end-user experience. Signup Now! http://pubads.g.doubleclick.net/gampad/clk?id=272487151=/4140 ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net<mailto:Ganglia-general@lists.sourceforge.net>https://lists.sourceforge.net/lists/listinfo/ganglia-general -- Site24x7 APM Insight: Get Deep Visibility into Application Performance APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month Monitor end-to-end web transactions and take corrective actions now Troubleshoot faster and improve end-user experience. Signup Now! http://pubads.g.doubleclick.net/gampad/clk?id=272487151=/4140___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
Re: [Ganglia-general] rrdcached /gmetad permission problems after update.
Dear Adrian, Vladimir, Thank you very much for the answers! Does it depend on rrdtools/rrdcached version? Which one are you using ? -- Grigory Shamov Westgrid/ComputeCanada Site Lead University of Manitoba E2-588 EITC Building, (204) 474-9625 From: Vladimir Vuksan <vli...@veus.hr<mailto:vli...@veus.hr>> Date: Tuesday, 23 February, 2016 8:57 AM To: Adrian Sevcenco <adrian.sevce...@cern.ch<mailto:adrian.sevce...@cern.ch>>, "ganglia-general@lists.sourceforge.net<mailto:ganglia-general@lists.sourceforge.net>" <ganglia-general@lists.sourceforge.net<mailto:ganglia-general@lists.sourceforge.net>>, Grigory Shamov <grigory.sha...@umanitoba.ca<mailto:grigory.sha...@umanitoba.ca>> Subject: Re: [Ganglia-general] rrdcached /gmetad permission problems after update. Another thing to try is to switch over to using TCP for rrdcached connections since that avoids contention on the rrdcached socket and should avoid some of the permissions issues. For example I am using following options OPTS=" -t 60 -w 180 -z 180 -F -s ganglia -m 664 -l 127.0.0.1:9998 -s ganglia -m 777 -P FLUSH,STATS,HELP -l unix:/tmp/rrdcached.limited.sock -b /var/lib/ganglia/rrds -B -p /var/lib/ganglia/rrdcached.pid In the gmetad.conf you can then add rrdcached_address 127.0.0.1:9998 Vladimir 02/23/2016 u 09:34 AM, Adrian Sevcenco je napisao/la: On 02/22/2016 10:32 PM, Grigory Shamov wrote: Hi All, I have updated Gmond/Gmetad to 3.7.2 on our Ganglia server that uses also RRDCached. It used to work, and configuration didn't change, but bow metrics do not get into the graphs anymore. In the logs there is a lot of messages about permissions, and a new kind of message about imuxsock thing: Feb 22 13:18:40 host /usr/sbin/gmetad[3554]: RRD_update (/var/lib/ganglia/rrds/Grex/__SummaryInfo__/rx_bytes_ib0.rrd): rrdcached: Permission denied. Feb 22 13:18:40 host rsyslogd-2177: imuxsock begins to drop messages from pid 3554 due to rate-limiting Does anyone know how to fix it? Thank you very much in advance! I had the same problem (and partially i still have one) .. So : 1. that "drop messages" message is because of the write errors .. you can ignore that 2. the problem is with rrdcached .. i tried with making part of group ganglia, change ownership of rrds to ganglia:rrdcached but had the same errors ... in the end i had to make the rrds dir 777 and i get rid of errors 3. ganglia web does not function with the rrdcached limited socket so i had to use the same full socket that gmetad use ... i hope that nothing bad will happen but i have no other choice... HTH, Adrian -- Site24x7 APM Insight: Get Deep Visibility into Application Performance APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month Monitor end-to-end web transactions and take corrective actions now Troubleshoot faster and improve end-user experience. Signup Now! http://pubads.g.doubleclick.net/gampad/clk?id=272487151=/4140 ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net<mailto:Ganglia-general@lists.sourceforge.net>https://lists.sourceforge.net/lists/listinfo/ganglia-general -- Site24x7 APM Insight: Get Deep Visibility into Application Performance APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month Monitor end-to-end web transactions and take corrective actions now Troubleshoot faster and improve end-user experience. Signup Now! http://pubads.g.doubleclick.net/gampad/clk?id=272487151=/4140___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
[Ganglia-general] rrdcached /gmetad permission problems after update.
Hi All, I have updated Gmond/Gmetad to 3.7.2 on our Ganglia server that uses also RRDCached. It used to work, and configuration didn't change, but bow metrics do not get into the graphs anymore. In the logs there is a lot of messages about permissions, and a new kind of message about imuxsock thing: Feb 22 13:18:40 host /usr/sbin/gmetad[3554]: RRD_update (/var/lib/ganglia/rrds/Grex/__SummaryInfo__/rx_bytes_ib0.rrd): rrdcached: Permission denied. Feb 22 13:18:40 host rsyslogd-2177: imuxsock begins to drop messages from pid 3554 due to rate-limiting Does anyone know how to fix it? Thank you very much in advance! -- Grigory Shamov Westgrid/ComputeCanada Site Lead University of Manitoba E2-588 EITC Building, (204) 474-9625 -- Site24x7 APM Insight: Get Deep Visibility into Application Performance APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month Monitor end-to-end web transactions and take corrective actions now Troubleshoot faster and improve end-user experience. Signup Now! http://pubads.g.doubleclick.net/gampad/clk?id=272487151=/4140 ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
[Ganglia-general] binding gmond to an interface?
Hi All, Is there a way to bind gmond not to hostname, but to another interface? For example, if you have hostname on eth0, but want to send your unicast metrics to eth1. -- Grigory Shamov Westgrid/ComputeCanada Site Lead University of Manitoba E2-588 EITC Building, (204) 474-9625 -- Monitor Your Dynamic Infrastructure at Any Scale With Datadog! Get real-time metrics from all of your servers, apps and tools in one place. SourceForge users - Click here to start your Free Trial of Datadog now! http://pubads.g.doubleclick.net/gampad/clk?id=241902991=/4140 ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
[Ganglia-general] How to use procstat metrics in gmond 3.7.0 ?
Hi All, I am trying to use process statistics metrics defining them in procstat.pyconf . In some cases this doesn't work. There is nothing reported; the metric appears in the Ganglia Web interface but is empty. What I am trying to do is to monitor my Torque server. I have in the procstat.pyconf now: param pbs_server { # pbs_server seem to have PID there # value = '/var/spool/torque/server_priv/server.lock' value='/pbs_server/' } Could you please suggest how to specify the daemon to monitor in procstat.pyconf? It seems from the examples, that both PID or process name cane be specified? It did not work for me, either with the PID value from server.lock file (commented out now), or with the string value. Thank you very much! -- Grigory Shamov Westgrid/ComputeCanada Site Lead University of Manitoba E2-588 EITC Building, (204) 474-9625 -- Dive into the World of Parallel Programming The Go Parallel Website, sponsored by Intel and developed in partnership with Slashdot Media, is your hub for all things parallel software development, from weekly thought leadership blogs to news, videos, case studies, tutorials and more. Take a look and join the conversation now. http://goparallel.sourceforge.net/ ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
Re: [Ganglia-general] splitting rrds on different hosts
Dear Jesse, How does one split a cluster to separate ones, to display in a single Gangnlia-Web instance? Also, is there a way to make some of the clusters private, like they do on SurfSara? https://ganglia.surfsara.nl/ I mean there is a way since its been done, but could anyone suggest how exactly? Thanks! -- Grigory Shamov Westgrid/ComputeCanada Site Lead University of Manitoba E2-588 EITC Building, (204) 474-9625 On 14-10-11 1:44 PM, Jesse Becker haw...@gmail.com wrote: There isn't a way to do it natively, but there are a few ways to work around it. One is to split your hosts into separate clusters, and multiple gmetad instances. These could easily be on the same host, but use different disk partitions so there's less IO contention. Dump all of the files into a single location, but use symlinks to distribute them. Get an SSD drive, and use that; it should help a fair bit since the IO is largely small/random. Disable readahead on the device in question. Use tmpfs to store the RRD files, but remember to sync them back to persistent storage periodically (and restore them again at boot-time). On Sat, Oct 11, 2014 at 2:18 PM, Rita rmorgan...@gmail.com wrote: At the moment all of my rrds are going to the host which hosts the gmetad. Is it possible to split the gmetads to different hosts so rrds will be distributed? I am asking this because I am monitoring 400 hosts and having I/O disk wait problems. I would like to split the load. Any thoughts? -- --- Get your facts first, then you can distort them as you please.-- - - Meet PCI DSS 3.0 Compliance Requirements with EventLog Analyzer Achieve PCI DSS 3.0 Compliant Status with Out-of-the-box PCI DSS Reports Are you Audit-Ready for PCI DSS 3.0 Compliance? Download White paper Comply to PCI DSS 3.0 Requirement 10 and 11.5 with EventLog Analyzer http://p.sf.net/sfu/Zoho ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general -- Jesse Becker -- Meet PCI DSS 3.0 Compliance Requirements with EventLog Analyzer Achieve PCI DSS 3.0 Compliant Status with Out-of-the-box PCI DSS Reports Are you Audit-Ready for PCI DSS 3.0 Compliance? Download White paper Comply to PCI DSS 3.0 Requirement 10 and 11.5 with EventLog Analyzer http://p.sf.net/sfu/Zoho ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general -- Meet PCI DSS 3.0 Compliance Requirements with EventLog Analyzer Achieve PCI DSS 3.0 Compliant Status with Out-of-the-box PCI DSS Reports Are you Audit-Ready for PCI DSS 3.0 Compliance? Download White paper Comply to PCI DSS 3.0 Requirement 10 and 11.5 with EventLog Analyzer http://p.sf.net/sfu/Zoho ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
Re: [Ganglia-general] Huge metrics' size being reported to gmetad
Dear Sergio, Somehow on my 300-node cluster Ganglia, with more or less default metrics and RRD configs, collects over 40GB! If there are smarter RRD settings to reduce the size, it would be very interesting to learn them. -- Grigory Shamov HPC Analyst, Tech. Site Lead, Westgrid/Compute Canada E2-588 EITC Building, University of Manitoba (204) 474-9625 From: Sergio Ballestrero sergio.ballestr...@gmail.commailto:sergio.ballestr...@gmail.com Date: Monday, 30 June, 2014 10:21 AM To: Cristovao Jose Domingues Cordeiro cristovao.corde...@cern.chmailto:cristovao.corde...@cern.ch Cc: Ganglia ganglia-general@lists.sourceforge.netmailto:ganglia-general@lists.sourceforge.net Subject: Re: [Ganglia-general] Huge metrics' size being reported to gmetad Hi Cristovao, that depends on how many metrics and on the rrd creation settings. Sure 150MB looks like a lot. An ls -la may give more hints... Ciao, Sergio On 30 Jun 2014 16:35, Cristovao Jose Domingues Cordeiro cristovao.corde...@cern.chmailto:cristovao.corde...@cern.ch wrote: Someone? Cumprimentos / Best regards, Cristóvão José Domingues Cordeiro IT Department - 28/1-010 CERN From: Cristovao Jose Domingues Cordeiro [cristovao.corde...@cern.chmailto:cristovao.corde...@cern.ch] Sent: 24 June 2014 13:58 To: ganglia-general@lists.sourceforge.netmailto:ganglia-general@lists.sourceforge.net Subject: [Ganglia-general] Huge metrics' size being reported to gmetad Hi, I have a grid configuration, with several clusters. I am also using RAM disk for I/O optimization (4GB). I've been noticing that sometimes, gmetad breaks, complaining about lack of space in this tmpfs partition. I checked and I saw that for some reason, some clusters, have hosts which occupy 3MB, 6MB and even sometimes 150MB All together makes the cluster occupy 2GB and consequently occupy half of the ramdisk space. We would normally expect these host metrics to have +/- 336k right? Has anyone experienced this? Cumprimentos / Best regards, Cristóvão José Domingues Cordeiro IT Department - 28/1-010 CERN -- Open source business process management suite built on Java and Eclipse Turn processes into business applications with Bonita BPM Community Edition Quickly connect people, data, and systems into organized workflows Winner of BOSSIE, CODIE, OW2 and Gartner awards http://p.sf.net/sfu/Bonitasoft ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.netmailto:Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general -- Open source business process management suite built on Java and Eclipse Turn processes into business applications with Bonita BPM Community Edition Quickly connect people, data, and systems into organized workflows Winner of BOSSIE, CODIE, OW2 and Gartner awards http://p.sf.net/sfu/Bonitasoft___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
Re: [Ganglia-general] Number of HyperThreaded CPUs
Hi, More specifically, somehow in Ganglia 3.6 a solution posted here previously, gmetric -t uint32 -s zero -n 'cpu_num' -v $NCPUS -u 'CPUs' it doesn't seem to change total number of CPUs or cpu_num of individual nodes, as shown by the Web frontend. Could someone advise please, how to count real cores only? -- Grigory Shamov HPC Analyst, Westgrid/Compute Canada E2-588 EITC Building, University of Manitoba (204) 474-9625 On 14-04-14 9:14 AM, Grigory Shamov grigory.sha...@umanitoba.ca wrote: Hi All, I am sure this has been resolved already, but googling did not bring me an immediate answer for the new Ganglia. I've just installed the recent Ganglia 3.6 with Ganglia-Web 3.5.2 on our system. The system is Intel Xeons with HT enabled, and by default Ganglia seem to count every core. This gives utilization of 50% max which is not correct (as we schedule by real cores not HT ones). So, what would be a best way to report only real cores in the new Ganglia? -- Grigory Shamov HPC Analyst, Westgrid/Compute Canada E2-588 EITC Building, University of Manitoba (204) 474-9625 -- Learn Graph Databases - Download FREE O'Reilly Book Graph Databases is the definitive new guide to graph databases and their applications. Written by three acclaimed leaders in the field, this first edition is now available. Download your free book today! http://p.sf.net/sfu/NeoTech ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general -- Learn Graph Databases - Download FREE O'Reilly Book Graph Databases is the definitive new guide to graph databases and their applications. Written by three acclaimed leaders in the field, this first edition is now available. Download your free book today! http://p.sf.net/sfu/NeoTech ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
[Ganglia-general] Number of HyperThreaded CPUs
Hi All, I am sure this has been resolved already, but googling did not bring me an immediate answer for the new Ganglia. I've just installed the recent Ganglia 3.6 with Ganglia-Web 3.5.2 on our system. The system is Intel Xeons with HT enabled, and by default Ganglia seem to count every core. This gives utilization of 50% max which is not correct (as we schedule by real cores not HT ones). So, what would be a best way to report only real cores in the new Ganglia? -- Grigory Shamov HPC Analyst, Westgrid/Compute Canada E2-588 EITC Building, University of Manitoba (204) 474-9625 -- Learn Graph Databases - Download FREE O'Reilly Book Graph Databases is the definitive new guide to graph databases and their applications. Written by three acclaimed leaders in the field, this first edition is now available. Download your free book today! http://p.sf.net/sfu/NeoTech ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general