[Ganglia-general] Plan item
snarayanaswamy: Can you please work on the documentation for replicating ganglia/riemann setup in another data center? Sergey: yes, that’s in my list as discussed snarayanaswamy: thanks -- ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
Re: [Ganglia-general] Plan item
Very sorry! Please remove this from the discussion list! Thanks! Sergey -- ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
Re: [Ganglia-general] Gweb cluster page stopped working after adding 500 custom metrics per server
gmetad 3.7.0 I don't use rrdcached yet. Sergey > On Dec 19, 2015, at 7:41 PM, Vladimir Vuksan <vli...@veus.hr> wrote: > > What Ganglia gmetad version are you running? Are you using rrdcached? > > On December 18, 2015 8:09:09 PM EST, Sergey <svin...@apple.com> wrote: > Addition: Ganglia Web log shows 500 error. > > Sergey > > > > On Dec 11, 2015, at 11:52 AM, Sergey <svin...@apple.com> wrote: > > Hi All! > > We added ~500 custom metrics/server in one cluster and now this cluster page > stopped working. > All other clusters are working properly. > It looks like some timeout value should be updated in Gweb, because the data > retrieving time was increased. > Do you know how to fix this? > > The mobile page is still showing all data from this cluster. > > Thanks! > Sergey > > > > > Ganglia-general mailing list > Ganglia-general@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/ganglia-general > <https://lists.sourceforge.net/lists/listinfo/ganglia-general> > > Vladimir -- ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
Re: [Ganglia-general] Gweb cluster page stopped working after adding 500 custom metrics per server
I have 30 servers in this cluster. I'll check PHP configuration. Thanks! Sergey > On Dec 19, 2015, at 7:56 PM, Jesse Becker <haw...@gmail.com> wrote: > > How many servers? > > There aren't too many timeouts in the gweb code. It could be something > related to PHP configuration. If, for example, the PHP script is waiting for > gmetad to finish sending data, and the webserver kills it because it took > "too long." > > On Fri, Dec 11, 2015 at 2:52 PM, Sergey <svin...@apple.com > <mailto:svin...@apple.com>> wrote: > Hi All! > > We added ~500 custom metrics/server in one cluster and now this cluster page > stopped working. > All other clusters are working properly. > It looks like some timeout value should be updated in Gweb, because the data > retrieving time was increased. > Do you know how to fix this? > > The mobile page is still showing all data from this cluster. > > Thanks! > Sergey > -- > ___ > Ganglia-general mailing list > Ganglia-general@lists.sourceforge.net > <mailto:Ganglia-general@lists.sourceforge.net> > https://lists.sourceforge.net/lists/listinfo/ganglia-general > <https://lists.sourceforge.net/lists/listinfo/ganglia-general> > > > > -- > Jesse Becker -- ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
Re: [Ganglia-general] Gweb cluster page stopped working after adding 500 custom metrics per server
Addition: Ganglia Web log shows 500 error. Sergey > On Dec 11, 2015, at 11:52 AM, Sergey <svin...@apple.com> wrote: > > Hi All! > > We added ~500 custom metrics/server in one cluster and now this cluster page > stopped working. > All other clusters are working properly. > It looks like some timeout value should be updated in Gweb, because the data > retrieving time was increased. > Do you know how to fix this? > > The mobile page is still showing all data from this cluster. > > Thanks! > Sergey -- ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
[Ganglia-general] Gweb cluster page stopped working after adding 500 custom metrics per server
Hi All! We added ~500 custom metrics/server in one cluster and now this cluster page stopped working. All other clusters are working properly. It looks like some timeout value should be updated in Gweb, because the data retrieving time was increased. Do you know how to fix this? The mobile page is still showing all data from this cluster. Thanks! Sergey -- ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
[Ganglia-general] Cluster View - nodes up and down
Hi All! Is it possible to generate aggregate view for a grid which will show - how many nodes in every cluster are presented and how many of them are down? It will be useful if an aggregate view is used like a custom dashboard. Thanks! Sergey -- ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
Re: [Ganglia-general] Module Collection intreval
I found that parameter “collect_every” allows to setup frequency of the script runs. I added logging to the python module and it definitely shows that this module runs every 5 minutes if collect_every = 300”. Sergey On Aug 10, 2015, at 12:42 PM, Sergey svin...@apple.com wrote: Hi! I use GMOND python module to collect Kafka Lag values. It periodically runs some utility and parses the output. I don’t want to run this utility too often. How can I setup the frequency of my checks? Does mymod.pyconf collection group parameter (collect_every = 300) limit the frequency of checks? Or do I need to setup some general GMOND parameter? Thanks! -- ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general -- ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
[Ganglia-general] Module Collection intreval
Hi! I use GMOND python module to collect Kafka Lag values. It periodically runs some utility and parses the output. I don’t want to run this utility too often. How can I setup the frequency of my checks? Does mymod.pyconf collection group parameter (collect_every = 300) limit the frequency of checks? Or do I need to setup some general GMOND parameter? Thanks! -- ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
[Ganglia-general] Process state monitoring
Hi Everybody! I see that we can monitor any process and collect CPU and memory metrics for this process via Python module. Is it possible to monitor the state of the process (if running - state=Up, if stopped - state=Down)? I see that CPU and memory metrics are coming independently of the process state, so I can’t use them to calculate the process state. I think some “process heartbeat” monitor required. Any ideas? Thanks! Sergey -- Don't Limit Your Business. Reach for the Cloud. GigeNET's Cloud Solutions provide you with the tools and support that you need to offload your IT needs and focus on growing your business. Configured For All Businesses. Start Your Cloud Today. https://www.gigenetcloud.com/ ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
Re: [Ganglia-general] Process state monitoring
Hi Vladimir, This is indirect solution. I’d prefer some python module which will return metric “100” in case of running process and “0” - if it doesn’t exist. Or, for example - to return number of the existing processes with the same name. Thank you! Sergey On Jun 30, 2015, at 11:57 AM, Vladimir Vuksan vli...@veus.hr wrote: You can alert on process memory size e.g. we have alerts that say if process memory is 100 bytes it's down. Also if process memory is X bytes it's leaking memory. Vladimir 06/30/2015 u 01:19 PM, Sergey je napisao/la: Hi Everybody! I see that we can monitor any process and collect CPU and memory metrics for this process via Python module. Is it possible to monitor the state of the process (if running - state=Up, if stopped - state=Down)? I see that CPU and memory metrics are coming independently of the process state, so I can’t use them to calculate the process state. I think some “process heartbeat” monitor required. Any ideas? -- Don't Limit Your Business. Reach for the Cloud. GigeNET's Cloud Solutions provide you with the tools and support that you need to offload your IT needs and focus on growing your business. Configured For All Businesses. Start Your Cloud Today. https://www.gigenetcloud.com/ ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
Re: [Ganglia-general] GMOND + SFLOWD functionality
Hi Peter, It’s very sad. It also contradicts the Gmond topology described in the O’Reily book “Monitoring with Ganglia” (p.22 Fig. 2-3). The main disadvantage of this is the fact that we have to build 2 parallel monitoring structures (gnome and show) with separate ports and flows, which are joined only in the central collection point. Is it possible to modify Gmond agent to join Gmond and Sfow data locally on every monitored computer? Thanks! Sergey On May 30, 2015, at 10:07 PM, Peter Phaal peter.ph...@gmail.com wrote: Sergey, gmond does not retransmit the sFlow metrics it receives. A single gmond instance is used a central collector for a cluster of machines running Host sFlow agents. gmetad uses a TCP connection to retrieve the cluster stats from the single gmond instance and update the RRDs. Peter On Fri, May 29, 2015 at 10:02 AM, Sergey svin...@apple.com wrote: Hi Vladimir, This is very serious question - is GMOND supposed to retransmit metrics received from the local HSFLOWD agent or it just saves them locally for further retrieving via TCP connection? What is the initial project for this? Thanks! Serfey Vinnik -- ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general -- ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
Re: [Ganglia-general] GMOND + SFLOWD functionality
OK. Then what does the “deaf” Gmond make with metrics to provide them to “mute” Gmond? I guess - it sends them. I our case “deaf” Gmond gets metrics from local HSFLOWD and doesn’t send them. Why it doesn’t send them to “mute” Gmond? What’s the difference between it’s own metrics and HSFLOWD metrics? Did you see the content of HSWLOWD.auto ? It says that Collector IP should be “localhost”. But it can’t be localhost because collected metrics will be saved but not transmitted. Thanks! Sergey On Jun 1, 2015, at 12:17 PM, Jesse Becker haw...@gmail.com wrote: On Mon, Jun 1, 2015 at 1:00 PM, Sergey svin...@apple.com wrote: It also contradicts the Gmond topology described in the O’Reily book “Monitoring with Ganglia” (p.22 Fig. 2-3). I don't see how. I'm looking at a copy of the book right now, and Figure 2-3 has three gmonds: two (deaf) gmonds that send to a third gmond (mute) that aggregates them. There's nothing about retransmitting or relaying metrics at all. Gmond doesn't retransmit metrics, except when polled via TCP (which is usually from gmetad). Hi Peter, It’s very sad. It also contradicts the Gmond topology described in the O’Reily book “Monitoring with Ganglia” (p.22 Fig. 2-3). The main disadvantage of this is the fact that we have to build 2 parallel monitoring structures (gnome and show) with separate ports and flows, which are joined only in the central collection point. Is it possible to modify Gmond agent to join Gmond and Sfow data locally on every monitored computer? Thanks! Sergey On May 30, 2015, at 10:07 PM, Peter Phaal peter.ph...@gmail.com wrote: Sergey, gmond does not retransmit the sFlow metrics it receives. A single gmond instance is used a central collector for a cluster of machines running Host sFlow agents. gmetad uses a TCP connection to retrieve the cluster stats from the single gmond instance and update the RRDs. Peter On Fri, May 29, 2015 at 10:02 AM, Sergey svin...@apple.com wrote: Hi Vladimir, This is very serious question - is GMOND supposed to retransmit metrics received from the local HSFLOWD agent or it just saves them locally for further retrieving via TCP connection? What is the initial project for this? Thanks! Serfey Vinnik -- ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general -- ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general -- Jesse Becker -- ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
[Ganglia-general] GMOND + SFLOWD functionality
Hi Vladimir, This is very serious question - is GMOND supposed to retransmit metrics received from the local HSFLOWD agent or it just saves them locally for further retrieving via TCP connection? What is the initial project for this? Thanks! Serfey Vinnik -- ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
Re: [Ganglia-general] HTTPD metrics not sent
Yes, and I see in debug mode that GMOND accepts and saves HTTP metrics. These metrics can be retrieved over direct TCP connection from Collector, but they are not sent to another GMOND agent via UDP. We use one common GMOND per cluster which is running at Collector server. How can it be fixed? Thanks! Sergey On May 28, 2015, at 1:40 PM, Peter Phaal peter.ph...@gmail.com wrote: Have you enabled http in the sFlow section in the gmond config? http://blog.sflow.com/2011/12/using-ganglia-to-monitor-web-farms.html You should try running sflowtool on the head end gmond system to verify that the data is arriving: http://blog.sflow.com/2011/12/sflowtool.html On Thu, May 28, 2015 at 10:06 AM, Sergey svin...@apple.com wrote: Hi Everybody! I use HSFLOWD agent to collect HTTPD metrics from Apache server vis mod_sflow.so module. I see that GMOND gets HTTPD metrics from HSFLOWD and save them in metadata, but for some reason it doesn’t forward HTTPD metrics by UDP to another GMOND agent. All other metrics are successful transfered. Do you know how to fix it? Thanks! Sergey -- ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net sflowvalve.jarhttps://lists.sourceforge.net/lists/listinfo/ganglia-general -- ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
[Ganglia-general] HTTPD metrics not sent
Hi Everybody! I use HSFLOWD agent to collect HTTPD metrics from Apache server vis mod_sflow.so module. I see that GMOND gets HTTPD metrics from HSFLOWD and save them in metadata, but for some reason it doesn’t forward HTTPD metrics by UDP to another GMOND agent. All other metrics are successful transfered. Do you know how to fix it? Thanks! Sergey -- ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
Re: [Ganglia-general] Sflow Apache metrics
Great, that works for me! Thanks! Sergey On Apr 13, 2015, at 7:21 PM, Neil Mckee neil.mckee...@gmail.com wrote: Sergey, It's usually best to compile mod-sflow from sources so that it matches the particular version of apache you are running. So before you do that you have the option of editing mod-sflow.c and changing the setting of SFWB_DEFAULT_CONFIGFILE (on line 211). https://code.google.com/p/mod-sflow/source/browse/trunk/mod_sflow.c#211 https://code.google.com/p/mod-sflow/source/browse/trunk/mod_sflow.c#211 Does that work for you? Separate question: I'm not sure how hsflowd works if it doesn't start as root? What OS are you on? Neil On Mon, Apr 13, 2015 at 5:55 PM, Sergey svin...@apple.com mailto:svin...@apple.com wrote: I found following error in Apache log: [Mon Apr 13 23:25:14 2015] [error] (2)No such file or directory: apr_stat(/etc/hsflowd.auto) failed The problem is that Hsflowd process is running in the user directory and keeps hsflowd.auto file in ./run directory. I can’t access /etc directory and put file there also, because I don’t have root access. Any ideas? Thanks! S. On Apr 13, 2015, at 9:36 AM, Sergey svin...@apple.com mailto:svin...@apple.com wrote: Yes, I installed sflowtool and it works! I get all counters except http* ones. That’s why I tested http://hostname/sflow http://hostname/sflow page, because it uses mod_sflow in Apache. It looks like some Apache+sflow issue, but I don’t know how to troubleshoot it. Thanks S. On Apr 10, 2015, at 6:28 PM, Leslie geekg...@gmail.com mailto:geekg...@gmail.com wrote: Have you installed sflowtool and seen if the sflow counters are even getting sent out by the machine ? My next step would be a tcpdump to make sure that the sflow counters are then getting sent to the collecting host. On Fri, Apr 10, 2015 at 4:55 PM, Sergey svin...@apple.com mailto:svin...@apple.com wrote: Hi All! I installed mod_sflow on Apache and try to collect HTTP metrics by Gmond. The problem is that I don’t see any HTTP metrics coming from Hsflow to Gmond, nor HTTP counters via Apache http://hostname/sflow http://hostname/sflow page. There is a list of counters, but they all have 0. Like this: unter method_option_count 0 counter method_get_count 0 counter method_head_count 0 counter method_post_count 0 counter method_put_count 0 counter method_delete_count 0 counter method_trace_count 0 counter method_connect_count 0 counter method_other_count 0 counter status_1XX_count 0 counter status_2XX_count 0 counter status_3XX_count 0 counter status_4XX_count 0 counter status_5XX_count 0 counter status_other_count 0 string hostname xx gauge sampling_n 0 At the same time http://hostname/server-status?auto http://hostname/server-status?auto is working properly: Total Accesses: 15 Total kBytes: 5 Uptime: 149 ReqPerSec: .100671 BytesPerSec: 34.3624 BytesPerReq: 341.333 BusyWorkers: 1 IdleWorkers: 7 Scoreboard: Is there a way to troubleshoot this? I need Sflow metrics. Thanks! S. -- BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT Develop your own process in accordance with the BPMN 2 standard Learn Process modeling best practices with Bonita BPM through live exercises http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_ source=Sourceforge_BPM_Camp_5_6_15utm_medium=emailutm_campaign=VA_SF ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net mailto:Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general https://lists.sourceforge.net/lists/listinfo/ganglia-general -- BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT Develop your own process in accordance with the BPMN 2 standard Learn Process modeling best practices with Bonita BPM through live exercises http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_ source=Sourceforge_BPM_Camp_5_6_15utm_medium=emailutm_campaign=VA_SF___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net mailto:Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general https://lists.sourceforge.net/lists/listinfo/ganglia-general -- BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT Develop your own process in accordance with the BPMN 2 standard Learn Process modeling best practices with Bonita BPM through live exercises http
Re: [Ganglia-general] no_group metrics issue
Actually, it was resolved by GMOND config change: “allow_extra_data=yes” Thanks! Sergey On Apr 22, 2015, at 5:33 PM, Sergey svin...@apple.com wrote: Hi Everybody! I have one Gmetad instance [server1] collecting metrics from several clusters of hosts. Then the second Gmetad instance [server2] has to pool all data via port 8651 from the first instance and store everything in local RRDS. The first Gmetad collects data from it’s local Gmond agent and I can see it’s metrics on the [server2] Gweb, but all metrics grouping is lost for some reason. All metrics from different groups of this server were placed into [server1]/“no_group metrics” group. How can I fix it? Thanks! Sergey -- BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT Develop your own process in accordance with the BPMN 2 standard Learn Process modeling best practices with Bonita BPM through live exercises http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_ source=Sourceforge_BPM_Camp_5_6_15utm_medium=emailutm_campaign=VA_SF ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general -- BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT Develop your own process in accordance with the BPMN 2 standard Learn Process modeling best practices with Bonita BPM through live exercises http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_ source=Sourceforge_BPM_Camp_5_6_15utm_medium=emailutm_campaign=VA_SF ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
[Ganglia-general] no_group metrics issue
Hi Everybody! I have one Gmetad instance [server1] collecting metrics from several clusters of hosts. Then the second Gmetad instance [server2] has to pool all data via port 8651 from the first instance and store everything in local RRDS. The first Gmetad collects data from it’s local Gmond agent and I can see it’s metrics on the [server2] Gweb, but all metrics grouping is lost for some reason. All metrics from different groups of this server were placed into [server1]/“no_group metrics” group. How can I fix it? Thanks! Sergey -- BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT Develop your own process in accordance with the BPMN 2 standard Learn Process modeling best practices with Bonita BPM through live exercises http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_ source=Sourceforge_BPM_Camp_5_6_15utm_medium=emailutm_campaign=VA_SF ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
Re: [Ganglia-general] Sflow Apache metrics
I found following error in Apache log: [Mon Apr 13 23:25:14 2015] [error] (2)No such file or directory: apr_stat(/etc/hsflowd.auto) failed The problem is that Hsflowd process is running in the user directory and keeps hsflowd.auto file in ./run directory. I can’t access /etc directory and put file there also, because I don’t have root access. Any ideas? Thanks! S. On Apr 13, 2015, at 9:36 AM, Sergey svin...@apple.com wrote: Yes, I installed sflowtool and it works! I get all counters except http* ones. That’s why I tested http://hostname/sflow http://hostname/sflow page, because it uses mod_sflow in Apache. It looks like some Apache+sflow issue, but I don’t know how to troubleshoot it. Thanks S. On Apr 10, 2015, at 6:28 PM, Leslie geekg...@gmail.com mailto:geekg...@gmail.com wrote: Have you installed sflowtool and seen if the sflow counters are even getting sent out by the machine ? My next step would be a tcpdump to make sure that the sflow counters are then getting sent to the collecting host. On Fri, Apr 10, 2015 at 4:55 PM, Sergey svin...@apple.com mailto:svin...@apple.com wrote: Hi All! I installed mod_sflow on Apache and try to collect HTTP metrics by Gmond. The problem is that I don’t see any HTTP metrics coming from Hsflow to Gmond, nor HTTP counters via Apache http://hostname/sflow http://hostname/sflow page. There is a list of counters, but they all have 0. Like this: unter method_option_count 0 counter method_get_count 0 counter method_head_count 0 counter method_post_count 0 counter method_put_count 0 counter method_delete_count 0 counter method_trace_count 0 counter method_connect_count 0 counter method_other_count 0 counter status_1XX_count 0 counter status_2XX_count 0 counter status_3XX_count 0 counter status_4XX_count 0 counter status_5XX_count 0 counter status_other_count 0 string hostname xx gauge sampling_n 0 At the same time http://hostname/server-status?auto http://hostname/server-status?auto is working properly: Total Accesses: 15 Total kBytes: 5 Uptime: 149 ReqPerSec: .100671 BytesPerSec: 34.3624 BytesPerReq: 341.333 BusyWorkers: 1 IdleWorkers: 7 Scoreboard: Is there a way to troubleshoot this? I need Sflow metrics. Thanks! S. -- BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT Develop your own process in accordance with the BPMN 2 standard Learn Process modeling best practices with Bonita BPM through live exercises http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_ source=Sourceforge_BPM_Camp_5_6_15utm_medium=emailutm_campaign=VA_SF ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net mailto:Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general -- BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT Develop your own process in accordance with the BPMN 2 standard Learn Process modeling best practices with Bonita BPM through live exercises http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_ source=Sourceforge_BPM_Camp_5_6_15utm_medium=emailutm_campaign=VA_SF___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general -- BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT Develop your own process in accordance with the BPMN 2 standard Learn Process modeling best practices with Bonita BPM through live exercises http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_ source=Sourceforge_BPM_Camp_5_6_15utm_medium=emailutm_campaign=VA_SF___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
Re: [Ganglia-general] Sflow Apache metrics
Yes, I installed sflowtool and it works! I get all counters except http* ones. That’s why I tested http://hostname/sflow http://hostname/sflow page, because it uses mod_sflow in Apache. It looks like some Apache+sflow issue, but I don’t know how to troubleshoot it. Thanks S. On Apr 10, 2015, at 6:28 PM, Leslie geekg...@gmail.com wrote: Have you installed sflowtool and seen if the sflow counters are even getting sent out by the machine ? My next step would be a tcpdump to make sure that the sflow counters are then getting sent to the collecting host. On Fri, Apr 10, 2015 at 4:55 PM, Sergey svin...@apple.com wrote: Hi All! I installed mod_sflow on Apache and try to collect HTTP metrics by Gmond. The problem is that I don’t see any HTTP metrics coming from Hsflow to Gmond, nor HTTP counters via Apache http://hostname/sflow page. There is a list of counters, but they all have 0. Like this: unter method_option_count 0 counter method_get_count 0 counter method_head_count 0 counter method_post_count 0 counter method_put_count 0 counter method_delete_count 0 counter method_trace_count 0 counter method_connect_count 0 counter method_other_count 0 counter status_1XX_count 0 counter status_2XX_count 0 counter status_3XX_count 0 counter status_4XX_count 0 counter status_5XX_count 0 counter status_other_count 0 string hostname xx gauge sampling_n 0 At the same time http://hostname/server-status?auto is working properly: Total Accesses: 15 Total kBytes: 5 Uptime: 149 ReqPerSec: .100671 BytesPerSec: 34.3624 BytesPerReq: 341.333 BusyWorkers: 1 IdleWorkers: 7 Scoreboard: Is there a way to troubleshoot this? I need Sflow metrics. Thanks! S. -- BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT Develop your own process in accordance with the BPMN 2 standard Learn Process modeling best practices with Bonita BPM through live exercises http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_ source=Sourceforge_BPM_Camp_5_6_15utm_medium=emailutm_campaign=VA_SF ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general -- BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT Develop your own process in accordance with the BPMN 2 standard Learn Process modeling best practices with Bonita BPM through live exercises http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_ source=Sourceforge_BPM_Camp_5_6_15utm_medium=emailutm_campaign=VA_SF___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
[Ganglia-general] Sflow Apache metrics
Hi All! I installed mod_sflow on Apache and try to collect HTTP metrics by Gmond. The problem is that I don’t see any HTTP metrics coming from Hsflow to Gmond, nor HTTP counters via Apache http://hostname/sflow http://hostname/sflow page. There is a list of counters, but they all have 0. Like this: unter method_option_count 0 counter method_get_count 0 counter method_head_count 0 counter method_post_count 0 counter method_put_count 0 counter method_delete_count 0 counter method_trace_count 0 counter method_connect_count 0 counter method_other_count 0 counter status_1XX_count 0 counter status_2XX_count 0 counter status_3XX_count 0 counter status_4XX_count 0 counter status_5XX_count 0 counter status_other_count 0 string hostname xx gauge sampling_n 0 At the same time http://hostname/server-status?auto http://hostname/server-status?auto is working properly: Total Accesses: 15 Total kBytes: 5 Uptime: 149 ReqPerSec: .100671 BytesPerSec: 34.3624 BytesPerReq: 341.333 BusyWorkers: 1 IdleWorkers: 7 Scoreboard: Is there a way to troubleshoot this? I need Sflow metrics. Thanks! S.-- BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT Develop your own process in accordance with the BPMN 2 standard Learn Process modeling best practices with Bonita BPM through live exercises http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_ source=Sourceforge_BPM_Camp_5_6_15utm_medium=emailutm_campaign=VA_SF___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
Re: [Ganglia-general] Gmetad-to-Gmetad connection
Hi Vladimir, I changed to “scalable on”. I didn’t help. What I see is only the common remote grid view: CPUs Total: 120 Hosts up: 16 Hosts down:2 == Current Load Avg (15, 5, 1m): 3%, 3%, 3% Avg Utilization (last hour): 4% Localtime: 2015-03-25 10:27 === I can’t see any clusters and hosts inside this grid. By netstat I can see that the second Gmetad instance on machine2 periodically connects to the machine1:8651. I don’t see any connections to machine1:8652. The second Gmetad instance has the same ports, but it’s on another machine. Did you mean that it can affect the polling process? Any ideas? Thanks! Sergey On Mar 24, 2015, at 6:51 PM, Vladimir Vuksan vli...@veus.hr wrote: Hi Sergey, Try setting scalable on in gmetad.conf of the second instance. From the stock gmetad.conf # Scalability mode. If on, we summarize over downstream grids, and respect # authority tags. If off, we take on 2.5.0-era behavior: we do not wrap our output # in GRID/GRID tags, we ignore all GRID tags we see, and always assume # we are the authority on data source feeds. This approach does not scale to # large groups of clusters, but is provided for backwards compatibility. # default: on # scalable off I have not used this feature in a long time so not sure how well it scales however it's worth a shot. Does second instance have different interactive and xml ports ? Vladimir On 03/24/2015 09:24 PM, Sergey wrote: I have one Gmetad instance collecting metrics from several clusters of hosts. Then the second Gmetad instance has to pool all data via port 8651 from the first instance and store everything in local RRDS. I can get all data from the second machine via “#nc machine1 8651”, but when I check RRDS, I don’t see any clusters, only Summary_Data folder. Why Gmetad doesn’t write data into RRDS? -- Dive into the World of Parallel Programming The Go Parallel Website, sponsored by Intel and developed in partnership with Slashdot Media, is your hub for all things parallel software development, from weekly thought leadership blogs to news, videos, case studies, tutorials and more. Take a look and join the conversation now. http://goparallel.sourceforge.net/___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
Re: [Ganglia-general] Gmetad-to-Gmetad connection
I changed the second Gmetad to scalable off” and it works! Thank you! Sergey On Mar 25, 2015, at 1:48 PM, Vladimir Vuksan vladi...@vuksan.com wrote: I might have misspoke try scalable off. On March 25, 2015 4:26:55 PM EDT, Sergey svin...@apple.com wrote: Hi Vladimir, I changed to “scalable on”. I didn’t help. What I see is only the common remote grid view: CPUs Total: 120 Hosts up: 16 Hosts down:2 == Current Load Avg (15, 5, 1m): nb sp;3%, 3%, 3% Avg Utilization (last hour): 4% Localtime: 2015-03-25 10:27 === I can’t see any clusters and hosts inside this grid. By netstat I can see that the second Gmetad instance on machine2 periodically connects to the machine1:8651. I don’t see any connections to machine1:8652. The second Gmetad instance has the same ports, but it’s on another machine. Did you mean that it can affect the polling process? Any ideas? Thanks! Sergey On Mar 24, 2015, at 6:51 PM, Vladimir Vuksan vli...@veus.hr mailto:vli...@veus.hr wrote: Hi Sergey, Try setting scalable on in gmetad.conf of the second instance. From the stock gmetad.conf # Scalability mode. If on, we summarize over downstream grids, and respect # authority tags. If off, we take on 2.5.0-era behavior: we do not wrap our output # in GRID/GRID tags, we ignore all GRID tags we see, and always assume # we are the authority on d ata source feeds. This approach does not scale to # large groups of clusters, but is provided for backwards compatibility. # default: on # scalable off I have not used this feature in a long time so not sure how well it scales however it's worth a shot. Does second instance have different interactive and xml ports ? Vladimir On 03/24/2015 09:24 PM, Sergey wrote: I have one Gmetad instance collecting metrics from several clusters of hosts. Then the second Gmetad instance has to pool all data via port 8651 from the first instance and store everything in local RRDS. I can get all data from the second machine via “#nc machine1 8651”, but when I check RRDS, I don’t see any clusters, only Summary_Data folder. Why Gmetad doesn’t wr ite data into RRDS? -- Vladimir -- Dive into the World of Parallel Programming The Go Parallel Website, sponsored by Intel and developed in partnership with Slashdot Media, is your hub for all things parallel software development, from weekly thought leadership blogs to news, videos, case studies, tutorials and more. Take a look and join the conversation now. http://goparallel.sourceforge.net/___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
[Ganglia-general] Gmetad-to-Gmetad connection
Hello All, I have one Gmetad instance collecting metrics from several clusters of hosts. Then the second Gmetad instance has to pool all data via port 8651 from the first instance and store everything in local RRDS. I can get all data from the second machine via “#nc machine1 8651”, but when I check RRDS, I don’t see any clusters, only Summary_Data folder. Why Gmetad doesn’t write data into RRDS? Thanks! -- Dive into the World of Parallel Programming The Go Parallel Website, sponsored by Intel and developed in partnership with Slashdot Media, is your hub for all things parallel software development, from weekly thought leadership blogs to news, videos, case studies, tutorials and more. Take a look and join the conversation now. http://goparallel.sourceforge.net/ ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general
[Ganglia-general] Ganglia web question
Hi Everybody! I’m new in Ganglia. In my current configuration I see that Gweb and Gmetad services are running on the same machine and all data is collected on this machine in RRDS storage. Is it possible to keep all data only on Collector machine with Gmetad and RRDS and access to it from Gweb remotely? If yes - what should I change in conf.php in Gweb configuration and what should be done on Collector machine? Thanks! Sergey -- Dive into the World of Parallel Programming The Go Parallel Website, sponsored by Intel and developed in partnership with Slashdot Media, is your hub for all things parallel software development, from weekly thought leadership blogs to news, videos, case studies, tutorials and more. Take a look and join the conversation now. http://goparallel.sourceforge.net/ ___ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general