[Ganglia-general] Plan item

2016-01-06 Thread Sergey
snarayanaswamy:
Can you please work on the documentation for replicating 
ganglia/riemann setup in another data center?
Sergey:
yes, that’s in my list as discussed
snarayanaswamy:
thanks

--
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


Re: [Ganglia-general] Plan item

2016-01-06 Thread Sergey
Very sorry! Please remove this from the discussion list!

Thanks!
Sergey 


--
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


Re: [Ganglia-general] Gweb cluster page stopped working after adding 500 custom metrics per server

2015-12-20 Thread Sergey
gmetad 3.7.0

I don't use rrdcached yet.

Sergey

> On Dec 19, 2015, at 7:41 PM, Vladimir Vuksan <vli...@veus.hr> wrote:
> 
> What Ganglia gmetad version are you running? Are you using rrdcached? 
> 
> On December 18, 2015 8:09:09 PM EST, Sergey <svin...@apple.com> wrote:
> Addition: Ganglia Web log shows 500 error.
> 
> Sergey
> 
> 
> 
>  On Dec 11, 2015, at 11:52 AM, Sergey <svin...@apple.com> wrote:
>  
>  Hi All!
>  
>  We added ~500 custom metrics/server in one cluster and now this cluster page 
> stopped working. 
>  All other clusters are working properly.
>  It looks like some timeout value should be updated in Gweb, because the data 
> retrieving time was increased.
>  Do you know how to fix this?
>  
>  The mobile page is still showing all data from this cluster.
>  
>  Thanks!
>  Sergey
> 
> 
> 
> 
> Ganglia-general mailing list
> Ganglia-general@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/ganglia-general 
> <https://lists.sourceforge.net/lists/listinfo/ganglia-general>
> 
> Vladimir

--
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


Re: [Ganglia-general] Gweb cluster page stopped working after adding 500 custom metrics per server

2015-12-20 Thread Sergey
I have 30 servers in this cluster.

I'll check PHP configuration. Thanks!

Sergey

> On Dec 19, 2015, at 7:56 PM, Jesse Becker <haw...@gmail.com> wrote:
> 
> How many servers?
> 
> There aren't too many timeouts in the gweb code.  It could be something 
> related to PHP configuration.  If, for example, the PHP script is waiting for 
> gmetad to finish sending data, and the webserver kills it because it took 
> "too long."
> 
> On Fri, Dec 11, 2015 at 2:52 PM, Sergey <svin...@apple.com 
> <mailto:svin...@apple.com>> wrote:
> Hi All!
> 
> We added ~500 custom metrics/server in one cluster and now this cluster page 
> stopped working.
> All other clusters are working properly.
> It looks like some timeout value should be updated in Gweb, because the data 
> retrieving time was increased.
> Do you know how to fix this?
> 
> The mobile page is still showing all data from this cluster.
> 
> Thanks!
> Sergey
> --
> ___
> Ganglia-general mailing list
> Ganglia-general@lists.sourceforge.net 
> <mailto:Ganglia-general@lists.sourceforge.net>
> https://lists.sourceforge.net/lists/listinfo/ganglia-general 
> <https://lists.sourceforge.net/lists/listinfo/ganglia-general>
> 
> 
> 
> -- 
> Jesse Becker

--
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


Re: [Ganglia-general] Gweb cluster page stopped working after adding 500 custom metrics per server

2015-12-18 Thread Sergey
Addition: Ganglia Web log shows 500 error.

Sergey



> On Dec 11, 2015, at 11:52 AM, Sergey <svin...@apple.com> wrote:
> 
> Hi All!
> 
> We added ~500 custom metrics/server in one cluster and now this cluster page 
> stopped working. 
> All other clusters are working properly.
> It looks like some timeout value should be updated in Gweb, because the data 
> retrieving time was increased.
> Do you know how to fix this?
> 
> The mobile page is still showing all data from this cluster.
> 
> Thanks!
> Sergey


--
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


[Ganglia-general] Gweb cluster page stopped working after adding 500 custom metrics per server

2015-12-11 Thread Sergey
Hi All!

We added ~500 custom metrics/server in one cluster and now this cluster page 
stopped working. 
All other clusters are working properly.
It looks like some timeout value should be updated in Gweb, because the data 
retrieving time was increased.
Do you know how to fix this?

The mobile page is still showing all data from this cluster.

Thanks!
Sergey
--
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


[Ganglia-general] Cluster View - nodes up and down

2015-11-13 Thread Sergey

Hi All!

Is it possible to generate aggregate view for a grid which will show -  how 
many nodes in every cluster are presented and how many of them are down?
It will be useful if an aggregate view is used like a custom dashboard.

Thanks!
Sergey


--
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


Re: [Ganglia-general] Module Collection intreval

2015-08-25 Thread Sergey
I found that parameter “collect_every” allows to setup frequency of the script 
runs.
I added logging to the python module and it definitely shows that this module 
runs every 5 minutes if collect_every = 300”.

Sergey


 On Aug 10, 2015, at 12:42 PM, Sergey svin...@apple.com wrote:
 
 Hi!
 
 I use GMOND python module to collect Kafka Lag values. It periodically runs 
 some utility and parses the output.
 I don’t want to run this utility too often. How can I setup the frequency of 
 my checks?
 Does mymod.pyconf collection group parameter (collect_every = 300) limit 
 the frequency of checks?
 Or do I need to setup some general GMOND parameter?
 
 Thanks!
 
 
 --
 ___
 Ganglia-general mailing list
 Ganglia-general@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/ganglia-general

--
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


[Ganglia-general] Module Collection intreval

2015-08-10 Thread Sergey
Hi!

I use GMOND python module to collect Kafka Lag values. It periodically runs 
some utility and parses the output.
I don’t want to run this utility too often. How can I setup the frequency of my 
checks?
Does mymod.pyconf collection group parameter (collect_every = 300) limit the 
frequency of checks?
Or do I need to setup some general GMOND parameter?

Thanks!


--
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


[Ganglia-general] Process state monitoring

2015-06-30 Thread Sergey
Hi Everybody!

I see that we can monitor any process and collect CPU and memory metrics for 
this process via Python module.
Is it possible to monitor the state of the process (if running - state=Up, if 
stopped - state=Down)?
I see that CPU and memory metrics are coming independently of the process 
state, so I can’t use them to calculate the process state.
I think some “process heartbeat” monitor required. Any ideas?

Thanks!
Sergey
--
Don't Limit Your Business. Reach for the Cloud.
GigeNET's Cloud Solutions provide you with the tools and support that
you need to offload your IT needs and focus on growing your business.
Configured For All Businesses. Start Your Cloud Today.
https://www.gigenetcloud.com/
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


Re: [Ganglia-general] Process state monitoring

2015-06-30 Thread Sergey
Hi Vladimir,

This is indirect solution. I’d prefer some python module which will return 
metric “100” in case of running process and “0” - if it doesn’t exist.
Or, for example - to return number of the existing processes with the same name.


Thank you!

Sergey

 
 On Jun 30, 2015, at 11:57 AM, Vladimir Vuksan vli...@veus.hr wrote:
 
 You can alert on process memory size e.g. we have alerts that say if process 
 memory is  100 bytes it's down. Also if process memory is  X bytes it's 
 leaking memory.
 
 Vladimir
 
 06/30/2015 u 01:19 PM, Sergey je napisao/la:
 Hi Everybody!
 
 I see that we can monitor any process and collect CPU and memory metrics for 
 this process via Python module.
 Is it possible to monitor the state of the process (if running - state=Up, 
 if stopped - state=Down)?
 I see that CPU and memory metrics are coming independently of the process 
 state, so I can’t use them to calculate the process state.
 I think some “process heartbeat” monitor required. Any ideas?


--
Don't Limit Your Business. Reach for the Cloud.
GigeNET's Cloud Solutions provide you with the tools and support that
you need to offload your IT needs and focus on growing your business.
Configured For All Businesses. Start Your Cloud Today.
https://www.gigenetcloud.com/
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


Re: [Ganglia-general] GMOND + SFLOWD functionality

2015-06-01 Thread Sergey
Hi Peter,

It’s very sad. It also contradicts the Gmond topology described in the O’Reily 
book “Monitoring with Ganglia” (p.22 Fig. 2-3).
The main disadvantage of this is the fact that we have to build 2 parallel 
monitoring structures (gnome and show) with separate ports and flows, which are 
joined only in the central collection point.
Is it possible to modify Gmond agent to join Gmond and Sfow data locally on 
every monitored computer? 

Thanks!
Sergey

 On May 30, 2015, at 10:07 PM, Peter Phaal peter.ph...@gmail.com wrote:
 
 Sergey,
 
 gmond does not retransmit the sFlow metrics it receives. A single
 gmond instance is used a central collector for a cluster of machines
 running Host sFlow agents. gmetad uses a TCP connection to retrieve
 the cluster stats from the single gmond instance and update the RRDs.
 
 Peter
 
 On Fri, May 29, 2015 at 10:02 AM, Sergey svin...@apple.com wrote:
 Hi Vladimir,
 
 This is very serious question - is GMOND supposed to retransmit metrics 
 received from the local HSFLOWD agent or it just saves them locally for 
 further retrieving via TCP connection?
 What is the initial project for this?
 
 Thanks!
 Serfey Vinnik
 --
 ___
 Ganglia-general mailing list
 Ganglia-general@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/ganglia-general


--
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


Re: [Ganglia-general] GMOND + SFLOWD functionality

2015-06-01 Thread Sergey
OK. Then what does the “deaf” Gmond make with metrics to provide them to “mute” 
Gmond? I guess - it sends them.
I our case “deaf” Gmond gets metrics from local HSFLOWD and doesn’t send them.
Why it doesn’t send them to “mute” Gmond? What’s the difference between it’s 
own metrics and HSFLOWD metrics?
Did you see the content of HSWLOWD.auto ?
It says that Collector IP should be “localhost”. But it can’t be localhost 
because collected metrics will be saved but not transmitted.

Thanks!
Sergey


 
 On Jun 1, 2015, at 12:17 PM, Jesse Becker haw...@gmail.com wrote:
 
 On Mon, Jun 1, 2015 at 1:00 PM, Sergey svin...@apple.com wrote:
 It also contradicts the Gmond topology described in the O’Reily book 
 “Monitoring with Ganglia” (p.22 Fig. 2-3).
 
 I don't see how.  I'm looking at a copy of the book right now, and
 Figure 2-3 has three gmonds:  two (deaf) gmonds that send to a third
 gmond (mute) that aggregates them. There's nothing about
 retransmitting or relaying metrics at all.  Gmond doesn't retransmit
 metrics, except when polled via TCP (which is usually from gmetad).
 
 
 Hi Peter,
 
 It’s very sad. It also contradicts the Gmond topology described in the 
 O’Reily book “Monitoring with Ganglia” (p.22 Fig. 2-3).
 The main disadvantage of this is the fact that we have to build 2 parallel 
 monitoring structures (gnome and show) with separate ports and flows, which 
 are joined only in the central collection point.
 Is it possible to modify Gmond agent to join Gmond and Sfow data locally on 
 every monitored computer?
 
 Thanks!
 Sergey
 
 On May 30, 2015, at 10:07 PM, Peter Phaal peter.ph...@gmail.com wrote:
 
 Sergey,
 
 gmond does not retransmit the sFlow metrics it receives. A single
 gmond instance is used a central collector for a cluster of machines
 running Host sFlow agents. gmetad uses a TCP connection to retrieve
 the cluster stats from the single gmond instance and update the RRDs.
 
 Peter
 
 On Fri, May 29, 2015 at 10:02 AM, Sergey svin...@apple.com wrote:
 Hi Vladimir,
 
 This is very serious question - is GMOND supposed to retransmit metrics 
 received from the local HSFLOWD agent or it just saves them locally for 
 further retrieving via TCP connection?
 What is the initial project for this?
 
 Thanks!
 Serfey Vinnik
 --
 ___
 Ganglia-general mailing list
 Ganglia-general@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/ganglia-general
 
 
 --
 ___
 Ganglia-general mailing list
 Ganglia-general@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/ganglia-general
 
 
 
 -- 
 Jesse Becker


--
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


[Ganglia-general] GMOND + SFLOWD functionality

2015-05-29 Thread Sergey
Hi Vladimir,

This is very serious question - is GMOND supposed to retransmit metrics 
received from the local HSFLOWD agent or it just saves them locally for further 
retrieving via TCP connection?
What is the initial project for this?

Thanks!
Serfey Vinnik
--
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


Re: [Ganglia-general] HTTPD metrics not sent

2015-05-28 Thread Sergey
Yes, and I see in debug mode that GMOND accepts and saves HTTP metrics. 
These metrics can be retrieved over direct TCP connection from Collector, but 
they are not sent to another GMOND agent via UDP.
We use one common GMOND per cluster which is running at Collector server.
How can it be fixed?

Thanks!
Sergey

 On May 28, 2015, at 1:40 PM, Peter Phaal peter.ph...@gmail.com wrote:
 
 Have you enabled http in the sFlow section in the gmond config?
 
 http://blog.sflow.com/2011/12/using-ganglia-to-monitor-web-farms.html
 
 You should try running sflowtool on the head end gmond system to
 verify that the data is arriving:
 
 http://blog.sflow.com/2011/12/sflowtool.html
 
 On Thu, May 28, 2015 at 10:06 AM, Sergey svin...@apple.com wrote:
 Hi Everybody!
 
 I use HSFLOWD agent to collect HTTPD metrics from Apache server vis 
 mod_sflow.so module.
 I see that GMOND gets HTTPD metrics from HSFLOWD and save them in metadata, 
 but for some reason it doesn’t forward HTTPD metrics by UDP to another GMOND 
 agent.
 All other metrics are successful transfered.
 Do you know how to fix it?
 
 Thanks!
 Sergey
 
 
 
 
 --
 ___
 Ganglia-general mailing list
 Ganglia-general@lists.sourceforge.net
 sflowvalve.jarhttps://lists.sourceforge.net/lists/listinfo/ganglia-general

--
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


[Ganglia-general] HTTPD metrics not sent

2015-05-28 Thread Sergey
Hi Everybody!

I use HSFLOWD agent to collect HTTPD metrics from Apache server vis 
mod_sflow.so module.
I see that GMOND gets HTTPD metrics from HSFLOWD and save them in metadata, but 
for some reason it doesn’t forward HTTPD metrics by UDP to another GMOND agent.
All other metrics are successful transfered.
Do you know how to fix it?

Thanks!
Sergey

 


--
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


Re: [Ganglia-general] Sflow Apache metrics

2015-04-23 Thread Sergey
Great, that works for me!


Thanks!
Sergey

 On Apr 13, 2015, at 7:21 PM, Neil Mckee neil.mckee...@gmail.com wrote:
 
 Sergey,
 
 It's usually best to compile mod-sflow from sources so that it matches the 
 particular version of apache you are running.  So before you do that you have 
 the option of editing mod-sflow.c and changing the setting of 
 SFWB_DEFAULT_CONFIGFILE (on line 211).
 
 https://code.google.com/p/mod-sflow/source/browse/trunk/mod_sflow.c#211 
 https://code.google.com/p/mod-sflow/source/browse/trunk/mod_sflow.c#211
 
 Does that work for you?
 
 Separate question:  I'm not sure how hsflowd works if it doesn't start as 
 root?  What OS are you on?
 
 Neil
 
 
 On Mon, Apr 13, 2015 at 5:55 PM, Sergey svin...@apple.com 
 mailto:svin...@apple.com wrote:
 
 I found following error in Apache log: 
 
 [Mon Apr 13 23:25:14 2015] [error] (2)No such file or directory: 
 apr_stat(/etc/hsflowd.auto) failed
 
 The problem is that Hsflowd process is running in the user directory and 
 keeps hsflowd.auto file in ./run directory.
 I can’t access /etc directory and put file there also, because I don’t have 
 root access.
 Any ideas?
 
 Thanks!
 S.
  
 
 On Apr 13, 2015, at 9:36 AM, Sergey svin...@apple.com 
 mailto:svin...@apple.com wrote:
 
 Yes, I installed sflowtool and it works! 
 I get all counters except http* ones.
 That’s why I tested http://hostname/sflow http://hostname/sflow page, 
 because it uses mod_sflow in Apache.
 It looks like some Apache+sflow issue, but I don’t know how to troubleshoot 
 it.
 
 Thanks
 S.
 
 On Apr 10, 2015, at 6:28 PM, Leslie geekg...@gmail.com 
 mailto:geekg...@gmail.com wrote:
 
 Have you installed sflowtool and seen if the sflow counters are even
 getting sent out by the machine ?  My next step would be a tcpdump to
 make sure that the sflow counters are then getting sent to the
 collecting host.
 
 On Fri, Apr 10, 2015 at 4:55 PM, Sergey svin...@apple.com 
 mailto:svin...@apple.com wrote:
 Hi All!
 
 I installed mod_sflow on Apache and try to collect HTTP metrics by Gmond.
 The problem is that I don’t see any HTTP metrics coming from Hsflow to
 Gmond, nor HTTP counters via Apache http://hostname/sflow 
 http://hostname/sflow page.
 There is a list of counters, but they all have 0.
 Like this:
 
 unter method_option_count 0
 counter method_get_count 0
 counter method_head_count 0
 counter method_post_count 0
 counter method_put_count 0
 counter method_delete_count 0
 counter method_trace_count 0
 counter method_connect_count 0
 counter method_other_count 0
 counter status_1XX_count 0
 counter status_2XX_count 0
 counter status_3XX_count 0
 counter status_4XX_count 0
 counter status_5XX_count 0
 counter status_other_count 0
 string hostname xx
 gauge sampling_n 0
 
 At the same time http://hostname/server-status?auto 
 http://hostname/server-status?auto is working properly:
 
 Total Accesses: 15
 Total kBytes: 5
 
 Uptime: 149
 ReqPerSec: .100671
 BytesPerSec: 34.3624
 BytesPerReq: 341.333
 BusyWorkers: 1
 IdleWorkers: 7
 Scoreboard:
 
 Is there a way to troubleshoot this? I need Sflow metrics.
 
 Thanks!
 S.
 
 --
 BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT
 Develop your own process in accordance with the BPMN 2 standard
 Learn Process modeling best practices with Bonita BPM through live 
 exercises
 http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- 
 http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- 
 event?utm_
 source=Sourceforge_BPM_Camp_5_6_15utm_medium=emailutm_campaign=VA_SF
 ___
 Ganglia-general mailing list
 Ganglia-general@lists.sourceforge.net 
 mailto:Ganglia-general@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/ganglia-general 
 https://lists.sourceforge.net/lists/listinfo/ganglia-general
 
 
 --
 BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT
 Develop your own process in accordance with the BPMN 2 standard
 Learn Process modeling best practices with Bonita BPM through live exercises
 http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- 
 http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_
 source=Sourceforge_BPM_Camp_5_6_15utm_medium=emailutm_campaign=VA_SF___
 Ganglia-general mailing list
 Ganglia-general@lists.sourceforge.net 
 mailto:Ganglia-general@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/ganglia-general 
 https://lists.sourceforge.net/lists/listinfo/ganglia-general
 
 
 --
 BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT
 Develop your own process in accordance with the BPMN 2 standard
 Learn Process modeling best practices with Bonita BPM through live exercises
 http

Re: [Ganglia-general] no_group metrics issue

2015-04-23 Thread Sergey
Actually, it was resolved by GMOND config change:

“allow_extra_data=yes”

Thanks!
Sergey


 On Apr 22, 2015, at 5:33 PM, Sergey svin...@apple.com wrote:
 
 Hi Everybody!
 
 I have one Gmetad instance [server1] collecting metrics from several clusters 
 of hosts. Then the second Gmetad instance [server2] has to pool all data via 
 port 8651 from the first instance and store everything in local RRDS.
 The first Gmetad collects data from it’s local Gmond agent and I can see it’s 
 metrics on the [server2] Gweb, but all metrics grouping is lost for some 
 reason.
 All metrics from different groups of this server were placed into 
 [server1]/“no_group metrics” group.
 
 How can I fix it?
 
 Thanks!
 Sergey
 --
 BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT
 Develop your own process in accordance with the BPMN 2 standard
 Learn Process modeling best practices with Bonita BPM through live exercises
 http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_
 source=Sourceforge_BPM_Camp_5_6_15utm_medium=emailutm_campaign=VA_SF
 ___
 Ganglia-general mailing list
 Ganglia-general@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/ganglia-general


--
BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT
Develop your own process in accordance with the BPMN 2 standard
Learn Process modeling best practices with Bonita BPM through live exercises
http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_
source=Sourceforge_BPM_Camp_5_6_15utm_medium=emailutm_campaign=VA_SF
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


[Ganglia-general] no_group metrics issue

2015-04-22 Thread Sergey
Hi Everybody!

I have one Gmetad instance [server1] collecting metrics from several clusters 
of hosts. Then the second Gmetad instance [server2] has to pool all data via 
port 8651 from the first instance and store everything in local RRDS.
The first Gmetad collects data from it’s local Gmond agent and I can see it’s 
metrics on the [server2] Gweb, but all metrics grouping is lost for some reason.
All metrics from different groups of this server were placed into 
[server1]/“no_group metrics” group.

How can I fix it?

Thanks!
Sergey
--
BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT
Develop your own process in accordance with the BPMN 2 standard
Learn Process modeling best practices with Bonita BPM through live exercises
http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_
source=Sourceforge_BPM_Camp_5_6_15utm_medium=emailutm_campaign=VA_SF
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


Re: [Ganglia-general] Sflow Apache metrics

2015-04-13 Thread Sergey

I found following error in Apache log: 

[Mon Apr 13 23:25:14 2015] [error] (2)No such file or directory: 
apr_stat(/etc/hsflowd.auto) failed

The problem is that Hsflowd process is running in the user directory and keeps 
hsflowd.auto file in ./run directory.
I can’t access /etc directory and put file there also, because I don’t have 
root access.
Any ideas?

Thanks!
S.
 

 On Apr 13, 2015, at 9:36 AM, Sergey svin...@apple.com wrote:
 
 Yes, I installed sflowtool and it works! 
 I get all counters except http* ones.
 That’s why I tested http://hostname/sflow http://hostname/sflow page, 
 because it uses mod_sflow in Apache.
 It looks like some Apache+sflow issue, but I don’t know how to troubleshoot 
 it.
 
 Thanks
 S.
 
 On Apr 10, 2015, at 6:28 PM, Leslie geekg...@gmail.com 
 mailto:geekg...@gmail.com wrote:
 
 Have you installed sflowtool and seen if the sflow counters are even
 getting sent out by the machine ?  My next step would be a tcpdump to
 make sure that the sflow counters are then getting sent to the
 collecting host.
 
 On Fri, Apr 10, 2015 at 4:55 PM, Sergey svin...@apple.com 
 mailto:svin...@apple.com wrote:
 Hi All!
 
 I installed mod_sflow on Apache and try to collect HTTP metrics by Gmond.
 The problem is that I don’t see any HTTP metrics coming from Hsflow to
 Gmond, nor HTTP counters via Apache http://hostname/sflow 
 http://hostname/sflow page.
 There is a list of counters, but they all have 0.
 Like this:
 
 unter method_option_count 0
 counter method_get_count 0
 counter method_head_count 0
 counter method_post_count 0
 counter method_put_count 0
 counter method_delete_count 0
 counter method_trace_count 0
 counter method_connect_count 0
 counter method_other_count 0
 counter status_1XX_count 0
 counter status_2XX_count 0
 counter status_3XX_count 0
 counter status_4XX_count 0
 counter status_5XX_count 0
 counter status_other_count 0
 string hostname xx
 gauge sampling_n 0
 
 At the same time http://hostname/server-status?auto 
 http://hostname/server-status?auto is working properly:
 
 Total Accesses: 15
 Total kBytes: 5
 
 Uptime: 149
 ReqPerSec: .100671
 BytesPerSec: 34.3624
 BytesPerReq: 341.333
 BusyWorkers: 1
 IdleWorkers: 7
 Scoreboard:
 
 Is there a way to troubleshoot this? I need Sflow metrics.
 
 Thanks!
 S.
 
 --
 BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT
 Develop your own process in accordance with the BPMN 2 standard
 Learn Process modeling best practices with Bonita BPM through live exercises
 http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- 
 http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- 
 event?utm_
 source=Sourceforge_BPM_Camp_5_6_15utm_medium=emailutm_campaign=VA_SF
 ___
 Ganglia-general mailing list
 Ganglia-general@lists.sourceforge.net 
 mailto:Ganglia-general@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/ganglia-general
 
 
 --
 BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT
 Develop your own process in accordance with the BPMN 2 standard
 Learn Process modeling best practices with Bonita BPM through live exercises
 http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_
 source=Sourceforge_BPM_Camp_5_6_15utm_medium=emailutm_campaign=VA_SF___
 Ganglia-general mailing list
 Ganglia-general@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/ganglia-general

--
BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT
Develop your own process in accordance with the BPMN 2 standard
Learn Process modeling best practices with Bonita BPM through live exercises
http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_
source=Sourceforge_BPM_Camp_5_6_15utm_medium=emailutm_campaign=VA_SF___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


Re: [Ganglia-general] Sflow Apache metrics

2015-04-13 Thread Sergey
Yes, I installed sflowtool and it works! 
I get all counters except http* ones.
That’s why I tested http://hostname/sflow http://hostname/sflow page, because 
it uses mod_sflow in Apache.
It looks like some Apache+sflow issue, but I don’t know how to troubleshoot it.

Thanks
S.

 On Apr 10, 2015, at 6:28 PM, Leslie geekg...@gmail.com wrote:
 
 Have you installed sflowtool and seen if the sflow counters are even
 getting sent out by the machine ?  My next step would be a tcpdump to
 make sure that the sflow counters are then getting sent to the
 collecting host.
 
 On Fri, Apr 10, 2015 at 4:55 PM, Sergey svin...@apple.com wrote:
 Hi All!
 
 I installed mod_sflow on Apache and try to collect HTTP metrics by Gmond.
 The problem is that I don’t see any HTTP metrics coming from Hsflow to
 Gmond, nor HTTP counters via Apache http://hostname/sflow page.
 There is a list of counters, but they all have 0.
 Like this:
 
 unter method_option_count 0
 counter method_get_count 0
 counter method_head_count 0
 counter method_post_count 0
 counter method_put_count 0
 counter method_delete_count 0
 counter method_trace_count 0
 counter method_connect_count 0
 counter method_other_count 0
 counter status_1XX_count 0
 counter status_2XX_count 0
 counter status_3XX_count 0
 counter status_4XX_count 0
 counter status_5XX_count 0
 counter status_other_count 0
 string hostname xx
 gauge sampling_n 0
 
 At the same time http://hostname/server-status?auto is working properly:
 
 Total Accesses: 15
 Total kBytes: 5
 
 Uptime: 149
 ReqPerSec: .100671
 BytesPerSec: 34.3624
 BytesPerReq: 341.333
 BusyWorkers: 1
 IdleWorkers: 7
 Scoreboard:
 
 Is there a way to troubleshoot this? I need Sflow metrics.
 
 Thanks!
 S.
 
 --
 BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT
 Develop your own process in accordance with the BPMN 2 standard
 Learn Process modeling best practices with Bonita BPM through live exercises
 http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_
 source=Sourceforge_BPM_Camp_5_6_15utm_medium=emailutm_campaign=VA_SF
 ___
 Ganglia-general mailing list
 Ganglia-general@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/ganglia-general
 

--
BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT
Develop your own process in accordance with the BPMN 2 standard
Learn Process modeling best practices with Bonita BPM through live exercises
http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_
source=Sourceforge_BPM_Camp_5_6_15utm_medium=emailutm_campaign=VA_SF___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


[Ganglia-general] Sflow Apache metrics

2015-04-10 Thread Sergey
Hi All!

I installed mod_sflow on Apache and try to collect HTTP metrics by Gmond.
The problem is that I don’t see any HTTP metrics coming from Hsflow to Gmond, 
nor HTTP counters via Apache http://hostname/sflow http://hostname/sflow page.
There is a list of counters, but they all have 0.
Like this:
unter method_option_count 0
counter method_get_count 0
counter method_head_count 0
counter method_post_count 0
counter method_put_count 0
counter method_delete_count 0
counter method_trace_count 0
counter method_connect_count 0
counter method_other_count 0
counter status_1XX_count 0
counter status_2XX_count 0
counter status_3XX_count 0
counter status_4XX_count 0
counter status_5XX_count 0
counter status_other_count 0
string hostname xx
gauge sampling_n 0
At the same time http://hostname/server-status?auto 
http://hostname/server-status?auto is working properly:

Total Accesses: 15
Total kBytes: 5
Uptime: 149
ReqPerSec: .100671
BytesPerSec: 34.3624
BytesPerReq: 341.333
BusyWorkers: 1
IdleWorkers: 7
Scoreboard: 
Is there a way to troubleshoot this? I need Sflow metrics.

Thanks!
S.--
BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT
Develop your own process in accordance with the BPMN 2 standard
Learn Process modeling best practices with Bonita BPM through live exercises
http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_
source=Sourceforge_BPM_Camp_5_6_15utm_medium=emailutm_campaign=VA_SF___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


Re: [Ganglia-general] Gmetad-to-Gmetad connection

2015-03-25 Thread Sergey
Hi Vladimir,

I changed to “scalable on”. I didn’t help.
What I see is only the common remote grid view:

CPUs Total: 120 
Hosts up: 16
Hosts down:2
==
Current Load Avg (15, 5, 1m):
  3%, 3%, 3%
Avg Utilization (last hour):
  4%
Localtime:
  2015-03-25 10:27
===
I can’t see any clusters and hosts inside this grid.
By netstat I can see that the second Gmetad instance on machine2 periodically 
connects to the machine1:8651.
I don’t see any connections to machine1:8652.

The second Gmetad instance has the same ports, but it’s on another machine. Did 
you mean that it can affect the polling process?

Any ideas?

Thanks!
Sergey

 On Mar 24, 2015, at 6:51 PM, Vladimir Vuksan vli...@veus.hr wrote:
 
 Hi Sergey,
 
 Try setting
 
 scalable on
 
 in gmetad.conf of the second instance. From the stock gmetad.conf
 
 # Scalability mode. If on, we summarize over downstream grids, and respect
 # authority tags. If off, we take on 2.5.0-era behavior: we do not wrap our 
 output
 # in GRID/GRID tags, we ignore all GRID tags we see, and always assume
 # we are the authority on data source feeds. This approach does not scale to
 # large groups of clusters, but is provided for backwards compatibility.
 # default: on
 # scalable off
 
 I have not used this feature in a long time so not sure how well it scales 
 however it's worth a shot.
 
 Does second instance have different interactive and xml ports ?
 
 Vladimir
 
 
 On 03/24/2015 09:24 PM, Sergey wrote:
 I have one Gmetad instance collecting metrics from several clusters of 
 hosts. Then the second Gmetad instance has to pool all data via port 8651 
 from the first instance and store everything in local RRDS.
 I can get all data from the second machine via “#nc machine1 8651”, but 
 when I check RRDS, I don’t see any clusters, only Summary_Data folder.
 Why Gmetad doesn’t write data into RRDS?
 
 

--
Dive into the World of Parallel Programming The Go Parallel Website, sponsored
by Intel and developed in partnership with Slashdot Media, is your hub for all
things parallel software development, from weekly thought leadership blogs to
news, videos, case studies, tutorials and more. Take a look and join the 
conversation now. http://goparallel.sourceforge.net/___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


Re: [Ganglia-general] Gmetad-to-Gmetad connection

2015-03-25 Thread Sergey

I changed the second Gmetad to scalable off” and it works!

Thank you!


Sergey

 On Mar 25, 2015, at 1:48 PM, Vladimir Vuksan vladi...@vuksan.com wrote:
 
 I might have misspoke try scalable off. 
 
 On March 25, 2015 4:26:55 PM EDT, Sergey svin...@apple.com wrote:
 Hi Vladimir,
 
 I changed to “scalable on”. I didn’t help.
 What I see is only the common remote grid view:
 
 CPUs Total: 120   
 Hosts up: 16  
 Hosts down:2  
 ==
 Current Load Avg (15, 5, 1m):
  nb sp;3%, 3%, 3%
 Avg Utilization (last hour):
   4%
 Localtime:
   2015-03-25 10:27
 ===
 I can’t see any clusters and hosts inside this grid.
 By netstat I can see that the second Gmetad instance on machine2 periodically 
 connects to the machine1:8651.
 I don’t see any connections to machine1:8652.
 
 The second Gmetad instance has the same ports, but it’s on another machine. 
 Did you mean that it can affect the polling process?
 
 Any ideas?
 
 Thanks!
 Sergey
 
 On Mar 24, 2015, at 6:51 PM, Vladimir Vuksan vli...@veus.hr 
 mailto:vli...@veus.hr wrote:
 
 Hi Sergey,
 
 Try setting
 
 scalable on
 
 in gmetad.conf of the second instance. From the stock gmetad.conf
 
 # Scalability mode. If on, we summarize over downstream grids, and respect
 # authority tags. If off, we take on 2.5.0-era behavior: we do not wrap our 
 output
 # in GRID/GRID tags, we ignore all GRID tags we see, and always assume
 # we are the authority on d ata source feeds. This approach does not scale 
 to
 # large groups of clusters, but is provided for backwards compatibility.
 # default: on
 # scalable off
 
 I have not used this feature in a long time so not sure how well it scales 
 however it's worth a shot.
 
 Does second instance have different interactive and xml ports ?
 
 Vladimir
 
 
 On 03/24/2015 09:24 PM, Sergey wrote:
 I have one Gmetad instance collecting metrics from several clusters of 
 hosts. Then the second Gmetad instance has to pool all data via port 8651 
 from the first instance and store everything in local RRDS.
 I can get all data from the second machine via “#nc machine1 8651”, but 
 when I check RRDS, I don’t see any clusters, only Summary_Data folder.
 Why Gmetad doesn’t wr ite data into RRDS?
 
 
 
 
 -- 
 Vladimir

--
Dive into the World of Parallel Programming The Go Parallel Website, sponsored
by Intel and developed in partnership with Slashdot Media, is your hub for all
things parallel software development, from weekly thought leadership blogs to
news, videos, case studies, tutorials and more. Take a look and join the 
conversation now. http://goparallel.sourceforge.net/___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


[Ganglia-general] Gmetad-to-Gmetad connection

2015-03-24 Thread Sergey
Hello All,

I have one Gmetad instance collecting metrics from several clusters of hosts. 
Then the second Gmetad instance has to pool all data via port 8651 from the 
first instance and store everything in local RRDS.
I can get all data from the second machine via “#nc machine1 8651”, but when I 
check RRDS, I don’t see any clusters, only Summary_Data folder. 
Why Gmetad doesn’t write data into RRDS?

Thanks!  
--
Dive into the World of Parallel Programming The Go Parallel Website, sponsored
by Intel and developed in partnership with Slashdot Media, is your hub for all
things parallel software development, from weekly thought leadership blogs to
news, videos, case studies, tutorials and more. Take a look and join the 
conversation now. http://goparallel.sourceforge.net/
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


[Ganglia-general] Ganglia web question

2015-03-12 Thread Sergey
Hi Everybody!

I’m new in Ganglia.
In my current configuration I see that Gweb and Gmetad services are running on 
the same machine and all data is collected on this machine in RRDS storage.
Is it possible to keep all data only on Collector machine with Gmetad and RRDS 
and access to it from Gweb remotely?
If yes - what should I change in conf.php in Gweb configuration and what should 
be done on Collector machine?

Thanks!
Sergey
--
Dive into the World of Parallel Programming The Go Parallel Website, sponsored
by Intel and developed in partnership with Slashdot Media, is your hub for all
things parallel software development, from weekly thought leadership blogs to
news, videos, case studies, tutorials and more. Take a look and join the 
conversation now. http://goparallel.sourceforge.net/
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general