Re: [Ganglia-general] Plan item

2016-01-06 Thread Sergey
Very sorry! Please remove this from the discussion list!

Thanks!
Sergey 


--
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


[Ganglia-general] Plan item

2016-01-06 Thread Sergey
snarayanaswamy:
Can you please work on the documentation for replicating 
ganglia/riemann setup in another data center?
Sergey:
yes, that’s in my list as discussed
snarayanaswamy:
thanks

--
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


Re: [Ganglia-general] Gweb cluster page stopped working after adding 500 custom metrics per server

2015-12-21 Thread Sergey
I fixed this Gweb issue by adding following string in ganglia.php :



In my php.ini "memory_limit = 128M", and it was not enough for some reason.
Apache error log had following records:

[date] [error] [client x.x.x.x] PHP Fatal error:  Allowed memory size of 
134217728 bytes exhausted (tried to allocate 16385 bytes) in 
/../data/ganglia-web/ganglia.php on line 406, referer: https://../ganglia/


I'm not sure how stable is this solution, but now it works.

Thanks!
Sergey--
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


Re: [Ganglia-general] Gweb cluster page stopped working after adding 500 custom metrics per server

2015-12-20 Thread Sergey
I have 30 servers in this cluster.

I'll check PHP configuration. Thanks!

Sergey

> On Dec 19, 2015, at 7:56 PM, Jesse Becker  wrote:
> 
> How many servers?
> 
> There aren't too many timeouts in the gweb code.  It could be something 
> related to PHP configuration.  If, for example, the PHP script is waiting for 
> gmetad to finish sending data, and the webserver kills it because it took 
> "too long."
> 
> On Fri, Dec 11, 2015 at 2:52 PM, Sergey  <mailto:svin...@apple.com>> wrote:
> Hi All!
> 
> We added ~500 custom metrics/server in one cluster and now this cluster page 
> stopped working.
> All other clusters are working properly.
> It looks like some timeout value should be updated in Gweb, because the data 
> retrieving time was increased.
> Do you know how to fix this?
> 
> The mobile page is still showing all data from this cluster.
> 
> Thanks!
> Sergey
> --
> ___
> Ganglia-general mailing list
> Ganglia-general@lists.sourceforge.net 
> <mailto:Ganglia-general@lists.sourceforge.net>
> https://lists.sourceforge.net/lists/listinfo/ganglia-general 
> <https://lists.sourceforge.net/lists/listinfo/ganglia-general>
> 
> 
> 
> -- 
> Jesse Becker

--
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


Re: [Ganglia-general] Gweb cluster page stopped working after adding 500 custom metrics per server

2015-12-20 Thread Sergey
gmetad 3.7.0

I don't use rrdcached yet.

Sergey

> On Dec 19, 2015, at 7:41 PM, Vladimir Vuksan  wrote:
> 
> What Ganglia gmetad version are you running? Are you using rrdcached? 
> 
> On December 18, 2015 8:09:09 PM EST, Sergey  wrote:
> Addition: Ganglia Web log shows 500 error.
> 
> Sergey
> 
> 
> 
>  On Dec 11, 2015, at 11:52 AM, Sergey  wrote:
>  
>  Hi All!
>  
>  We added ~500 custom metrics/server in one cluster and now this cluster page 
> stopped working. 
>  All other clusters are working properly.
>  It looks like some timeout value should be updated in Gweb, because the data 
> retrieving time was increased.
>  Do you know how to fix this?
>  
>  The mobile page is still showing all data from this cluster.
>  
>  Thanks!
>  Sergey
> 
> 
> 
> 
> Ganglia-general mailing list
> Ganglia-general@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/ganglia-general 
> <https://lists.sourceforge.net/lists/listinfo/ganglia-general>
> 
> Vladimir

--
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


Re: [Ganglia-general] Gweb cluster page stopped working after adding 500 custom metrics per server

2015-12-18 Thread Sergey
Addition: Ganglia Web log shows 500 error.

Sergey



> On Dec 11, 2015, at 11:52 AM, Sergey  wrote:
> 
> Hi All!
> 
> We added ~500 custom metrics/server in one cluster and now this cluster page 
> stopped working. 
> All other clusters are working properly.
> It looks like some timeout value should be updated in Gweb, because the data 
> retrieving time was increased.
> Do you know how to fix this?
> 
> The mobile page is still showing all data from this cluster.
> 
> Thanks!
> Sergey


--
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


[Ganglia-general] Gweb cluster page stopped working after adding 500 custom metrics per server

2015-12-11 Thread Sergey
Hi All!

We added ~500 custom metrics/server in one cluster and now this cluster page 
stopped working. 
All other clusters are working properly.
It looks like some timeout value should be updated in Gweb, because the data 
retrieving time was increased.
Do you know how to fix this?

The mobile page is still showing all data from this cluster.

Thanks!
Sergey
--
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


[Ganglia-general] Cluster View - nodes up and down

2015-11-13 Thread Sergey

Hi All!

Is it possible to generate aggregate view for a grid which will show -  how 
many nodes in every cluster are presented and how many of them are down?
It will be useful if an aggregate view is used like a custom dashboard.

Thanks!
Sergey


--
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


Re: [Ganglia-general] Remote hosts are marking down

2015-08-25 Thread Sergey
Try to increase value of 

send_metadata_interval = 30 


=
Hello,I have installed Ganglia 3.7.1 on a Dell 720 cluster running CentOS 6.4. 
The Ganglia web has been running OK .
The remote nodes appeared to be down on the Ganglia web page (they were
actually up). I restarted the gmond on the remote nodes, then the Ganglia web
page showed these remote nodes were up, but exactly after 2 or 3 minutes, the
Ganglia web page said these nodes were down again.

I am using following versions:Ganglia Web Frontend version 3.7.0 .
Ganglia Web Backend (gmetad) version 3.7.2 .
Images created with RRDtool version 1.5.3.
Powered by Dwoo 1.1.1.Configuration:Gmond/* This configuration is as close to 
2.5.x default behavior as possible
   The values closely match ./gmond/metric.h definitions in 2.5.x */
globals {
  daemonize = yes
  setuid = yes
  user = ganglia
  debug_level = 0
  max_udp_msg_len = 1472
  mute = no
  deaf = no
  allow_extra_data = yes
  host_dmax = 86400 /*secs. Expires (removes from web interface) hosts in 1 day 
*/
  host_tmax = 20 /*secs */
  cleanup_threshold = 300 /*secs */
  gexec = no
  # By default gmond will use reverse DNS resolution when displaying your 
hostname
  # Uncommeting following value will override that value.
  # override_hostname = "mywebserver.domain.com"
  # If you are not using multicast this value should be set to something other 
than 0.
  # Otherwise if you restart aggregator gmond you will get empty graphs. 60 
seconds is reasonable
  send_metadata_interval = 0 /*secs */}/*
 * The cluster attributes specified will be used as part of the 
 * tag that will wrap all hosts collected by this instance.
 */

--
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


[Ganglia-general] Total value metric by cluster

2015-08-25 Thread Sergey
Hi Everybody!

Is it possible to build a metric which will show a sum of metrics of all 
servers in cluster?
For example, if we have some MessagesOut/sec rate of every server, I want to 
calculate the MessagesOut/sec rate in the whole cluster.
Existing aggregate views show partial values per server, but the sum of all 
values is not calculated. It’s only visible on the chart by eyes.
I want to use this total value in Riemann for alerting with certain thresholds.

Thanks! 
--
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


Re: [Ganglia-general] Module Collection intreval

2015-08-25 Thread Sergey
I found that parameter “collect_every” allows to setup frequency of the script 
runs.
I added logging to the python module and it definitely shows that this module 
runs every 5 minutes if "collect_every = 300”.

Sergey


> On Aug 10, 2015, at 12:42 PM, Sergey  wrote:
> 
> Hi!
> 
> I use GMOND python module to collect Kafka Lag values. It periodically runs 
> some utility and parses the output.
> I don’t want to run this utility too often. How can I setup the frequency of 
> my checks?
> Does mymod.pyconf collection group parameter ("collect_every = 300") limit 
> the frequency of checks?
> Or do I need to setup some general GMOND parameter?
> 
> Thanks!
> 
> 
> --
> ___
> Ganglia-general mailing list
> Ganglia-general@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/ganglia-general

--
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


[Ganglia-general] Module Collection intreval

2015-08-10 Thread Sergey
Hi!

I use GMOND python module to collect Kafka Lag values. It periodically runs 
some utility and parses the output.
I don’t want to run this utility too often. How can I setup the frequency of my 
checks?
Does mymod.pyconf collection group parameter ("collect_every = 300") limit the 
frequency of checks?
Or do I need to setup some general GMOND parameter?

Thanks!


--
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


Re: [Ganglia-general] Process state monitoring

2015-06-30 Thread Sergey
Hi Vladimir,

This is indirect solution. I’d prefer some python module which will return 
metric “100” in case of running process and “0” - if it doesn’t exist.
Or, for example - to return number of the existing processes with the same name.


Thank you!

Sergey

 
> On Jun 30, 2015, at 11:57 AM, Vladimir Vuksan  wrote:
> 
> You can alert on process memory size e.g. we have alerts that say if process 
> memory is < 100 bytes it's down. Also if process memory is > X bytes it's 
> leaking memory.
> 
> Vladimir
> 
> 06/30/2015 u 01:19 PM, Sergey je napisao/la:
>> Hi Everybody!
>> 
>> I see that we can monitor any process and collect CPU and memory metrics for 
>> this process via Python module.
>> Is it possible to monitor the state of the process (if running - state=Up, 
>> if stopped - state=Down)?
>> I see that CPU and memory metrics are coming independently of the process 
>> state, so I can’t use them to calculate the process state.
>> I think some “process heartbeat” monitor required. Any ideas?


--
Don't Limit Your Business. Reach for the Cloud.
GigeNET's Cloud Solutions provide you with the tools and support that
you need to offload your IT needs and focus on growing your business.
Configured For All Businesses. Start Your Cloud Today.
https://www.gigenetcloud.com/
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


[Ganglia-general] Process state monitoring

2015-06-30 Thread Sergey
Hi Everybody!

I see that we can monitor any process and collect CPU and memory metrics for 
this process via Python module.
Is it possible to monitor the state of the process (if running - state=Up, if 
stopped - state=Down)?
I see that CPU and memory metrics are coming independently of the process 
state, so I can’t use them to calculate the process state.
I think some “process heartbeat” monitor required. Any ideas?

Thanks!
Sergey
--
Don't Limit Your Business. Reach for the Cloud.
GigeNET's Cloud Solutions provide you with the tools and support that
you need to offload your IT needs and focus on growing your business.
Configured For All Businesses. Start Your Cloud Today.
https://www.gigenetcloud.com/
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


Re: [Ganglia-general] GMOND + SFLOWD functionality

2015-06-01 Thread Sergey
OK. Then what does the “deaf” Gmond make with metrics to provide them to “mute” 
Gmond? I guess - it sends them.
I our case “deaf” Gmond gets metrics from local HSFLOWD and doesn’t send them.
Why it doesn’t send them to “mute” Gmond? What’s the difference between it’s 
own metrics and HSFLOWD metrics?
Did you see the content of HSWLOWD.auto ?
It says that Collector IP should be “localhost”. But it can’t be localhost 
because collected metrics will be saved but not transmitted.

Thanks!
Sergey


 
> On Jun 1, 2015, at 12:17 PM, Jesse Becker  wrote:
> 
> On Mon, Jun 1, 2015 at 1:00 PM, Sergey  wrote:
>> It also contradicts the Gmond topology described in the O’Reily book 
>> “Monitoring with Ganglia” (p.22 Fig. 2-3).
> 
> I don't see how.  I'm looking at a copy of the book right now, and
> Figure 2-3 has three gmonds:  two (deaf) gmonds that send to a third
> gmond (mute) that aggregates them. There's nothing about
> retransmitting or relaying metrics at all.  Gmond doesn't retransmit
> metrics, except when polled via TCP (which is usually from gmetad).
> 
> 
>> Hi Peter,
>> 
>> It’s very sad. It also contradicts the Gmond topology described in the 
>> O’Reily book “Monitoring with Ganglia” (p.22 Fig. 2-3).
>> The main disadvantage of this is the fact that we have to build 2 parallel 
>> monitoring structures (gnome and show) with separate ports and flows, which 
>> are joined only in the central collection point.
>> Is it possible to modify Gmond agent to join Gmond and Sfow data locally on 
>> every monitored computer?
>> 
>> Thanks!
>> Sergey
>> 
>>> On May 30, 2015, at 10:07 PM, Peter Phaal  wrote:
>>> 
>>> Sergey,
>>> 
>>> gmond does not retransmit the sFlow metrics it receives. A single
>>> gmond instance is used a central collector for a cluster of machines
>>> running Host sFlow agents. gmetad uses a TCP connection to retrieve
>>> the cluster stats from the single gmond instance and update the RRDs.
>>> 
>>> Peter
>>> 
>>> On Fri, May 29, 2015 at 10:02 AM, Sergey  wrote:
>>>> Hi Vladimir,
>>>> 
>>>> This is very serious question - is GMOND supposed to retransmit metrics 
>>>> received from the local HSFLOWD agent or it just saves them locally for 
>>>> further retrieving via TCP connection?
>>>> What is the initial project for this?
>>>> 
>>>> Thanks!
>>>> Serfey Vinnik
>>>> --
>>>> ___
>>>> Ganglia-general mailing list
>>>> Ganglia-general@lists.sourceforge.net
>>>> https://lists.sourceforge.net/lists/listinfo/ganglia-general
>> 
>> 
>> --
>> ___
>> Ganglia-general mailing list
>> Ganglia-general@lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/ganglia-general
> 
> 
> 
> -- 
> Jesse Becker


--
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


Re: [Ganglia-general] GMOND + SFLOWD functionality

2015-06-01 Thread Sergey
Hi Peter,

It’s very sad. It also contradicts the Gmond topology described in the O’Reily 
book “Monitoring with Ganglia” (p.22 Fig. 2-3).
The main disadvantage of this is the fact that we have to build 2 parallel 
monitoring structures (gnome and show) with separate ports and flows, which are 
joined only in the central collection point.
Is it possible to modify Gmond agent to join Gmond and Sfow data locally on 
every monitored computer? 

Thanks!
Sergey

> On May 30, 2015, at 10:07 PM, Peter Phaal  wrote:
> 
> Sergey,
> 
> gmond does not retransmit the sFlow metrics it receives. A single
> gmond instance is used a central collector for a cluster of machines
> running Host sFlow agents. gmetad uses a TCP connection to retrieve
> the cluster stats from the single gmond instance and update the RRDs.
> 
> Peter
> 
> On Fri, May 29, 2015 at 10:02 AM, Sergey  wrote:
>> Hi Vladimir,
>> 
>> This is very serious question - is GMOND supposed to retransmit metrics 
>> received from the local HSFLOWD agent or it just saves them locally for 
>> further retrieving via TCP connection?
>> What is the initial project for this?
>> 
>> Thanks!
>> Serfey Vinnik
>> --
>> ___
>> Ganglia-general mailing list
>> Ganglia-general@lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/ganglia-general


--
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


[Ganglia-general] GMOND + SFLOWD functionality

2015-05-29 Thread Sergey
Hi Vladimir,

This is very serious question - is GMOND supposed to retransmit metrics 
received from the local HSFLOWD agent or it just saves them locally for further 
retrieving via TCP connection?
What is the initial project for this?

Thanks!
Serfey Vinnik
--
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


Re: [Ganglia-general] HTTPD metrics not sent

2015-05-28 Thread Sergey
Yes, and I see in debug mode that GMOND accepts and saves HTTP metrics. 
These metrics can be retrieved over direct TCP connection from Collector, but 
they are not sent to another GMOND agent via UDP.
We use one common GMOND per cluster which is running at Collector server.
How can it be fixed?

Thanks!
Sergey

> On May 28, 2015, at 1:40 PM, Peter Phaal  wrote:
> 
> Have you enabled http in the sFlow section in the gmond config?
> 
> http://blog.sflow.com/2011/12/using-ganglia-to-monitor-web-farms.html
> 
> You should try running sflowtool on the head end gmond system to
> verify that the data is arriving:
> 
> http://blog.sflow.com/2011/12/sflowtool.html
> 
> On Thu, May 28, 2015 at 10:06 AM, Sergey  wrote:
>> Hi Everybody!
>> 
>> I use HSFLOWD agent to collect HTTPD metrics from Apache server vis 
>> mod_sflow.so module.
>> I see that GMOND gets HTTPD metrics from HSFLOWD and save them in metadata, 
>> but for some reason it doesn’t forward HTTPD metrics by UDP to another GMOND 
>> agent.
>> All other metrics are successful transfered.
>> Do you know how to fix it?
>> 
>> Thanks!
>> Sergey
>> 
>>> 
>> 
>> 
>> --
>> ___
>> Ganglia-general mailing list
>> Ganglia-general@lists.sourceforge.net
>> sflowvalve.jarhttps://lists.sourceforge.net/lists/listinfo/ganglia-general

--
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


[Ganglia-general] HTTPD metrics not sent

2015-05-28 Thread Sergey
Hi Everybody!

I use HSFLOWD agent to collect HTTPD metrics from Apache server vis 
mod_sflow.so module.
I see that GMOND gets HTTPD metrics from HSFLOWD and save them in metadata, but 
for some reason it doesn’t forward HTTPD metrics by UDP to another GMOND agent.
All other metrics are successful transfered.
Do you know how to fix it?

Thanks!
Sergey

> 


--
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


Re: [Ganglia-general] no_group metrics issue

2015-04-23 Thread Sergey
Actually, it was resolved by GMOND config change:

“allow_extra_data=yes”

Thanks!
Sergey


> On Apr 22, 2015, at 5:33 PM, Sergey  wrote:
> 
> Hi Everybody!
> 
> I have one Gmetad instance [server1] collecting metrics from several clusters 
> of hosts. Then the second Gmetad instance [server2] has to pool all data via 
> port 8651 from the first instance and store everything in local RRDS.
> The first Gmetad collects data from it’s local Gmond agent and I can see it’s 
> metrics on the [server2] Gweb, but all metrics grouping is lost for some 
> reason.
> All metrics from different groups of this server were placed into 
> [server1]/“no_group metrics” group.
> 
> How can I fix it?
> 
> Thanks!
> Sergey
> --
> BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT
> Develop your own process in accordance with the BPMN 2 standard
> Learn Process modeling best practices with Bonita BPM through live exercises
> http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_
> source=Sourceforge_BPM_Camp_5_6_15&utm_medium=email&utm_campaign=VA_SF
> ___
> Ganglia-general mailing list
> Ganglia-general@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/ganglia-general


--
BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT
Develop your own process in accordance with the BPMN 2 standard
Learn Process modeling best practices with Bonita BPM through live exercises
http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_
source=Sourceforge_BPM_Camp_5_6_15&utm_medium=email&utm_campaign=VA_SF
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


Re: [Ganglia-general] Sflow Apache metrics

2015-04-23 Thread Sergey
Great, that works for me!


Thanks!
Sergey

> On Apr 13, 2015, at 7:21 PM, Neil Mckee  wrote:
> 
> Sergey,
> 
> It's usually best to compile mod-sflow from sources so that it matches the 
> particular version of apache you are running.  So before you do that you have 
> the option of editing mod-sflow.c and changing the setting of 
> SFWB_DEFAULT_CONFIGFILE (on line 211).
> 
> https://code.google.com/p/mod-sflow/source/browse/trunk/mod_sflow.c#211 
> <https://code.google.com/p/mod-sflow/source/browse/trunk/mod_sflow.c#211>
> 
> Does that work for you?
> 
> Separate question:  I'm not sure how hsflowd works if it doesn't start as 
> root?  What OS are you on?
> 
> Neil
> 
> 
> On Mon, Apr 13, 2015 at 5:55 PM, Sergey  <mailto:svin...@apple.com>> wrote:
> 
> I found following error in Apache log: 
> 
> [Mon Apr 13 23:25:14 2015] [error] (2)No such file or directory: 
> apr_stat(/etc/hsflowd.auto) failed
> 
> The problem is that Hsflowd process is running in the user directory and 
> keeps hsflowd.auto file in ./run directory.
> I can’t access /etc directory and put file there also, because I don’t have 
> root access.
> Any ideas?
> 
> Thanks!
> S.
>  
> 
>> On Apr 13, 2015, at 9:36 AM, Sergey > <mailto:svin...@apple.com>> wrote:
>> 
>> Yes, I installed sflowtool and it works! 
>> I get all counters except http* ones.
>> That’s why I tested http://hostname/sflow <http://hostname/sflow> page, 
>> because it uses mod_sflow in Apache.
>> It looks like some Apache+sflow issue, but I don’t know how to troubleshoot 
>> it.
>> 
>> Thanks
>> S.
>> 
>>> On Apr 10, 2015, at 6:28 PM, Leslie >> <mailto:geekg...@gmail.com>> wrote:
>>> 
>>> Have you installed sflowtool and seen if the sflow counters are even
>>> getting sent out by the machine ?  My next step would be a tcpdump to
>>> make sure that the sflow counters are then getting sent to the
>>> collecting host.
>>> 
>>> On Fri, Apr 10, 2015 at 4:55 PM, Sergey >> <mailto:svin...@apple.com>> wrote:
>>>> Hi All!
>>>> 
>>>> I installed mod_sflow on Apache and try to collect HTTP metrics by Gmond.
>>>> The problem is that I don’t see any HTTP metrics coming from Hsflow to
>>>> Gmond, nor HTTP counters via Apache http://hostname/sflow 
>>>> <http://hostname/sflow> page.
>>>> There is a list of counters, but they all have 0.
>>>> Like this:
>>>> 
>>>> unter method_option_count 0
>>>> counter method_get_count 0
>>>> counter method_head_count 0
>>>> counter method_post_count 0
>>>> counter method_put_count 0
>>>> counter method_delete_count 0
>>>> counter method_trace_count 0
>>>> counter method_connect_count 0
>>>> counter method_other_count 0
>>>> counter status_1XX_count 0
>>>> counter status_2XX_count 0
>>>> counter status_3XX_count 0
>>>> counter status_4XX_count 0
>>>> counter status_5XX_count 0
>>>> counter status_other_count 0
>>>> string hostname xx
>>>> gauge sampling_n 0
>>>> 
>>>> At the same time http://hostname/server-status?auto 
>>>> <http://hostname/server-status?auto> is working properly:
>>>> 
>>>> Total Accesses: 15
>>>> Total kBytes: 5
>>>> 
>>>> Uptime: 149
>>>> ReqPerSec: .100671
>>>> BytesPerSec: 34.3624
>>>> BytesPerReq: 341.333
>>>> BusyWorkers: 1
>>>> IdleWorkers: 7
>>>> Scoreboard:
>>>> 
>>>> Is there a way to troubleshoot this? I need Sflow metrics.
>>>> 
>>>> Thanks!
>>>> S.
>>>> 
>>>> --
>>>> BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT
>>>> Develop your own process in accordance with the BPMN 2 standard
>>>> Learn Process modeling best practices with Bonita BPM through live 
>>>> exercises
>>>> http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- 
>>>> <http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual-> 
>>>> event?utm_
>>>> source=Sourceforge_BPM_Camp_5_6_15&utm_medium=email&utm_campaign=VA_SF
>>>> ___
>>>> Ganglia-general mailing list
>>>>

[Ganglia-general] no_group metrics issue

2015-04-22 Thread Sergey
Hi Everybody!

I have one Gmetad instance [server1] collecting metrics from several clusters 
of hosts. Then the second Gmetad instance [server2] has to pool all data via 
port 8651 from the first instance and store everything in local RRDS.
The first Gmetad collects data from it’s local Gmond agent and I can see it’s 
metrics on the [server2] Gweb, but all metrics grouping is lost for some reason.
All metrics from different groups of this server were placed into 
[server1]/“no_group metrics” group.

How can I fix it?

Thanks!
Sergey
--
BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT
Develop your own process in accordance with the BPMN 2 standard
Learn Process modeling best practices with Bonita BPM through live exercises
http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_
source=Sourceforge_BPM_Camp_5_6_15&utm_medium=email&utm_campaign=VA_SF
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


Re: [Ganglia-general] Sflow Apache metrics

2015-04-13 Thread Sergey

I found following error in Apache log: 

[Mon Apr 13 23:25:14 2015] [error] (2)No such file or directory: 
apr_stat(/etc/hsflowd.auto) failed

The problem is that Hsflowd process is running in the user directory and keeps 
hsflowd.auto file in ./run directory.
I can’t access /etc directory and put file there also, because I don’t have 
root access.
Any ideas?

Thanks!
S.
 

> On Apr 13, 2015, at 9:36 AM, Sergey  wrote:
> 
> Yes, I installed sflowtool and it works! 
> I get all counters except http* ones.
> That’s why I tested http://hostname/sflow <http://hostname/sflow> page, 
> because it uses mod_sflow in Apache.
> It looks like some Apache+sflow issue, but I don’t know how to troubleshoot 
> it.
> 
> Thanks
> S.
> 
>> On Apr 10, 2015, at 6:28 PM, Leslie > <mailto:geekg...@gmail.com>> wrote:
>> 
>> Have you installed sflowtool and seen if the sflow counters are even
>> getting sent out by the machine ?  My next step would be a tcpdump to
>> make sure that the sflow counters are then getting sent to the
>> collecting host.
>> 
>> On Fri, Apr 10, 2015 at 4:55 PM, Sergey > <mailto:svin...@apple.com>> wrote:
>>> Hi All!
>>> 
>>> I installed mod_sflow on Apache and try to collect HTTP metrics by Gmond.
>>> The problem is that I don’t see any HTTP metrics coming from Hsflow to
>>> Gmond, nor HTTP counters via Apache http://hostname/sflow 
>>> <http://hostname/sflow> page.
>>> There is a list of counters, but they all have 0.
>>> Like this:
>>> 
>>> unter method_option_count 0
>>> counter method_get_count 0
>>> counter method_head_count 0
>>> counter method_post_count 0
>>> counter method_put_count 0
>>> counter method_delete_count 0
>>> counter method_trace_count 0
>>> counter method_connect_count 0
>>> counter method_other_count 0
>>> counter status_1XX_count 0
>>> counter status_2XX_count 0
>>> counter status_3XX_count 0
>>> counter status_4XX_count 0
>>> counter status_5XX_count 0
>>> counter status_other_count 0
>>> string hostname xx
>>> gauge sampling_n 0
>>> 
>>> At the same time http://hostname/server-status?auto 
>>> <http://hostname/server-status?auto> is working properly:
>>> 
>>> Total Accesses: 15
>>> Total kBytes: 5
>>> 
>>> Uptime: 149
>>> ReqPerSec: .100671
>>> BytesPerSec: 34.3624
>>> BytesPerReq: 341.333
>>> BusyWorkers: 1
>>> IdleWorkers: 7
>>> Scoreboard:
>>> 
>>> Is there a way to troubleshoot this? I need Sflow metrics.
>>> 
>>> Thanks!
>>> S.
>>> 
>>> --
>>> BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT
>>> Develop your own process in accordance with the BPMN 2 standard
>>> Learn Process modeling best practices with Bonita BPM through live exercises
>>> http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- 
>>> <http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual-> 
>>> event?utm_
>>> source=Sourceforge_BPM_Camp_5_6_15&utm_medium=email&utm_campaign=VA_SF
>>> ___
>>> Ganglia-general mailing list
>>> Ganglia-general@lists.sourceforge.net 
>>> <mailto:Ganglia-general@lists.sourceforge.net>
>>> https://lists.sourceforge.net/lists/listinfo/ganglia-general
>>> 
> 
> --
> BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT
> Develop your own process in accordance with the BPMN 2 standard
> Learn Process modeling best practices with Bonita BPM through live exercises
> http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_
> source=Sourceforge_BPM_Camp_5_6_15&utm_medium=email&utm_campaign=VA_SF___
> Ganglia-general mailing list
> Ganglia-general@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/ganglia-general

--
BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT
Develop your own process in accordance with the BPMN 2 standard
Learn Process modeling best practices with Bonita BPM through live exercises
http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_
source=Sourceforge_BPM_Camp_5_6_15&utm_medium=email&utm_campaign=VA_SF___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


Re: [Ganglia-general] Sflow Apache metrics

2015-04-13 Thread Sergey
Yes, I installed sflowtool and it works! 
I get all counters except http* ones.
That’s why I tested http://hostname/sflow <http://hostname/sflow> page, because 
it uses mod_sflow in Apache.
It looks like some Apache+sflow issue, but I don’t know how to troubleshoot it.

Thanks
S.

> On Apr 10, 2015, at 6:28 PM, Leslie  wrote:
> 
> Have you installed sflowtool and seen if the sflow counters are even
> getting sent out by the machine ?  My next step would be a tcpdump to
> make sure that the sflow counters are then getting sent to the
> collecting host.
> 
> On Fri, Apr 10, 2015 at 4:55 PM, Sergey  wrote:
>> Hi All!
>> 
>> I installed mod_sflow on Apache and try to collect HTTP metrics by Gmond.
>> The problem is that I don’t see any HTTP metrics coming from Hsflow to
>> Gmond, nor HTTP counters via Apache http://hostname/sflow page.
>> There is a list of counters, but they all have 0.
>> Like this:
>> 
>> unter method_option_count 0
>> counter method_get_count 0
>> counter method_head_count 0
>> counter method_post_count 0
>> counter method_put_count 0
>> counter method_delete_count 0
>> counter method_trace_count 0
>> counter method_connect_count 0
>> counter method_other_count 0
>> counter status_1XX_count 0
>> counter status_2XX_count 0
>> counter status_3XX_count 0
>> counter status_4XX_count 0
>> counter status_5XX_count 0
>> counter status_other_count 0
>> string hostname xx
>> gauge sampling_n 0
>> 
>> At the same time http://hostname/server-status?auto is working properly:
>> 
>> Total Accesses: 15
>> Total kBytes: 5
>> 
>> Uptime: 149
>> ReqPerSec: .100671
>> BytesPerSec: 34.3624
>> BytesPerReq: 341.333
>> BusyWorkers: 1
>> IdleWorkers: 7
>> Scoreboard:
>> 
>> Is there a way to troubleshoot this? I need Sflow metrics.
>> 
>> Thanks!
>> S.
>> 
>> --
>> BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT
>> Develop your own process in accordance with the BPMN 2 standard
>> Learn Process modeling best practices with Bonita BPM through live exercises
>> http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_
>> source=Sourceforge_BPM_Camp_5_6_15&utm_medium=email&utm_campaign=VA_SF
>> ___
>> Ganglia-general mailing list
>> Ganglia-general@lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/ganglia-general
>> 

--
BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT
Develop your own process in accordance with the BPMN 2 standard
Learn Process modeling best practices with Bonita BPM through live exercises
http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_
source=Sourceforge_BPM_Camp_5_6_15&utm_medium=email&utm_campaign=VA_SF___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


[Ganglia-general] Sflow Apache metrics

2015-04-10 Thread Sergey
Hi All!

I installed mod_sflow on Apache and try to collect HTTP metrics by Gmond.
The problem is that I don’t see any HTTP metrics coming from Hsflow to Gmond, 
nor HTTP counters via Apache http://hostname/sflow  page.
There is a list of counters, but they all have 0.
Like this:
unter method_option_count 0
counter method_get_count 0
counter method_head_count 0
counter method_post_count 0
counter method_put_count 0
counter method_delete_count 0
counter method_trace_count 0
counter method_connect_count 0
counter method_other_count 0
counter status_1XX_count 0
counter status_2XX_count 0
counter status_3XX_count 0
counter status_4XX_count 0
counter status_5XX_count 0
counter status_other_count 0
string hostname xx
gauge sampling_n 0
At the same time http://hostname/server-status?auto 
 is working properly:

Total Accesses: 15
Total kBytes: 5
Uptime: 149
ReqPerSec: .100671
BytesPerSec: 34.3624
BytesPerReq: 341.333
BusyWorkers: 1
IdleWorkers: 7
Scoreboard: 
Is there a way to troubleshoot this? I need Sflow metrics.

Thanks!
S.--
BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT
Develop your own process in accordance with the BPMN 2 standard
Learn Process modeling best practices with Bonita BPM through live exercises
http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_
source=Sourceforge_BPM_Camp_5_6_15&utm_medium=email&utm_campaign=VA_SF___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


Re: [Ganglia-general] Gmetad-to-Gmetad connection

2015-03-25 Thread Sergey

I changed the second Gmetad to "scalable off” and it works!

Thank you!


Sergey

> On Mar 25, 2015, at 1:48 PM, Vladimir Vuksan  wrote:
> 
> I might have misspoke try scalable off. 
> 
> On March 25, 2015 4:26:55 PM EDT, Sergey  wrote:
> Hi Vladimir,
> 
> I changed to “scalable on”. I didn’t help.
> What I see is only the common remote grid view:
> 
> CPUs Total: 120   
> Hosts up: 16  
> Hosts down:2  
> ==
> Current Load Avg (15, 5, 1m):
>  &nb sp;3%, 3%, 3%
> Avg Utilization (last hour):
>   4%
> Localtime:
>   2015-03-25 10:27
> ===
> I can’t see any clusters and hosts inside this grid.
> By netstat I can see that the second Gmetad instance on machine2 periodically 
> connects to the machine1:8651.
> I don’t see any connections to machine1:8652.
> 
> The second Gmetad instance has the same ports, but it’s on another machine. 
> Did you mean that it can affect the polling process?
> 
> Any ideas?
> 
> Thanks!
> Sergey
> 
>> On Mar 24, 2015, at 6:51 PM, Vladimir Vuksan > <mailto:vli...@veus.hr>> wrote:
>> 
>> Hi Sergey,
>> 
>> Try setting
>> 
>> scalable on
>> 
>> in gmetad.conf of the second instance. From the stock gmetad.conf
>> 
>> # Scalability mode. If on, we summarize over downstream grids, and respect
>> # authority tags. If off, we take on 2.5.0-era behavior: we do not wrap our 
>> output
>> # in  tags, we ignore all  tags we see, and always assume
>> # we are the "authority" on d ata source feeds. This approach does not scale 
>> to
>> # large groups of clusters, but is provided for backwards compatibility.
>> # default: on
>> # scalable off
>> 
>> I have not used this feature in a long time so not sure how well it scales 
>> however it's worth a shot.
>> 
>> Does second instance have different interactive and xml ports ?
>> 
>> Vladimir
>> 
>> 
>> On 03/24/2015 09:24 PM, Sergey wrote:
>>> I have one Gmetad instance collecting metrics from several clusters of 
>>> hosts. Then the second Gmetad instance has to pool all data via port 8651 
>>> from the first instance and store everything in local RRDS.
>>> I can get all data from the second machine via “#>nc machine1 8651”, but 
>>> when I check RRDS, I don’t see any clusters, only Summary_Data folder.
>>> Why Gmetad doesn’t wr ite data into RRDS?
>>> 
>> 
> 
> 
> -- 
> Vladimir

--
Dive into the World of Parallel Programming The Go Parallel Website, sponsored
by Intel and developed in partnership with Slashdot Media, is your hub for all
things parallel software development, from weekly thought leadership blogs to
news, videos, case studies, tutorials and more. Take a look and join the 
conversation now. http://goparallel.sourceforge.net/___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


Re: [Ganglia-general] Gmetad-to-Gmetad connection

2015-03-25 Thread Sergey
Hi Vladimir,

I changed to “scalable on”. I didn’t help.
What I see is only the common remote grid view:

CPUs Total: 120 
Hosts up: 16
Hosts down:2
==
Current Load Avg (15, 5, 1m):
  3%, 3%, 3%
Avg Utilization (last hour):
  4%
Localtime:
  2015-03-25 10:27
===
I can’t see any clusters and hosts inside this grid.
By netstat I can see that the second Gmetad instance on machine2 periodically 
connects to the machine1:8651.
I don’t see any connections to machine1:8652.

The second Gmetad instance has the same ports, but it’s on another machine. Did 
you mean that it can affect the polling process?

Any ideas?

Thanks!
Sergey

> On Mar 24, 2015, at 6:51 PM, Vladimir Vuksan  wrote:
> 
> Hi Sergey,
> 
> Try setting
> 
> scalable on
> 
> in gmetad.conf of the second instance. From the stock gmetad.conf
> 
> # Scalability mode. If on, we summarize over downstream grids, and respect
> # authority tags. If off, we take on 2.5.0-era behavior: we do not wrap our 
> output
> # in  tags, we ignore all  tags we see, and always assume
> # we are the "authority" on data source feeds. This approach does not scale to
> # large groups of clusters, but is provided for backwards compatibility.
> # default: on
> # scalable off
> 
> I have not used this feature in a long time so not sure how well it scales 
> however it's worth a shot.
> 
> Does second instance have different interactive and xml ports ?
> 
> Vladimir
> 
> 
> On 03/24/2015 09:24 PM, Sergey wrote:
>> I have one Gmetad instance collecting metrics from several clusters of 
>> hosts. Then the second Gmetad instance has to pool all data via port 8651 
>> from the first instance and store everything in local RRDS.
>> I can get all data from the second machine via “#>nc machine1 8651”, but 
>> when I check RRDS, I don’t see any clusters, only Summary_Data folder.
>> Why Gmetad doesn’t write data into RRDS?
>> 
> 

--
Dive into the World of Parallel Programming The Go Parallel Website, sponsored
by Intel and developed in partnership with Slashdot Media, is your hub for all
things parallel software development, from weekly thought leadership blogs to
news, videos, case studies, tutorials and more. Take a look and join the 
conversation now. http://goparallel.sourceforge.net/___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


[Ganglia-general] Gmetad-to-Gmetad connection

2015-03-24 Thread Sergey
Hello All,

I have one Gmetad instance collecting metrics from several clusters of hosts. 
Then the second Gmetad instance has to pool all data via port 8651 from the 
first instance and store everything in local RRDS.
I can get all data from the second machine via “#>nc machine1 8651”, but when I 
check RRDS, I don’t see any clusters, only Summary_Data folder. 
Why Gmetad doesn’t write data into RRDS?

Thanks!  
--
Dive into the World of Parallel Programming The Go Parallel Website, sponsored
by Intel and developed in partnership with Slashdot Media, is your hub for all
things parallel software development, from weekly thought leadership blogs to
news, videos, case studies, tutorials and more. Take a look and join the 
conversation now. http://goparallel.sourceforge.net/
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general


[Ganglia-general] Ganglia web question

2015-03-12 Thread Sergey
Hi Everybody!

I’m new in Ganglia.
In my current configuration I see that Gweb and Gmetad services are running on 
the same machine and all data is collected on this machine in RRDS storage.
Is it possible to keep all data only on Collector machine with Gmetad and RRDS 
and access to it from Gweb remotely?
If yes - what should I change in conf.php in Gweb configuration and what should 
be done on Collector machine?

Thanks!
Sergey
--
Dive into the World of Parallel Programming The Go Parallel Website, sponsored
by Intel and developed in partnership with Slashdot Media, is your hub for all
things parallel software development, from weekly thought leadership blogs to
news, videos, case studies, tutorials and more. Take a look and join the 
conversation now. http://goparallel.sourceforge.net/
___
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general