On 5/12/15 2:08 PM, Felipe openglx wrote:
What is the latency between your nodes?

The ones that are having trouble are all on the same switch -- latency is 0.5ms or less between them.  I have 4 other remote servers for checking access to our website from various internet locations.  They each run a scheduler and poller and each is in its own realm.  I'm not having any difficulty with them, though.

Have you restarted the scheduler after changing that setting?

Yes.  I restarted everything on all servers.

Are you using CherryPy?

I think so:
[1431461784] INFO: [Shinken] Initializing a CherryPy backend with 50 threads


I don't think it's beneficial to have too many schedulers unless you have pretty good retention set up between them. I'd recommend two plus one spare for a setup your size.

OK.  I was unsure how much capacity would be needed and was offered 5 servers, so I went with what they gave me :-)

I can make the master server run only arbiter/broker/receiver/reactionner and then have two others run pollers and schedulers, one be a spare poller and scheduler and the last be a spare arbiter/broker/receiver/reactionner.
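
As a rough sketch only, that layout might look something like the following in the Shinken object configuration.  The host names node2 through node5 and the daemon names are placeholders invented for illustration, the ports are what I believe to be the stock defaults, and the master's own arbiter/broker/receiver/reactionner definitions and the second active node (node3, same shape as node2) are left out for brevity:

```cfg
# Hypothetical names and addresses; adjust to the real hosts.

# node2: active scheduler and poller (node3 would be defined the same way)
define scheduler {
    scheduler_name  scheduler-2
    address         node2
    port            7768
    spare           0
}
define poller {
    poller_name     poller-2
    address         node2
    port            7771
    spare           0
}

# node4: spare scheduler and poller, used only if an active one goes down
define scheduler {
    scheduler_name  scheduler-spare
    address         node4
    port            7768
    spare           1
}
define poller {
    poller_name     poller-spare
    address         node4
    port            7771
    spare           1
}

# node5: spare arbiter (spare broker/receiver/reactionner would follow the same pattern)
define arbiter {
    arbiter_name    arbiter-spare
    host_name       node5
    address         node5
    port            7770
    spare           1
}
```

The only thing that distinguishes the spares here is spare set to 1; everything else mirrors the active definitions.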

What I don't understand, though, is that my current setup worked fine when I had around 2200 hosts being monitored and only started having major issues after I increased it to 3300.  It seems odd that in order to handle the extra load I need to take away some of the servers.


_______________________________________________
Shinken-devel mailing list
Shinken-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/shinken-devel
