On 5/12/15 2:08 PM, Felipe openglx wrote:
The ones that are having trouble are all on the same switch -- latency is 0.5ms or less between them. I have 4 other remote servers for checking access to our website from various internet locations. They each run a scheduler and poller and each is in its own realm. I'm not having any difficulty with them, though.
Yes. I restarted everything on all servers.
I think so: [1431461784] INFO: [Shinken] Initializing a CherryPy backend with 50 threads
OK. I was unsure how much capacity would be needed and was offered 5 servers so I went with what they gave me :-) I can make the master server run only arbiter/broker/receiver/reactionner and then have two others run pollers and schedulers, one be a spare poller and scheduler and the last be a spare arbiter/broker/receiver/reactionner. What I don't understand though, is that my current setup worked fine when I had around 2200 hosts being monitored and only started having major issues after I increased it to 3300. It seems odd that in order to handle the extra load I need to take away some of the servers. |
------------------------------------------------------------------------------ One dashboard for servers and applications across Physical-Virtual-Cloud Widest out-of-the-box monitoring support with 50+ applications Performance metrics, stats and reports that give you Actionable Insights Deep dive visibility with transaction tracing using APM Insight. http://ad.doubleclick.net/ddm/clk/290420510;117567292;y
_______________________________________________ Shinken-devel mailing list Shinken-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/shinken-devel