GOSSIP-22 and failure detector

Edward Capriolo Tue, 15 Nov 2016 13:02:08 -0800

I was doing some load testing and found that the maximum number of
instances that can run in the same JVM is currently limited by the
JMX-based notification system the failure detector uses.


Currently a cluster of N requires N * (N-1) JMX notification threads. I
started trying to remove this limit without building the accrual failure
detector (GOSSIP-22), but I ran into some subtle bugs and backed off
because the change did not seem worth it.
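
To give a sense of how quickly that count grows, here is a minimal sketch
of the arithmetic in Java. The class and method names are hypothetical and
are not part of the Gossip code base; the sketch only evaluates the
N * (N-1) figure quoted above and does not create any threads.

```java
// Hypothetical helper, for illustration only: evaluates the N * (N-1)
// figure from the message for a few cluster sizes.
final class NotificationThreads {

    // Number of JMX notification threads for a cluster of n instances,
    // assuming one thread per ordered pair of distinct instances.
    static long required(int n) {
        if (n < 0) {
            throw new IllegalArgumentException("cluster size must be >= 0");
        }
        return (long) n * (n - 1);
    }

    public static void main(String[] args) {
        int[] sizes = {2, 10, 50, 100};
        for (int n : sizes) {
            System.out.println("N=" + n + " -> " + required(n) + " threads");
        }
    }
}
```

Under that assumption, 10 instances in one JVM already need 90 threads and
100 instances need 9900, which is why the limit shows up in load tests.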

If anyone has any literature to contribute on building a consensus-based
failure detector, please share it. Once we cut this release, that is likely
where I will spend my attention.

Thanks,
Edward
