[
https://issues.apache.org/jira/browse/SOLR-5325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Christine Poerschke updated SOLR-5325:
--------------------------------------
Description:
The problem we saw was that when the solr overseer leader experienced temporary
zk connectivity problems it stopped processing overseer queue events.
This first happened when quorum within the external zk ensemble was lost due to
too many zookeepers being stopped (similar to SOLR-5199). The second time it
happened when there was a sufficient number of zookeepers but they were holding
zookeeper leadership elections and thus refused connections (the elections were
taking several seconds, we were using the default zookeeper.cnxTimeout=5s value
and it was hit for one ensemble member).
Affects Version/s: 4.3
4.4
> zk connection loss causes overseer leader loss
> ----------------------------------------------
>
> Key: SOLR-5325
> URL: https://issues.apache.org/jira/browse/SOLR-5325
> Project: Solr
> Issue Type: Bug
> Affects Versions: 4.3, 4.4
> Reporter: Christine Poerschke
>
> The problem we saw was that when the solr overseer leader experienced
> temporary zk connectivity problems it stopped processing overseer queue
> events.
> This first happened when quorum within the external zk ensemble was lost due
> to too many zookeepers being stopped (similar to SOLR-5199). The second time
> it happened when there was a sufficient number of zookeepers but they were
> holding zookeeper leadership elections and thus refused connections (the
> elections were taking several seconds, we were using the default
> zookeeper.cnxTimeout=5s value and it was hit for one ensemble member).
--
This message was sent by Atlassian JIRA
(v6.1#6144)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]