[
https://issues.apache.org/jira/browse/SOLR-5325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13792671#comment-13792671
]
Mark Miller edited comment on SOLR-5325 at 10/11/13 2:50 PM:
-------------------------------------------------------------
Added some more testing that I thought would catch it, but it has not yet on my
system. Still poking around a bit.
Anyway, I've committed the fix.
was (Author: [email protected]):
Add some more testing that I thought would catch it, but it has not yet on my
system. Still poking around a bit.
Anyway, I've committed the fix.
> zk connection loss causes overseer leader loss
> ----------------------------------------------
>
> Key: SOLR-5325
> URL: https://issues.apache.org/jira/browse/SOLR-5325
> Project: Solr
> Issue Type: Bug
> Affects Versions: 4.3, 4.4, 4.5
> Reporter: Christine Poerschke
> Assignee: Mark Miller
> Fix For: 4.5.1, 4.6, 5.0
>
> Attachments: SOLR-5325.patch, SOLR-5325.patch, SOLR-5325.patch
>
>
> The problem we saw was that when the solr overseer leader experienced
> temporary zk connectivity problems it stopped processing overseer queue
> events.
> This first happened when quorum within the external zk ensemble was lost due
> to too many zookeepers being stopped (similar to SOLR-5199). The second time
> it happened when there was a sufficient number of zookeepers but they were
> holding zookeeper leadership elections and thus refused connections (the
> elections were taking several seconds, we were using the default
> zookeeper.cnxTimeout=5s value and it was hit for one ensemble member).
--
This message was sent by Atlassian JIRA
(v6.1#6144)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]