[ 
https://issues.apache.org/jira/browse/SOLR-16416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17607944#comment-17607944
 ] 

ASF subversion and git services commented on SOLR-16416:
--------------------------------------------------------

Commit 0aa9b333d94f7573f83a414586f0ae34fe1c13a5 in solr's branch 
refs/heads/main from Houston Putman
[ https://gitbox.apache.org/repos/asf?p=solr.git;h=0aa9b333d94 ]

SOLR-16416: Add error logging to allow debugging of OverseerNodePrioritizer


> Fix silently failing Overseer Election joinAtHead during 
> testDesignatedOverseerRestarts
> ---------------------------------------------------------------------------------------
>
>                 Key: SOLR-16416
>                 URL: https://issues.apache.org/jira/browse/SOLR-16416
>             Project: Solr
>          Issue Type: Bug
>      Security Level: Public(Default Security Level. Issues are Public) 
>            Reporter: Houston Putman
>            Priority: Major
>
> OverseerRolesTest.testDesignatedOverseerRestarts has been failing 
> intermittently but persistently (around 2.5% of the time). I think this is 
> because LeaderElection.joinElection does not respect the joinAtHead flag if 
> connection issues occur while setting the leader election nodes.
> LeaderElection does not use the automatic retryOnConnLoss flags when doing 
> ZooKeeper operations. Instead, it waits for an error to come back and handles 
> the retry itself. This is fine for the normal case, because it checks whether 
> the node is already present among the leaderElection child nodes, and if so it 
> ignores the connection loss. However, when doing joinAtHead, if the child node 
> exists but is not in the position it should be, the manual retry should still 
> be performed.
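
The retry decision described above can be sketched roughly as follows. This is
an illustrative model only, not Solr's LeaderElection code: the class name,
the method shouldRetryAfterConnectionLoss, and the rule that "at head" means
one of the first two positions in the sorted child list are all assumptions
made for the example.

```java
import java.util.List;

public class JoinAtHeadRetrySketch {

    /**
     * Hypothetical helper (not part of Solr's API): decides whether a join
     * attempt that ended in a connection loss needs a manual retry.
     *
     * @param sortedElectionNodes election child nodes, sorted in election order
     * @param nodeId              identifier of the node that tried to join
     * @param joinAtHead          whether the node was supposed to join at the head
     */
    public static boolean shouldRetryAfterConnectionLoss(
            List<String> sortedElectionNodes, String nodeId, boolean joinAtHead) {
        int position = sortedElectionNodes.indexOf(nodeId);
        if (position < 0) {
            // The node was never created, so the join has to be retried.
            return true;
        }
        if (!joinAtHead) {
            // Normal case: the node exists, so the connection loss can be ignored.
            return false;
        }
        // Assumption for this sketch: "at head" means index 0 (current leader)
        // or index 1 (next in line). Anything further back is in the wrong place.
        return position > 1;
    }

    public static void main(String[] args) {
        List<String> nodes = List.of("a", "b", "c");
        System.out.println(shouldRetryAfterConnectionLoss(nodes, "c", false)); // false
        System.out.println(shouldRetryAfterConnectionLoss(nodes, "c", true));  // true
        System.out.println(shouldRetryAfterConnectionLoss(nodes, "b", true));  // false
        System.out.println(shouldRetryAfterConnectionLoss(nodes, "d", false)); // true
    }
}
```

The point of the sketch is the third branch: with joinAtHead set, the mere
existence of the child node is not enough to ignore the connection loss.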



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

