Mano Kovacs created SOLR-11431:
----------------------------------

             Summary: Leader candidate cannot become leader if replica responds 
500 to PeerSync
                 Key: SOLR-11431
                 URL: https://issues.apache.org/jira/browse/SOLR-11431
             Project: Solr
          Issue Type: Bug
      Security Level: Public (Default Security Level. Issues are Public)
    Affects Versions: 7.0
            Reporter: Mano Kovacs


When leader candidate does PeerSync to all replicas, to download any missing 
updates, it is tolerant to failures. It uses {{cantReachIsSuccess=true}} switch 
which handles connection issue, 404 and 503 as success, since replicas being 
DOWN should not affect the process.

However, if a replica has disk issues, the core initialization might fail and 
that results in {{500}} instead of {{503}}. I failing replica like that can 
prevent any other replicas becoming the leader.

Proposing either:
* Accepting {{500}} as "cant reach" so leader candidate can go on
or
* Changing {{SolrCoreInitializationException}} to return {{503}} instead of 
{{500}}
* * this might be API change, however



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to