Can I assume that without a leader the shard will not respond to write requests? I can search on the collection. If I can't update docs or add any new docs then this becomes an emergency.
Erick Erickson wrote > It's worth a try to take down your entire cluster. Bring one machine > back up at a time. There _may_ be something like a 3 minute wait > before each of the replicas on that machine come up, the leader > election process has a 180 second delay before the replicas on that > node take over leadership to wait for the last known good leader to > come up. > > Continue bringing one node up at a time and wait patiently until all > the replicas on it are green and until you have a leader for each > shard elected. Bringing up the rest of the Solr nodes should be > quicker then. > > Be sure to sequence things so you have known good Solr nodes come up > first for the shard that's wonky. By that I mean that the first node > you bring up for the leaderless shard should be the one with the best > chance of having a totally OK index. > > > Let's claim that the above does bring up a leader for each shard. If > you still have a replica that refuses to come up, use the > DELETEREPLICA command to remove it. Just for insurance, I'd take the > Solr node down after the DELETEREPLICA and remove the entire core > directory for the replica that didn't come up. Then restart the node > and use the ADDREPLICA collections API command to put it back. > > If none of that works, you could try hand-editing the state.json file > and _make_ one of the shards a leader (I'd do this with the Solr nodes > down), but that's not for the faint of heart. > > Best, > Erick > > On Wed, Feb 1, 2017 at 1:57 PM, Jeff Wartes < > jwartes@ > > wrote: >> Sounds similar to a thread last year: >> http://lucene.472066.n3.nabble.com/Node-not-recovering-leader-elections-not-occuring-tp4287819p4287866.html >> >> >> >> On 2/1/17, 7:49 AM, "tedsolr" < > tsmith@ > > wrote: >> >> I have version 5.2.1. Short of an upgrade, are there any remedies? >> >> >> Erick Erickson wrote >> > What version of Solr? since 5.4 there's been a FORCELEADER >> colelctions >> > API call that might help. >> > >> > I'd run it with the newly added replicas offline. you only want it >> to >> > have good replicas to choose from. >> > >> > Best, >> > Erick >> > >> > On Wed, Feb 1, 2017 at 6:48 AM, tedsolr < >> >> > tsmith@ >> >> > > wrote: >> >> Update! I did find an error: >> >> >> >> 2017-02-01 09:23:22.673 ERROR org.apache.solr.common.SolrException >> >> :org.apache.solr.common.SolrException: Error getting leader from >> zk for >> >> shard shard1 >> >> .... >> >> Caused by: org.apache.solr.common.SolrException: Could not get >> leader >> >> props >> >> at >> >> >> org.apache.solr.cloud.ZkController.getLeaderProps(ZkController.java:1040) >> >> at >> >> >> org.apache.solr.cloud.ZkController.getLeaderProps(ZkController.java:1004) >> >> at >> >> >> org.apache.solr.cloud.ZkController.getLeader(ZkController.java:960) >> >> ... 14 more >> >> Caused by: org.apache.zookeeper.KeeperException$NoNodeException: >> >> KeeperErrorCode = NoNode for /collections/colname/leaders/shard1 >> >> at >> >> >> org.apache.zookeeper.KeeperException.create(KeeperException.java:111) >> >> >> >> When I view the cluster status I see that this shard does not have >> a >> >> leader. >> >> So it appears I need to force the leader designation to the >> "active" >> >> replica. How do I do that? >> >> >> >> >> >> >> >> -- >> >> View this message in context: >> >> >> http://lucene.472066.n3.nabble.com/Collection-will-not-replicate-tp4318260p4318265.html >> >> Sent from the Solr - User mailing list archive at Nabble.com. >> >> >> >> >> >> -- >> View this message in context: >> http://lucene.472066.n3.nabble.com/Collection-will-not-replicate-tp4318260p4318283.html >> Sent from the Solr - User mailing list archive at Nabble.com. >> >> -- View this message in context: http://lucene.472066.n3.nabble.com/Collection-will-not-replicate-tp4318260p4318479.html Sent from the Solr - User mailing list archive at Nabble.com.