[
https://issues.apache.org/jira/browse/SOLR-11427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16189081#comment-16189081
]
Erick Erickson commented on SOLR-11427:
---------------------------------------
Well, the behavior has changed over time. "In the old days" with legacyCloud,
the replica could reconstruct itself after it had been deleted. Scenario:
> shut down Solr
> delete replica on the down node
> bring that Solr back up
The replica could recreate itself. I think there was work at one point to not
let that happen if a DELETEREPLICA had been issued. Much of that behavior is
behind us now, so we may be dealing with some remnants of how it used to be
handled.
bq: so the script would want to delete replicas from decommissioned nodes or
from a node which has replicas in down state for some reason and they don't
want it?
Not quite. Imagine a small typo:
if (replica.state.equals("ative") == false) {
    // delete the replica
}
Yeah, yeah, yeah, we can't protect users from programming errors. And that's
not a great example anyway. But you get the idea. The onlyIfDown bits are an
extra safeguard there: with them, DELETEREPLICA won't delete recovering or
active replicas, etc.
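
To make the typo scenario concrete, here is a rough, self-contained sketch. It
is not Solr code: Replica, buggyWantsDelete, guardAllows and deleted are all
hypothetical names invented for illustration, and the guard only models the
idea of onlyIfDown (refuse unless the recorded state is "down").

```java
import java.util.List;
import java.util.stream.Collectors;

public class OnlyIfDownSketch {
    // Hypothetical stand-in for one replica entry in state.json; not a Solr class.
    static final class Replica {
        final String name;
        final String state;
        Replica(String name, String state) {
            this.name = name;
            this.state = state;
        }
    }

    // The buggy script condition: "ative" is a typo, so no state ever equals it
    // and this returns true for every replica, including active ones.
    static boolean buggyWantsDelete(Replica r) {
        return r.state.equals("ative") == false;
    }

    // Assumed model of the onlyIfDown safeguard: allow deletion only when the
    // recorded state is "down".
    static boolean guardAllows(Replica r) {
        return "down".equals(r.state);
    }

    // Names of the replicas the buggy script would end up deleting,
    // with or without the onlyIfDown-style guard applied.
    static List<String> deleted(List<Replica> replicas, boolean onlyIfDown) {
        return replicas.stream()
                .filter(OnlyIfDownSketch::buggyWantsDelete)
                .filter(r -> !onlyIfDown || guardAllows(r))
                .map(r -> r.name)
                .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        List<Replica> replicas = List.of(
                new Replica("r1", "active"),
                new Replica("r2", "recovering"),
                new Replica("r3", "down"));
        System.out.println(deleted(replicas, false)); // [r1, r2, r3]
        System.out.println(deleted(replicas, true));  // [r3]
    }
}
```

Without the guard the typo wipes out every replica; with it, only the one that
is already down goes away.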
> DELETEREPLICA with onlyIfDown specified should succeed if the host node is
> not present in the live_nodes Znode
> --------------------------------------------------------------------------------------------------------------
>
> Key: SOLR-11427
> URL: https://issues.apache.org/jira/browse/SOLR-11427
> Project: Solr
> Issue Type: Bug
> Security Level: Public(Default Security Level. Issues are Public)
> Reporter: Erick Erickson
> Assignee: Erick Erickson
>
> The title says it pretty much, so opening up for discussion:
> Here's the problem. Let's say a node is killed via {{kill -9}}. The
> state.json file still says it's "active", but the node is gone from
> live_nodes. If the node in question never comes back, the replica's state
> doesn't necessarily get switched to "down", so specifying onlyIfDown fails
> with a "node is active" message. This is all documented more thoroughly in
> SOLR-9361.
> The question is whether it's sufficient and/or safe to succeed in deleting
> the replica from state.json if the state is "active" _and_ the node is NOT
> present in live_nodes.
> I'm assigning to myself, but others should feel free to take it.
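
The condition asked about in the issue description could be sketched as the
predicate below. This is only an illustration of the question under discussion,
not the current or agreed-upon behavior of DELETEREPLICA; the class and method
names are hypothetical, and the replica state, node name and live_nodes
contents are assumed to have been read from ZooKeeper already.

```java
import java.util.Set;

public class EffectiveDownSketch {
    // Returns true if an onlyIfDown delete would be allowed under the proposal:
    // either the recorded state is "down", or the state is still "active" but
    // the hosting node is absent from live_nodes (e.g. after a kill -9).
    static boolean deleteAllowed(String replicaState, String nodeName, Set<String> liveNodes) {
        if ("down".equals(replicaState)) {
            return true;
        }
        return "active".equals(replicaState) && !liveNodes.contains(nodeName);
    }

    public static void main(String[] args) {
        // Node names here are made-up examples.
        Set<String> liveNodes = Set.of("host1:8983_solr");
        System.out.println(deleteAllowed("active", "host2:8983_solr", liveNodes));
        System.out.println(deleteAllowed("active", "host1:8983_solr", liveNodes));
    }
}
```

Note that this sketch deliberately leaves other states such as "recovering"
untouched; whether those should also be covered when the node is missing from
live_nodes is part of what the issue leaves open.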