[
https://issues.apache.org/jira/browse/SOLR-14278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
David Smiley updated SOLR-14278:
--------------------------------
Attachment: SOLR-14278.patch
Status: Open (was: Open)
Attached is an updated patch that applies to main. It's internally failing
before it gets to the end... connection refused errors. I didn't look closely
-- I just tried it and hoped for luck but got none. Anyone who wants to look
further should start a fresh PR from the new Git solr repo, not the old
lucene-solr joint repo.
> data loss during live shard split if leader dies
> ------------------------------------------------
>
> Key: SOLR-14278
> URL: https://issues.apache.org/jira/browse/SOLR-14278
> Project: Solr
> Issue Type: Bug
> Reporter: Yonik Seeley
> Priority: Major
> Attachments: SOLR-14278.patch
>
> Time Spent: 10m
> Remaining Estimate: 0h
>
> While trying to develop better tests for shared storage (SOLR-13101), I ran
> across another failure for normal replica types as well (one of the first
> things I do when a test fails for shared storage is to try and validate that
> normal NRT replicas succeed.) The PR I'll open has a test adapted from the
> one in SOLR-13813 for master.
> Scenario:
> - indexing is happening during shard split
> - leader is killed shortly after (before the split has finished) and never
> brought back up
> - there are often some missing documents at the end
> While it's possible that the simulated killing of the node in the unit test
> is imperfect, I haven't reproduced a failure if I comment out the split
> command and just kill the leader.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]