[
https://issues.apache.org/jira/browse/RATIS-2245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tsz-wo Sze updated RATIS-2245:
------------------------------
Component/s: server
Summary: StateMachineUpdater should wait for all apply transaction
futures before taking snapshot and group remove (was: Ratis should wait for
all apply transaction futures before taking snapshot and group remove)
> StateMachineUpdater should wait for all apply transaction futures before
> taking snapshot and group remove
> ---------------------------------------------------------------------------------------------------------
>
> Key: RATIS-2245
> URL: https://issues.apache.org/jira/browse/RATIS-2245
> Project: Ratis
> Issue Type: Bug
> Components: server
> Reporter: Swaminathan Balachandran
> Assignee: Swaminathan Balachandran
> Priority: Critical
> Time Spent: 10m
> Remaining Estimate: 0h
>
> On Ratis Snapshot and group removal the statemachine just waits for apply
> transactions that have been applied on a single iteration. If there are no
> more transactions added onto the state machine and all of the apply
> transaction future are still in progress. The state machine ends up not
> waiting for the updater thread and ends up calling the notifyGroupRemove
> function and deletes the raft group directory. So this could lead to some
> node not being able to apply some of the transactions still in flight in case
> of a restart.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)