[ 
https://issues.apache.org/jira/browse/IGNITE-18495?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17654325#comment-17654325
 ] 

Mirza Aliev commented on IGNITE-18495:
--------------------------------------

[~rpuch] thank you for the contribution, ok from my side 

> Fix RAFT snapshot installation hang due to response swap on retry
> -----------------------------------------------------------------
>
>                 Key: IGNITE-18495
>                 URL: https://issues.apache.org/jira/browse/IGNITE-18495
>             Project: Ignite
>          Issue Type: Bug
>            Reporter: Roman Puchkovskiy
>            Assignee: Roman Puchkovskiy
>            Priority: Major
>              Labels: ignite-3
>             Fix For: 3.0.0-beta2
>
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> The scenario follows:
>  # InstallSnapshot request is sent, its processing starts hanging forever (it 
> will be cancelled on step 3)
>  # After a timeout, second InstallSnapshot request is sent with same 
> index+term as the first had; in JRaft, it causes a special handling (previous 
> request processing is NOT cancelled)
>  # After a timeout, third InstallSnapshot request is sent with DIFFERENT 
> index, so it cancels the first snapshot processing effectively unblocking the 
> first thread
> In the original JRaft implementation, after being unblocked, the first thread 
> fails to clean up, so subsequent retries will always see a phantom of an 
> unfinished snapshot, so the snapshotting process will be jammed. Also, node 
> stop might stuck because one 'download' task will remain unfinished forever.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to