Jimmy Mårdell created CASSANDRA-7783:
----------------------------------------

             Summary: Snapshot repairs can hang forever
                 Key: CASSANDRA-7783
                 URL: https://issues.apache.org/jira/browse/CASSANDRA-7783
             Project: Cassandra
          Issue Type: Bug
            Reporter: Jimmy Mårdell


When the AntiEntropService sends the snapshot repair request, it sets up a 
callback in an ExpiringMap. If the time it takes for the snapshot exceeds the 
RPC timeout, the callback will expire from the map and the snapshot responses 
will be dropped. The repair then gets stuck forever blocking at the 
snapshotLatch.  It's not even possible to kill the repair with 
forceTerminateAllRepairSessions()

This is likely fixed in 2.0 since that part of the code is completely 
rewritten. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to