[ 
https://issues.apache.org/jira/browse/CASSANDRA-2253?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13000615#comment-13000615
 ] 

Hudson commented on CASSANDRA-2253:
-----------------------------------

Integrated in Cassandra-0.7 #333 (See 
[https://hudson.apache.org/hudson/job/Cassandra-0.7/333/])
    movefile deletions off of scheduledtasks executor
patch by Mikael Sitruk; reviewed by jbellis for CASSANDRA-2253


> Gossiper Starvation
> -------------------
>
>                 Key: CASSANDRA-2253
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-2253
>             Project: Cassandra
>          Issue Type: Bug
>          Components: Core
>    Affects Versions: 0.7.0
>         Environment: linux, windows
>            Reporter: Mikael Sitruk
>            Assignee: Mikael Sitruk
>             Fix For: 0.7.4
>
>         Attachments: CASSANDRA-0.7-2253.txt
>
>   Original Estimate: 2h
>  Remaining Estimate: 2h
>
> Gossiper periodic task will get into starvation in case large sstable files 
> need to be deleted.
> Indeed the SSTableDeletingReference uses the same scheduledTasks pool (from 
> StorageService) as the Gossiper and other periodic tasks, but the gossiper 
> tasks should run each second to assure correct cluster status (liveness of 
> nodes). In case of large sstable files to be deleted (several GB) the delete 
> operation can take more than 30 sec, thus making the whole cluster going into 
> a wrong state where nodes are marked as not living while they are!
> This will lead to unneeded additional load like hinted hand off, wrong 
> cluster state, increase in latency.
> One of the possible solution is to use a separate pool for periodic and non 
> periodic tasks. 
> I've implemented such change and it resolves the problem. 
> I can provide a patch 

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to