[ 
https://issues.apache.org/jira/browse/CASSANDRA-6338?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis updated CASSANDRA-6338:
--------------------------------------

    Attachment: 6338.txt

Attached.

Of course, this doesn't help if there is more work than one thread can handle 
and it just falls farther and farther behind, but I don't think we've seen a 
failure scenario like that yet (at least post-CASSANDRA-6244).  And even then, 
this at least gives users something obvious in the log pointing to the cause 
of the problem, instead of "your cluster mysteriously marked everyone down and 
started serving up UnavailableExceptions (UAE)."

> Make gossip tolerate slow Gossip tasks
> --------------------------------------
>
>                 Key: CASSANDRA-6338
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-6338
>             Project: Cassandra
>          Issue Type: Bug
>          Components: Core
>            Reporter: Jonathan Ellis
>            Assignee: Jonathan Ellis
>            Priority: Minor
>              Labels: gossip
>             Fix For: 2.0.3
>
>         Attachments: 6338.txt
>
>
> Currently, if a single gossip task bogs down the gossip Stage, Gossip will 
> mark everyone down because it hasn't seen updates from them (they are all 
> queued behind the slow task).
> This means that full GCs, as well as genuinely problematic tasks such as 
> recomputing pending ranges, can cause gossip "flapping".
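
The failure mode in the description can be illustrated with a small, self-contained Java model. This is only a sketch under stated assumptions, not Cassandra's Gossiper code and not the contents of 6338.txt: the class name, both thresholds, and the log wording are hypothetical. It shows one plausible way to tolerate a stalled single-threaded stage: when tasks are queued and nothing has been processed recently, skip the liveness check and log a warning rather than marking peers down.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

/** Hypothetical model of a single-threaded gossip stage; names are illustrative only. */
final class GossipStatusCheckSketch {
    // Assumed thresholds, chosen for illustration; not Cassandra's actual values.
    static final long STALL_THRESHOLD_MS = 1_000;
    static final long DOWN_THRESHOLD_MS = 10_000;

    /** Time (ms) at which an update from each peer was last processed. */
    private final Map<String, Long> lastHeardMs = new TreeMap<>();
    /** Time (ms) at which the stage last finished processing any task. */
    private long lastProcessedMs;
    /** Tasks waiting in the stage's queue. */
    private int pendingTasks;
    /** Captured log lines, standing in for a real logger. */
    final List<String> log = new ArrayList<>();

    /** Simulates updates piling up behind a slow task. */
    void enqueue(int tasks) {
        pendingTasks += tasks;
    }

    /** Simulates the stage processing one update from the given peer. */
    void processUpdate(String peer, long nowMs) {
        if (pendingTasks > 0) {
            pendingTasks--;
        }
        lastHeardMs.put(peer, nowMs);
        lastProcessedMs = nowMs;
    }

    /** Returns the peers considered down, or nothing if the local stage looks stalled. */
    List<String> statusCheck(long nowMs) {
        List<String> down = new ArrayList<>();
        boolean stalled = pendingTasks > 0 && nowMs - lastProcessedMs > STALL_THRESHOLD_MS;
        if (stalled) {
            // Silence from peers is probably our own backlog, so do not convict anyone.
            log.add("WARN gossip stage has " + pendingTasks
                    + " pending tasks; skipping status check");
            return down;
        }
        for (Map.Entry<String, Long> e : lastHeardMs.entrySet()) {
            if (nowMs - e.getValue() > DOWN_THRESHOLD_MS) {
                down.add(e.getKey());
            }
        }
        return down;
    }
}
```

As in the comment above, a guard like this does not help when the stage never catches up: the check would simply keep being skipped, although the warning would at least keep appearing in the log.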



--
This message was sent by Atlassian JIRA
(v6.1#6144)
