[jira] [Commented] (CASSANDRA-3199) Counter write protocol: have the coordinator (instead of first replica) waits for replica responses directly

Ryan King (JIRA) Tue, 13 Sep 2011 10:03:34 -0700

    [ 
https://issues.apache.org/jira/browse/CASSANDRA-3199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13103760#comment-13103760
 ]


Ryan King commented on CASSANDRA-3199:
--------------------------------------

This may be out of scope for this ticket, but can we differentiate between 
exceptions in writing to the first replica and in replicating to the others? 
That might help us do some limited forms of retries with counters.

> Counter write protocol: have the coordinator (instead of first replica) waits 
> for replica responses directly
> ------------------------------------------------------------------------------------------------------------
>
>                 Key: CASSANDRA-3199
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-3199
>             Project: Cassandra
>          Issue Type: Improvement
>          Components: Core
>            Reporter: Sylvain Lebresne
>            Priority: Minor
>              Labels: counters
>
> Current counter write protocol is this (where we take the case of write 
> coordinator != first replica):
>   # coordinator forward write request to first replica
>   # first replica write locally and replicate to other replica
>   # first replica waits for enough answers from the other replica to satisfy 
> the consistency level
>   # first replica acks the coordinator that completes the write to the client
> This ticket proposes to modify this protocol to:
>   # coordinator forward write request to first replica
>   # first replica write locally, acks the coordinator for its own write and 
> replicate to other replica
>   # other replica respond directly to coordinator
>   # once coordinator has enough responses, it completes the write
> I see 2 advantages to this new protocol:
>   * it should be at tad faster since it parallelizes wire transfer better
>   * it woud make TimeoutException a bit less likely and more importantly, a 
> TimeoutException would much more likely mean that the write hasn't been 
> persisted. Indeed, in the current protocol, once the first replica has send 
> the write to the other replica, it has to wait for the replica answers and 
> answer the coordinator. If it dies during that time, we will return a 
> TimeoutException, even though the first replica died after having done it's 
> main job.
> The cons is that this adds a bit of complexity. In particular, the "other 
> replica" would have to answer to the coordinator for a query that has been 
> issued by the first replica.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (CASSANDRA-3199) Counter write protocol: have the coordinator (instead of first replica) waits for replica responses directly

Reply via email to