[jira] [Commented] (CASSANDRA-8672) Ambiguous WriteTimeoutException while completing pending CAS commits

Stefan Podkowinski (JIRA) Wed, 01 Apr 2015 08:02:40 -0700

    [ 
https://issues.apache.org/jira/browse/CASSANDRA-8672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14390745#comment-14390745
 ]


Stefan Podkowinski commented on CASSANDRA-8672:
-----------------------------------------------

You'd have to read serial to get a chance to find out if your cas operation has 
been applied or not. And no, you didn't suggest you absolutely have to do that. 
But for the sake of the discussion, lets assume users will be interested to 
find out if they need to retry a cas operation in case the operation hasn't 
been applied at all. 

> Ambiguous WriteTimeoutException while completing pending CAS commits
> --------------------------------------------------------------------
>
>                 Key: CASSANDRA-8672
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-8672
>             Project: Cassandra
>          Issue Type: Bug
>          Components: Core
>            Reporter: Stefan Podkowinski
>            Assignee: Tyler Hobbs
>            Priority: Minor
>              Labels: CAS
>             Fix For: 3.0
>
>
> Any CAS update has a chance to trigger a pending/stalled commit of any 
> previously agreed on CAS update. After completing the pending commit, the CAS 
> operation will resume to execute the actual update and also possibly create a 
> new commit. See StorageProxy.cas()
> Theres two possbile execution paths that might end up throwing a 
> WriteTimeoutException:
> cas() -> beginAndRepairPaxos() -> commitPaxos()
> cas() -> commitPaxos()
> Unfortunatelly clients catching a WriteTimeoutException won't be able to tell 
> at which stage the commit failed. My guess would be that most developers are 
> not aware that the beginAndRepairPaxos() could also trigger a write and 
> assume that write timeouts would refer to a timeout while writting the actual 
> CAS update. Its therefor not safe to assume that successive CAS or SERIAL 
> read operations will cause a (write-)timeouted CAS operation to get 
> eventually applied. Although some [best-practices 
> advise|http://www.datastax.com/dev/blog/cassandra-error-handling-done-right] 
> claims otherwise.
> At this point the safest bet is possibly to retry the complete business 
> transaction in case of an WriteTimeoutException. However, as theres a chance 
> that the timeout occurred while writing the actual CAS operation, another 
> write could potentially complete it and our CAS condition will get a 
> different result upon retry.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (CASSANDRA-8672) Ambiguous WriteTimeoutException while completing pending CAS commits

Reply via email to