[
https://issues.apache.org/jira/browse/KUDU-568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Mike Percy updated KUDU-568:
----------------------------
Parent: KUDU-560
> Exactly-once semantics on writes
> --------------------------------
>
> Key: KUDU-568
> URL: https://issues.apache.org/jira/browse/KUDU-568
> Project: Kudu
> Issue Type: Sub-task
> Components: consensus
> Affects Versions: M4.5
> Reporter: David Alves
> Priority: Critical
>
> When failing over from a leader to the next one, the new leader might elect
> to 1) commit or 2) overwrite incomplete operations that were sent to the
> first leader.
> If the client, gets a timeout, or tries the new leader before getting a
> response from the old leader, it retries the writes, and if the leader chose
> option 1), commit, then it will get back an Status::AlreadyPresent().
> We should implement exactly-once semantics to mask this behavior, i.e. when
> the client tries the new leader it should just get back that the writes were
> successful, independently of when they were applied.
> A common strategy to do this is to have a a replay cache. Each write has a
> client id and a write sequence number, which we store along with the
> WriteRequestPB. When a new leader is promoted it keeps in memory, for a
> period, the ids and sequence numbers of client writes. When a client is then
> submitting a duplicated write it just replies immediately.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)