[
https://issues.apache.org/jira/browse/ACCUMULO-2931?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Josh Elser updated ACCUMULO-2931:
---------------------------------
Assignee: (was: Josh Elser)
> Ensure correct ordering of updates to a tablet across different WALs
> --------------------------------------------------------------------
>
> Key: ACCUMULO-2931
> URL: https://issues.apache.org/jira/browse/ACCUMULO-2931
> Project: Accumulo
> Issue Type: Bug
> Components: replication
> Reporter: Josh Elser
> Priority: Minor
>
> I was talking to [~enis] today about common replication problems across HBase
> and Accumulo and he was telling me about the following:
> A tablet is hosted by tserver1 using WAL1. That tablet moves to a different
> tserver for whatever reason (tserver1 failed, the balancer, etc.) and starts
> getting used by tserver2 with WAL2. If WAL2 is replicated before WAL1, the
> peer receives that tablet's updates out of order.
> In the simple case of replicating to another Accumulo instance with servers
> running NTP, this shouldn't be a big concern because the timestamps assigned
> to the updates will ensure a consistent final view. However, the intermediate
> view is incorrect. We can do a better job of ensuring that we replicate the
> data in the correct order.
> We already know, from the metadata table, the WALs that are used by a tablet
> and the time at which that tablet began using each one (recorded by the
> TabletServer before any updates hit that Tablet). We can use these records,
> in addition to the timestamp on the {{log}} column entries, to determine the
> correct ordering for this Tablet with respect to all WALs. All the
> information is present so that the Master can assign the replication work in
> the correct order.
> Some extra bookkeeping would also be required, either to keep that {{log}}
> column around longer than the minor compaction or recovery, or to record
> some additional piece of replication metadata that the master can read from
> the replication table.
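
A minimal sketch of the ordering idea described above, assuming the per-tablet
WAL records have already been read into memory. The WalUse and WalOrdering
classes, their fields, and the replicationOrder method are hypothetical names
used for illustration only; they are not part of Accumulo's API, and the
actual metadata schema is not modeled here.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

/** Hypothetical record: a tablet began using a WAL at a given time. */
final class WalUse {
    final String tablet;
    final String wal;
    final long startedMillis;

    WalUse(String tablet, String wal, long startedMillis) {
        this.tablet = tablet;
        this.wal = wal;
        this.startedMillis = startedMillis;
    }
}

/** Hypothetical helper that orders a tablet's WALs for replication. */
final class WalOrdering {
    /** Returns the WALs used by the given tablet, oldest start time first. */
    static List<String> replicationOrder(String tablet, List<WalUse> uses) {
        // Keep only the records that belong to this tablet.
        List<WalUse> mine = new ArrayList<>();
        for (WalUse u : uses) {
            if (u.tablet.equals(tablet)) {
                mine.add(u);
            }
        }
        // Sort by the time the tablet began using each WAL.
        mine.sort(Comparator.comparingLong((WalUse u) -> u.startedMillis));

        List<String> order = new ArrayList<>();
        for (WalUse u : mine) {
            order.add(u.wal);
        }
        return order;
    }
}
```

With records for WAL1 (started at 100) and WAL2 (started at 200) on the same
tablet, replicationOrder returns WAL1 before WAL2 regardless of the order in
which the records were read, so work for WAL1 would be assigned first.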