Keith Turner created ACCUMULO-2175:
--------------------------------------

             Summary: Batch defining tablets in walog
                 Key: ACCUMULO-2175
                 URL: https://issues.apache.org/jira/browse/ACCUMULO-2175
             Project: Accumulo
          Issue Type: Improvement
            Reporter: Keith Turner
             Fix For: 1.7.0


If a batch of mutations comes into a tablet server AND the tablet server just 
got a new walog then it will sync the walog for each tablet.  Below is a sketch 
of what the tablet server currently does.

{code:java}
foreach(Tablet t : tabletsInMutationBatch){
    if(!tabletIsDefinedInWalog(t, currentWalog)){
        defineTablet(currentWalog, t); //syncs walog
        addWalogToMetadataTable(currentWalog, t); //syncronous metadata table 
update
     }
}
{code}

Seems like doing the following would be better.  Then  no matter how many 
undefined tablets there are, only one walog sync would be done.

{code:java}
foreach(Tablet t : tabletsInMutationBatch){
    Set<Tablet> undefined = new HashSet<Tablet>();
    if(!tabletIsDefinedInWalog(t, currentWalog)){
        undefined.add(t);
     }
}

defineTablets(currentWalog, undefined); //syncs walog after writing all 
definitions
addWalogToMetadataTable(currentWalog, undefined); //syncronous metadata table 
batch write
{code}

There is not problem when all tablets in a batch update are defined in the 
walog. In this case a batch update that contains multiple tablet will only sync 
the log once after adding all the mutations from all tablets.

Noticed this while looking into ACCUMULO-2172



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Reply via email to