[ 
https://issues.apache.org/jira/browse/HBASE-18573?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16129044#comment-16129044
 ] 

Xiang Li commented on HBASE-18573:
----------------------------------

Thanks [~jerryhe].
I studied the code of ArrayList and find that
* New ArrayList with initial capacity of 1 is better (save space) when a caller 
adds only one cell for a family and then processes the mutation.
* New ArrayList with no initial capacity specified is better when a caller adds 
a lot of cells for a family. 

{panel:title=More details} 
When an ArrayList needs to inflate its backing array(elementData), the 
increment when doing inflation is in proportion to the initial capacity. 
{code:title=ArrayList#grow(int minCapacity)|borderStyle=solid}
int oldCapacity = elementData.length;
int newCapacity = oldCapacity + (oldCapacity >> 1);
if (newCapacity - minCapacity < 0)
   newCapacity = minCapacity;
{code}
The newCapacity is max of
* min capacity required
* current capacity * 1.5

When initial capacity is not specified, default capacity 10 is used, so it is 
more aggressive then having initial capacity to 1 in terms of inflation. That 
is, with no initial capacity specified, an Object array with size of 10 has 
been allocated even you only add 1 cell. If initial capacity is set to 1, the 
array size is 1 when adding the first cell for a family, the array size is 2 
when another cell is added to the same family.
{panel}

> Update Append and Delete to use Mutation#getCellList(family)
> ------------------------------------------------------------
>
>                 Key: HBASE-18573
>                 URL: https://issues.apache.org/jira/browse/HBASE-18573
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Xiang Li
>            Assignee: Xiang Li
>            Priority: Minor
>
> In addxxx() of Put and Increment, Mutation#getCellList(family) is called to 
> get cell list from familyMap. But in the other 2 sub-class of Mutation: 
> Append and Delete, the logic like Mutation#getCellList(family) is used, like
> {code}
>     List<Cell> list = familyMap.get(family);
>     if(list == null) {
>       list = new ArrayList<>(1);
>     }
> {code}
> in
> {code}
> public Delete addColumn(byte [] family, byte [] qualifier, long timestamp)
> {code}
> of Delete
> We could make them to call Mutation#getCellList(family) to get better 
> encapsulation



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to