[ 
https://issues.apache.org/jira/browse/HBASE-5625?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tudor Scurtu updated HBASE-5625:
--------------------------------

    Attachment: 5625v3.txt

@Zhihong:
I initially submitted a minimal version so that I wouldn't have to make 
extensive modifications after reviews. The method is now called from a unit 
test class.

I copied the variable names from existing methods to keep the naming scheme 
uniform. I have renamed the local variables, but not the method parameters.

The same applies to the method comment.

Addressed the code style comments.

@stack:
Addressed the code style comments.

Added a check for the buffer size.

Created a 'KeyValue.checkParameters()' method. Should 'createEmptyByteArray()' 
call it as well?

'containsNonEmptyColumn()' checks that the value exists and is not empty; 
'containsEmptyColumn()' checks that the value exists and is empty. With only 
one of them, the other case could be decided only by actually reading the 
value and checking it.
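
To illustrate the distinction, here is a minimal sketch. It is not the patch 
code: the class 'ColumnPresenceSketch' and its 'put()' method are hypothetical 
stand-ins for a 'Result', and it assumes that the value length of each column 
is known without touching the value bytes.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical stand-in for a Result; not HBase code. */
final class ColumnPresenceSketch {
    // qualifier -> value length in bytes; the value bytes are never stored or read
    private final Map<String, Integer> valueLengths = new HashMap<>();

    /** Records a column. Only the length of the value is kept. */
    void put(String qualifier, byte[] value) {
        valueLengths.put(qualifier, value.length);
    }

    /** True if the column exists and its value has at least one byte. */
    boolean containsNonEmptyColumn(String qualifier) {
        Integer len = valueLengths.get(qualifier);
        return len != null && len > 0;
    }

    /** True if the column exists and its value is zero-length. */
    boolean containsEmptyColumn(String qualifier) {
        Integer len = valueLengths.get(qualifier);
        return len != null && len == 0;
    }
}
```

For a missing column both methods return false, which is why neither one can 
be derived from the other by negation.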

Moved most of the 'loadValue()' functionality to 'KeyValue'. This raises a 
question: how do we elegantly handle the case where the buffer (provided from 
'Result') isn't big enough?
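
One possible answer, sketched below, is to check the remaining capacity first 
and report failure without modifying the buffer, so that the caller can retry 
with a larger one. This is only an assumption about how it could be handled, 
not what the patch does; the class 'LoadValueSketch' and the signature of its 
'loadValue()' are hypothetical.

```java
import java.nio.ByteBuffer;

/** Hypothetical helper; not the 'loadValue()' from the patch. */
final class LoadValueSketch {
    /**
     * Copies len bytes starting at src[off] into dst.
     * Returns false and leaves dst untouched if dst has too little room.
     */
    static boolean loadValue(byte[] src, int off, int len, ByteBuffer dst) {
        if (dst.remaining() < len) {
            return false; // caller decides: grow the buffer, or fall back to allocating
        }
        dst.put(src, off, len); // bulk copy; advances dst.position() by len
        return true;
    }
}
```

The alternative would be to let 'ByteBuffer.put()' throw 
'BufferOverflowException', which is simpler but makes the undersized case an 
exceptional path.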

Refactored 'Result.getSearchTerm()' as another 'KeyValue.createFirstOnRow()'.

The new 'binarySearch()' method avoids allocating a byte array.
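
The general idea can be sketched as follows: compare each sorted key against 
a region (array, offset, length) of an existing buffer instead of first 
copying that region into a new array. The class 'RangeBinarySearch' and its 
signatures are hypothetical and simplified; the method in the patch works on 
'KeyValue' objects.

```java
/** Hypothetical sketch; not the 'binarySearch()' from the patch. */
final class RangeBinarySearch {
    /** Lexicographic comparison of two byte ranges, treating bytes as unsigned. */
    static int compare(byte[] a, int aOff, int aLen, byte[] b, int bOff, int bLen) {
        int n = Math.min(aLen, bLen);
        for (int i = 0; i < n; i++) {
            int x = a[aOff + i] & 0xff;
            int y = b[bOff + i] & 0xff;
            if (x != y) {
                return x - y;
            }
        }
        return aLen - bLen; // equal prefix: the shorter range sorts first
    }

    /**
     * Searches sortedKeys for the range key[off, off + len).
     * Returns the index if found, otherwise -(insertionPoint + 1),
     * following the convention of java.util.Arrays.binarySearch.
     * No array is allocated during the search.
     */
    static int binarySearch(byte[][] sortedKeys, byte[] key, int off, int len) {
        int lo = 0;
        int hi = sortedKeys.length - 1;
        while (lo <= hi) {
            int mid = (lo + hi) >>> 1;
            byte[] candidate = sortedKeys[mid];
            int c = compare(candidate, 0, candidate.length, key, off, len);
            if (c < 0) {
                lo = mid + 1;
            } else if (c > 0) {
                hi = mid - 1;
            } else {
                return mid;
            }
        }
        return -(lo + 1);
    }
}
```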

We run incremental jobs that update values; we also have to read different 
values from the same row in different places.
                
> Avoid byte buffer allocations when reading a value from a Result object
> -----------------------------------------------------------------------
>
>                 Key: HBASE-5625
>                 URL: https://issues.apache.org/jira/browse/HBASE-5625
>             Project: HBase
>          Issue Type: Improvement
>          Components: client
>    Affects Versions: 0.92.1
>            Reporter: Tudor Scurtu
>              Labels: patch
>         Attachments: 5625.txt, 5625v2.txt, 5625v3.txt
>
>
> When calling Result.getValue(), an extra dummy KeyValue and its associated 
> underlying byte array are allocated, as well as a persistent buffer that will 
> contain the returned value.
> These can be avoided by reusing a static array for the dummy object and by 
> passing a ByteBuffer object as a value destination buffer to the read method.
> The current functionality is maintained, and we have added a separate method 
> call stack that employs the described changes. I will provide more details 
> with the patch.
> In tests run under a profiler, read time appears to drop by up to 40%.


        
