[ 
https://issues.apache.org/jira/browse/HBASE-430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12576709#action_12576709
 ] 

Bryan Duxbury commented on HBASE-430:
-------------------------------------

I have a simple implementation for the RowResult class itself already. It'll be 
pretty easy to use from a consumption standpoint (implements Map<Text, Cell>). 
However, if I change HScannerInterface#next from boolean(HStoreKey, 
SortedMap<Text, byte[]>) to RowResult(), there will be far-reaching changes 
throughout the code. This is because all scanners everywhere, including the 
internal ones used in the region server, implement HScannerInterface at some 
point. 

Overall I don't think it's going to be a challenging change, beyond the fact 
that all the mechanics of advancing scanners is pretty hairy.

> Performance: Scanners and getRow return maps with duplicate data
> ----------------------------------------------------------------
>
>                 Key: HBASE-430
>                 URL: https://issues.apache.org/jira/browse/HBASE-430
>             Project: Hadoop HBase
>          Issue Type: Improvement
>            Reporter: Bryan Duxbury
>            Priority: Minor
>
> Right now, whenever we get back multiple cells worth of data at a time, we do 
> so in a map of HStoreKey->byte[]. This means that there is a duplicated Text 
> row and long timestamp at the very least between every cell. This is quite a 
> bit wasted. It also means we have to do a lot of translation every time. 
> We could create a new Writable that contains just one row, one timestamp, and 
> a map of Text->byte[].

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to