get(row, column) is more efficient than get(row) because get(row) must access 
multiple HStores and do multiple reads while get(row, column) only accesses one 
HStore.

---
Jim Kellerman, Senior Engineer; Powerset


> -----Original Message-----
> From: Daniel [mailto:[EMAIL PROTECTED]
> Sent: Tuesday, July 15, 2008 5:54 AM
> To: [email protected]
> Subject: performance on getRow and get
>
> hi all,
>    i'm writting a program to access my hbase table in a MR
> job. my first version is to get different values from
> get(row,column name), and now im changing to get one row each
> time into a map, and query that map instead - for one reduce job.
>    i think it would be better to access hbase only once per
> one reduce function, but it seems like the latter version
> takes a longer time to finish
>
> during the reduce job. does this mean get(row, column name)
> is less expensive than get(row) ?
>   thanks.
>
> Daniel
>
> No virus found in this incoming message.
> Checked by AVG - http://www.avg.com
> Version: 8.0.138 / Virus Database: 270.4.10/1551 - Release
> Date: 7/14/2008 6:49 AM
>
No virus found in this outgoing message.
Checked by AVG - http://www.avg.com
Version: 8.0.138 / Virus Database: 270.4.10/1551 - Release Date: 7/14/2008 6:49 
AM

Reply via email to