get(row, column) is more efficient than get(row) because get(row) must access multiple HStores and do multiple reads while get(row, column) only accesses one HStore.
--- Jim Kellerman, Senior Engineer; Powerset > -----Original Message----- > From: Daniel [mailto:[EMAIL PROTECTED] > Sent: Tuesday, July 15, 2008 5:54 AM > To: [email protected] > Subject: performance on getRow and get > > hi all, > i'm writting a program to access my hbase table in a MR > job. my first version is to get different values from > get(row,column name), and now im changing to get one row each > time into a map, and query that map instead - for one reduce job. > i think it would be better to access hbase only once per > one reduce function, but it seems like the latter version > takes a longer time to finish > > during the reduce job. does this mean get(row, column name) > is less expensive than get(row) ? > thanks. > > Daniel > > No virus found in this incoming message. > Checked by AVG - http://www.avg.com > Version: 8.0.138 / Virus Database: 270.4.10/1551 - Release > Date: 7/14/2008 6:49 AM > No virus found in this outgoing message. Checked by AVG - http://www.avg.com Version: 8.0.138 / Virus Database: 270.4.10/1551 - Release Date: 7/14/2008 6:49 AM
