Just a random thought: what about keeping a per-region row count? Then, if you needed a row count for a table, you'd just have to query each region once and sum the results. It seems like it wouldn't be too expensive, because each region would only need a row counter variable. It may be more complicated than I'm making it out to be, though...
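
To make the idea concrete, here is a minimal sketch in plain Java. It does not use any HBase API: the Region class, its in-memory key set, and tableRowCount are all hypothetical stand-ins invented for illustration. It also hints at where the complication may lie: the counter is only correct if a put can tell whether the row key is new, which in this toy version is a set lookup but in a real store would probably mean a read before every write.

```java
import java.util.Arrays;
import java.util.List;
import java.util.TreeSet;

public class RegionRowCounts {

    // Hypothetical stand-in for a region; NOT HBase's actual region class.
    static class Region {
        // In-memory set of row keys, standing in for the region's stored rows.
        private final TreeSet<String> rowKeys = new TreeSet<>();
        private long rowCount = 0;

        // Increment only when the key is new; a second put to the same
        // row must not change the count.
        void put(String rowKey) {
            if (rowKeys.add(rowKey)) {
                rowCount++;
            }
        }

        // Decrement only when the row actually existed.
        void delete(String rowKey) {
            if (rowKeys.remove(rowKey)) {
                rowCount--;
            }
        }

        long getRowCount() {
            return rowCount;
        }
    }

    // Table row count = one query per region, then a sum.
    static long tableRowCount(List<Region> regions) {
        long total = 0;
        for (Region region : regions) {
            total += region.getRowCount();
        }
        return total;
    }

    public static void main(String[] args) {
        Region a = new Region();
        Region b = new Region();
        a.put("row1");
        a.put("row1"); // same row again, count stays at 1
        a.put("row2");
        b.put("row9");
        b.delete("row9");
        b.put("row8");
        System.out.println(tableRowCount(Arrays.asList(a, b))); // prints 3
    }
}
```

Summing the counters is the cheap part. Keeping each counter accurate across repeated puts and deletes is the part that could get expensive.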

~Jeff

On 3/16/2011 2:40 PM, Stack wrote:
On Wed, Mar 16, 2011 at 1:35 PM, Vivek Krishna<[email protected]>  wrote:
1.  How do I count rows fast in hbase?

First I tried count 'test', which takes ages.

Saw that I could use RowCounter, but looks like it is deprecated.

It is not.  Make sure you are using the one from mapreduce package as
opposed to mapred package.


I just need to verify the total counts.  Is it possible to see somewhere in
the web interface or ganglia or by any other means?

We don't keep a current count on a table.  Too expensive.  Run the
rowcounter MR job.  This page may be of help:
http://hbase.apache.org/apidocs/org/apache/hadoop/hbase/mapreduce/package-summary.html#package_description

Good luck,
St.Ack

--
Jeff Whiting
Qualtrics Senior Software Engineer
[email protected]
