Hi y'all,
Ok, I put about 2.5 million rows into HBase that is running on three machines
(2 region servers and 1 name node, etc.). The row id is the date plus a number
that increments (e.g. '20101201|0000001'). From a Java client I do a scan with the
starting row and ending row for one day's logs (the last 627k rows in HBase).
Right now the scan takes about 9 seconds to process those 627k
rows. For commodity servers, is that about normal? Also, where can I learn how to
optimize this process?
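
For concreteness, here is a minimal sketch of how the start and stop rows for
one day can be derived, and what the numbers above work out to. It assumes the
keys are laid out as yyyyMMdd|counter as in the example key, and that the stop
row of the scan is exclusive. The names ScanRange, startRow, stopRow and
rowsPerSecond are made up for illustration and are not part of HBase; the code
only builds the key strings and does the arithmetic, it does not talk to a
cluster.

```java
import java.time.LocalDate;
import java.time.format.DateTimeFormatter;

public class ScanRange {
    // Assumed key layout (from the example key): yyyyMMdd + '|' + counter.
    private static final DateTimeFormatter DAY = DateTimeFormatter.BASIC_ISO_DATE;

    /** Start row: the day prefix plus separator, which sorts before every key of that day. */
    public static String startRow(LocalDate day) {
        return day.format(DAY) + "|";
    }

    /** Stop row (assumed exclusive): the prefix of the following day. */
    public static String stopRow(LocalDate day) {
        return day.plusDays(1).format(DAY) + "|";
    }

    /** Average throughput in rows per second. */
    public static double rowsPerSecond(long rows, double seconds) {
        return rows / seconds;
    }

    public static void main(String[] args) {
        LocalDate day = LocalDate.of(2010, 12, 1);
        System.out.println("start row: " + startRow(day));
        System.out.println("stop row:  " + stopRow(day));
        // Figures from this post: 627k rows in about 9 seconds.
        System.out.println("rows/s:    " + rowsPerSecond(627_000, 9.0));
    }
}
```

With the figures above, that comes to a little under 70k rows per second.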
Thanks again.
-Pete