Hello, We are using hadoop + hbase (0.20.1) for tests now. Machines we are testing on have following configuration: Vmware 4 core intel xeon, 2.27GHz Two hbase nodes (one master and one regionserver), 6GB RAM per each.
Table has following definition: 12-byte string as Row Column family: C1 and 3 qualifiers: q1, q2, q3 (about 200 bytes per record) Column family: C2 and 2 qualifiers q1, q2 (about 2-4KB per record) I've implemented simple java utility which parses our data source and inserts results into hbase (write buffer is 12MB, autoflush off). We got following results: ~450K records ~= 4GB of data. Total time of insertion is about 600-650 seconds or ~7 MB/second or 675 rows per second, or 2ms per row. So the question is: is this time ok for such hardware or did I miss something important? Thank you. Regards, Dmitriy.
