Hi *!

I think everybody who working with the real BigData know – performance is
very important.

Unfortunaly our lovely HBase slower then Cassandra approximately in 2 times
when reading huge amount of data.


For example – this is Cassandra the performance test run from 2 hosts
(client side)

Host1 - Throughput(ops/sec), 231 021

Host2 - Throughput(ops/sec), 224 691



Summary ~450 000.

HBase shows in the same conditions only 210 000.



Maybe this is one of the reason why Cassandra is more popular (see
https://db-engines.com/en/ranking/wide+column+store)

I’ve done an improvment which can make HBase faster up 2-3 times (it
depends of many reasons, and sometimes even faster).

With the improvement HBase speed up to 430 000 ops/sec.

See the picture in attachment.



If you interested to get this improvement in release you can help to
attract some developers attention here -
https://issues.apache.org/jira/browse/HBASE-23887

Put some line there with your opinion and vote if you think it could be
useful for your work.

I believe discussion about this approach can make HBase more useful and
popular.



Thanks for attention)

With the best regards,

Pustota

Reply via email to