Hi all,
I have run into various issues with big tables while experimenting over
the last couple of weeks.
My impression so far is that HBase (+ Phoenix) only works well when
there is a fairly powerful cluster, say when half of the data fits into
the combined memory of the servers and the disks are fast (SSDs?). It
does not seem to cope once tables are twice as large as the memory
allocated to the region servers (frankly, I think the threshold is even
lower).
Things that constantly fail:
- non-trivial queries on large tables (group by, counts, joins): the
region servers hit out-of-memory errors or crash for no apparent reason
with -Xmx set to 4G or 8G (an example of the kind of query is sketched
after this list)
- index creation on the same big tables: it always fails, I think
around the point where HBase has to flush its memstores to disk, and I
couldn't find a solution (also sketched below)
- Spark jobs fail unless they are throttled to feed HBase only as much
data as it can take. Is there no backpressure? (My write path is
sketched below as well.)
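For reference, the statements that fail are nothing exotic. Roughly
(table and column names here are made up for illustration, not my real
schema):

    -- the kind of aggregate query that takes a region server down
    SELECT category, COUNT(*), SUM(amount)
    FROM BIG_TABLE
    GROUP BY category;

    -- the kind of index creation that dies around memstore flush time
    CREATE INDEX idx_big_category ON BIG_TABLE (category) INCLUDE (amount);

If I read the Phoenix docs right, large indexes are supposed to be
built asynchronously with the ASYNC keyword plus the IndexTool
MapReduce job, i.e. something like:

    CREATE INDEX idx_big_category ON BIG_TABLE (category) INCLUDE (amount) ASYNC;
    -- then, from the shell:
    --   hbase org.apache.phoenix.mapreduce.index.IndexTool \
    --     --data-table BIG_TABLE --index-table IDX_BIG_CATEGORY \
    --     --output-path /tmp/idx_big_category

Is that the expected route for tables of this size, or should the plain
CREATE INDEX work too?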
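On the Spark side, the write path is roughly the following (PySpark
with the phoenix-spark connector; the table name, zkUrl, paths and
partition count are placeholders). The only crude throttle I have is
repartitioning the DataFrame down before the save:

    from pyspark.sql import SparkSession

    # placeholder app name and paths, just to show the shape of the job
    spark = SparkSession.builder.appName("load-big-table").getOrCreate()
    df = spark.read.parquet("hdfs:///staging/big_table")

    (df.repartition(16)                  # fewer partitions = fewer concurrent writers
       .write
       .format("org.apache.phoenix.spark")
       .mode("overwrite")                # the phoenix-spark connector expects overwrite; it upserts
       .option("table", "BIG_TABLE")
       .option("zkUrl", "zk1:2181")
       .save())

Without throttling of some kind, the job eventually fails as described
above.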
There were no replies to my earlier emails about these issues, which
makes me think there are no solutions (or the solutions are hard to
find and not many people know them).
So, after 21 tweaks to the default config, I am still not able to
operate it like a normal database.
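For reference, these are roughly the kinds of knobs I have been turning
(a sketch, not my exact values and certainly not a recommendation):

    # hbase-env.sh
    export HBASE_HEAPSIZE=8192                      # heap for the HBase daemons (MB on older releases)

    # hbase-site.xml (written as key = value for brevity)
    hbase.regionserver.global.memstore.size = 0.4   # share of heap for memstores
    hfile.block.cache.size = 0.3                    # share of heap for the block cache
    hbase.hregion.memstore.flush.size = 134217728   # 128 MB flush threshold
    hbase.rpc.timeout = 600000                      # long Phoenix scans time out otherwise
    phoenix.query.timeoutMs = 600000
    phoenix.query.maxGlobalMemoryPercentage = 15    # Phoenix server-side memory manager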
Should I conclude that my config is all wrong, or that HBase + Phoenix
only works when the cluster is powerful enough relative to the data?
I believe it is a great project and the functionality is really useful.
What's lacking is something like three sample configs for clusters of
three different sizes.
Thanks