Hi,

I have large HBase tables (>10 billion rows). RowIDs are salted with byte 
prefix and the table splitted into 64…256 regions.

1. When I run CREATE TABLE “table” (…) SALT_BUCKETS=256, Phoenix adds “0:_0” 
key to each row that add huge update load to the HBase cluster and eventually 
HBase clients timeout.
The HBase cluster is tuned for other tasks and I don’t want to change it just 
to “create” Phoenix table.
I tried "CREATE VIEW” but it doesn’t support “SALTED_BUCKETS” option.

2. Running CREATE INDEX for existing large tables also overload HBase cluster.

Is there a way to run CREATE TABLE and CREATE INDEX as a mapreduce job on YARN?
Is 10bil tables fit to Phoenix use cases?

Any thoughts on this?

Thanks,
Alex

Reply via email to