@phoenix.apache.org
Subject: Re: Phoenix-Spark: Number of partitions in PhoenixRDD
Hi Diego,
The phoenix-spark RDD partition count is equal to the number of splits that
the query planner returns. Adjusting the HBase region splits, table salting
[1], as well as the guidepost width [2] should help with the
parallelization here.
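
As a back-of-the-envelope illustration of the guidepost point, here is a minimal Python sketch. It assumes that the planner yields roughly one split per guidepost-width chunk of data, which is a simplification, and the 600 GiB table size and the candidate widths are hypothetical numbers, not taken from this thread. The helper names are made up for the example and are not part of Phoenix.

```python
# Rough estimate only: assumes about one split per guidepost-width chunk of data.
# All sizes below are hypothetical and exist only to show the arithmetic.

MIB = 1024 ** 2
GIB = 1024 ** 3


def estimated_splits(table_bytes: int, guidepost_width_bytes: int) -> int:
    """Estimate the split count as ceil(table size / guidepost width)."""
    if table_bytes < 0 or guidepost_width_bytes <= 0:
        raise ValueError("sizes must be non-negative and the width positive")
    # Integer ceiling division avoids floating-point rounding on large sizes.
    return max(1, -(-table_bytes // guidepost_width_bytes))


def width_for_target_splits(table_bytes: int, target_splits: int) -> int:
    """Guidepost width (bytes) that would give roughly target_splits splits."""
    if target_splits <= 0:
        raise ValueError("target_splits must be positive")
    return max(1, table_bytes // target_splits)


if __name__ == "__main__":
    table_bytes = 600 * GIB  # hypothetical table size
    for width_mib in (300, 100, 50):
        splits = estimated_splits(table_bytes, width_mib * MIB)
        print(f"guidepost width {width_mib} MiB -> about {splits} splits")
```

Under that assumption, halving the guidepost width roughly doubles the number of splits, so each RDD partition holds about half as much data. The actual count should still be checked against the query plan.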
Using 'EXPLAIN' for the generated query in sqlline
Hi all,
I'm working with the Phoenix Spark plugin to process a HUGE table. The table is
salted into 100 buckets and is split into 400 regions. When I read it with
phoenixTableAsRDD, I get an RDD with 150 partitions. These partitions are so
big that I am getting OutOfMemory problems.