I think flushes are still done by region in all versions, so this can lead
to a lot of file IO depending on how well compaction can keep up. The CF is
the unit of IO scheduling granularity. For a single row query where you
don't select only a subset of CFs, then each CF adds IO demand with
The Apache Phoenix team is pleased to announce release of it's next major
version 5.0.0
compatible with HBase 2.0+. Apache Phoenix enables SQL-based OLTP and
operational
analytics for Apache Hadoop using Apache HBase as its backing store and
providing
integration with other projects in the Apache
Putting dev@ to bcc.
Which hbase-spark connector are you using ?
What's the hbase release in your deployment ?
bq. some of the columns in dataframe becomes null
Is it possible to characterize what type of columns become null ? Earlier
you said one column has xml data. Did you mean this column
Thank you for reply.
I found one handler thread state is RUNNABLE.
Other handler thread state is TIMED_WAITING.
I think the RUNNABLE handler thread is issue.
[Thread dump]
"RpcServer"
java.lang.Thread.State: RUNNABLE
at