Re: On the number of column families

2018-07-13 Thread Andrew Purtell
I think flushes are still done by region in all versions, so this can lead to a lot of file IO depending on how well compaction can keep up. The CF is the unit of IO scheduling granularity. For a single-row query that does not restrict itself to a subset of CFs, each CF adds IO demand with

[ANNOUNCE] Apache Phoenix 5.0.0 released

2018-07-13 Thread rajeshb...@apache.org
The Apache Phoenix team is pleased to announce the release of its next major version, 5.0.0, compatible with HBase 2.0+. Apache Phoenix enables SQL-based OLTP and operational analytics for Apache Hadoop, using Apache HBase as its backing store and providing integration with other projects in the Apache

Re: issue while reading data from hbase

2018-07-13 Thread Ted Yu
Putting dev@ to bcc. Which hbase-spark connector are you using? What's the HBase release in your deployment? "some of the columns in dataframe becomes null": Is it possible to characterize what type of columns become null? Earlier you said one column has XML data. Did you mean this column

Re: Region state is PENDING_CLOSE persists.

2018-07-13 Thread Kang Minwoo
Thank you for the reply. I found that one handler thread is in state RUNNABLE. The other handler threads are in state TIMED_WAITING. I think the RUNNABLE handler thread is the issue. [Thread dump] "RpcServer" java.lang.Thread.State: RUNNABLE at