[
https://issues.apache.org/jira/browse/DRILL-3491?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14723623#comment-14723623
]
Travis Camechis commented on DRILL-3491:
----------------------------------------
I have noticed the same issue. The counts are incorrect from drill but correct
from HBASE. If we do a SELECT(COLUMN_NAME) FROM TABLE, Drill reports the
correct count. We are using HortonWorks HDP 2.3 ( hbase 1.1.x and hadoop 2.7.x
).
> SELECT COUNT(*) FROM HBASE Returns Incorrect Row Count
> ------------------------------------------------------
>
> Key: DRILL-3491
> URL: https://issues.apache.org/jira/browse/DRILL-3491
> Project: Apache Drill
> Issue Type: Bug
> Components: Storage - HBase
> Affects Versions: 1.0.0, 1.1.0
> Environment: CentOS6.5
> jdk1.8.0_45
> hadoop-2.6.0-cdh5.4.2
> hbase-1.0.0-cdh5.4.2
> IntelliJ14.1.4
> Maven3.0.5
> Reporter: Carrot Hu
> Assignee: Aditya Kishore
> Labels: hbase, sql
>
> Create a table 'test' in Hbase with 1 column family, 7 columns.
> Inserting 100,000 rows into 'test' using Java API, each column with same
> value = "value".
> SELECT COUNT(<all>) FROM hbase.test
> returns an incorrect row count.
> Verified using count 'test' in hbase shell, the row count is correct.
> SELECT COUNT(row_key) is correct,
> SELECT COUNT(<Any subset of the columns>) is also correct.
> Clear the table, and changed to inserting 1000 rows, keep the number of
> columns, Drill returns the right count. But when increasing the number of
> columns to 30. SELLECT COUNT(<all>) returns an incorrect row count (only 673).
> Use count 'test' and scan 'test' in hbase to check the result, nothing usual
> were noticed.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)