[ 
https://issues.apache.org/jira/browse/DRILL-3491?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14723623#comment-14723623
 ] 

Travis Camechis commented on DRILL-3491:
----------------------------------------

I have noticed the same issue. The counts returned by Drill are incorrect, but 
the counts from HBase are correct. If we run SELECT COUNT(COLUMN_NAME) FROM 
TABLE, Drill reports the correct count. We are using Hortonworks HDP 2.3 
(HBase 1.1.x and Hadoop 2.7.x).

> SELECT COUNT(*) FROM HBASE Returns Incorrect Row Count
> ------------------------------------------------------
>
>                 Key: DRILL-3491
>                 URL: https://issues.apache.org/jira/browse/DRILL-3491
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Storage - HBase
>    Affects Versions: 1.0.0, 1.1.0
>         Environment: CentOS6.5
> jdk1.8.0_45
> hadoop-2.6.0-cdh5.4.2
> hbase-1.0.0-cdh5.4.2
> IntelliJ14.1.4
> Maven3.0.5
>            Reporter: Carrot Hu
>            Assignee: Aditya Kishore
>              Labels: hbase, sql
>
> Create a table 'test' in HBase with 1 column family and 7 columns.
> Insert 100,000 rows into 'test' using the Java API, with every column set to 
> the same value, "value".
> SELECT COUNT(<all>) FROM hbase.test
> returns an incorrect row count.
> Verified using count 'test' in hbase shell, the row count is correct.
> SELECT COUNT(row_key) is correct,
> SELECT COUNT(<Any subset of the columns>) is also correct.
> After clearing the table and inserting only 1000 rows with the same number of 
> columns, Drill returns the right count. But after increasing the number of 
> columns to 30, SELECT COUNT(<all>) returns an incorrect row count (only 673).
> Checking the table with count 'test' and scan 'test' in the hbase shell showed 
> nothing unusual.
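
The comparison described above can be sketched as a small Java helper that 
builds the Drill queries to run side by side: COUNT(*), COUNT(row_key), and one 
COUNT per column. This is only a sketch under assumptions. The report does not 
name the column family or the columns, so the family "f" and the qualifiers 
"c1" and "c2" are hypothetical placeholders, and the class and method names are 
made up for illustration. The helper only produces SQL text; it does not 
connect to Drill or HBase.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Builds the Drill count queries whose results the report compares. */
public class CountQueries {

    /**
     * Returns COUNT(*), COUNT(row_key), and one COUNT per column qualifier.
     * The table, family, and qualifier names are supplied by the caller;
     * the values used in main() are hypothetical, not taken from the report.
     */
    public static List<String> build(String table, String family, List<String> qualifiers) {
        List<String> queries = new ArrayList<>();
        // Alias the table as "t" so HBase columns can be referenced as t.family.qualifier.
        String from = " FROM hbase.`" + table + "` t";
        queries.add("SELECT COUNT(*)" + from);
        queries.add("SELECT COUNT(t.row_key)" + from);
        for (String qualifier : qualifiers) {
            queries.add("SELECT COUNT(t.`" + family + "`.`" + qualifier + "`)" + from);
        }
        return queries;
    }

    public static void main(String[] args) {
        // Print each query so it can be pasted into a Drill shell.
        for (String sql : build("test", "f", Arrays.asList("c1", "c2"))) {
            System.out.println(sql + ";");
        }
    }
}
```

Running each printed query in Drill and comparing the results with count 
'test' in the hbase shell shows which forms of COUNT disagree with HBase.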



