[ 
https://issues.apache.org/jira/browse/HIVE-16741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aleksey Vovchenko reassigned HIVE-16741:
----------------------------------------


>  Counting number of records in hive and hbase are different for NULL fields 
> in hive
> -----------------------------------------------------------------------------------
>
>                 Key: HIVE-16741
>                 URL: https://issues.apache.org/jira/browse/HIVE-16741
>             Project: Hive
>          Issue Type: Bug
>          Components: Hive
>    Affects Versions: 2.1.0, 1.2.0
>            Reporter: Aleksey Vovchenko
>            Assignee: Aleksey Vovchenko
>
> Steps to reproduce:
> STEP 1.  
> hbase> create 'testTable',{NAME=>'cf'}
> STEP 2.
> put 'testTable','10','cf:Address','My Address 411002'
> put 'testTable','10','cf:contactId','653638'
> put 'testTable','10','cf:currentStatus','Awaiting'
> put 'testTable','10','cf:createdAt','1452815193'
> put 'testTable','10','cf:Id','10'
> put 'testTable','15','cf:contactId','653638'
> put 'testTable','15','cf:currentStatus','Awaiting'
> put 'testTable','15','cf:createdAt','1452815193'
> put 'testTable','15','cf:Id','15'
> (Note: Here Addrees column is not provided.It means that NULL.)
> put 'testTable','20','cf:Address','My Address 411003'
> put 'testTable','20','cf:contactId','653638'
> put 'testTable','20','cf:currentStatus','Awaiting'
> put 'testTable','20','cf:createdAt','1452815193'
> put 'testTable','20','cf:Id','20'
> put 'testTable','17','cf:Address','My Address 411003'
> put 'testTable','17','cf:currentStatus','Awaiting'
> put 'testTable','17','cf:createdAt','1452815193'
> put 'testTable','17','cf:Id','17'
> STEP 3.
> hive> CREATE external TABLE hh_testTable(Id string,Address string,contactId 
> string,currentStatus string,createdAt string) STORED BY 
> 'org.apache.hadoop.hive.hbase.HBaseStorageHandler' WITH SERDEPROPERTIES 
> ("hbase.columns.mapping"=":key,cf:Address,cf:contactId,cf:currentStatus,cf:createdAt")
>  TBLPROPERTIES ("hbase.table.name"="testTable");
> STEP 4.
> hive> select count(*),contactid from hh_testTable group by contactid;
> Actual result:
> OK
> 3     653638
> Expected result:
> OK
> 1     NULL
> 3     653637



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to