[
https://issues.apache.org/jira/browse/HIVE-4372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13640273#comment-13640273
]
Yashaswy Andavilli commented on HIVE-4372:
--
This is the create command I am using to create the HBase-integrated table:
CREATE EXTERNAL TABLE CALLCOUNTBYGENDER(Id STRING, Hour STRING, Gender String,
Count String)STORED BY 'org.apache.hadoop.hive.hbase.HBaseStorageHandler'
WITH SERDEPROPERTIES (hbase.columns.mapping =
:key,cf1:Hour,cf1:Gender,cf1:Count)
TBLPROPERTIES (hbase.table.name = hbase_cdr_summary_callcountbygender);
I am using the following hive statement to populate the above table:
INSERT OVERWRITE TABLE CALLCOUNTBYGENDER
SELECT
concat(tod.hour,sd.SubscriberAgeGroup),tod.hour,sd.SubscriberAgeGroup,count(*)
FROM FACT f JOIN TimeofDayD tod on f.timeofdaykey=tod.timeofdaykey
JOIN SubscriberDemographicsD sd on
f.SubscriberDemographicsKey=sd.SubscriberDemographicsKey
GROUP BY tod.hour,sd.SubscriberAgeGroup;
The table is getting populated correctly when I am viewing it in HBase, the
third column 'Gender' is getting messed up. It is supposed to contain only
male/female values but the Hour column values are also getting populated under
Gender column. After repopulating the table again and again, it finally got
populated correctly. I'm not sure what kind of a bug it is.
When trying to populate an external HBase table using a hive query involving
joins, the data is incorrectly getting mixed up inside the rows.
--
Key: HIVE-4372
URL: https://issues.apache.org/jira/browse/HIVE-4372
Project: Hive
Issue Type: Bug
Components: HBase Handler
Affects Versions: 0.10.0
Reporter: Yashaswy Andavilli
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira