[ https://issues.apache.org/jira/browse/HIVE-705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12836323#action_12836323 ]

Jonathan Ellis commented on HIVE-705:
-------------------------------------

ISTM that merging the HBase column families into a single Hive table is the 
wrong approach and could lead to poor performance; rather, each HBase CF should 
be its own Hive table, which may of course be joined with others as necessary.  
(I think using the word "table" for HBase's "collection of CFs" is unfortunate 
in the first place since they are different animals; fundamentally, the basic 
unit of data access in HBase is the CF.)

I'm interested because Cassandra is also looking at adding Hive support, and we 
also implement a ColumnFamily data model.

> Let Hive can analyse hbase's tables
> -----------------------------------
>
>                 Key: HIVE-705
>                 URL: https://issues.apache.org/jira/browse/HIVE-705
>             Project: Hadoop Hive
>          Issue Type: New Feature
>            Reporter: Samuel Guo
>            Assignee: John Sichi
>         Attachments: hbase-0.19.3-test.jar, hbase-0.19.3.jar, 
> HIVE-705.1.patch, HIVE-705_draft.patch, HIVE-705_revision806905.patch, 
> HIVE-705_revision883033.patch
>
>
> Add a serde over the hbase's tables, so that hive can analyse the data stored 
> in hbase easily.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
