[ 
https://issues.apache.org/jira/browse/HIVE-6147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13887238#comment-13887238
 ] 

Xuefu Zhang commented on HIVE-6147:
-----------------------------------

This looks good, but from the patch, it seems that the solution is only for 
HBase. I wonder if we have given thoughts on the idea of generalizing the 
problem and providing a general solution. I can see the benefits of separating 
the storage (such as hbase) and data format (avro, thrift, protocol buf, 
parquet, etc).  Then we solve M + N problems rather than M * N problems. What 
if the avro data is coming from other storage, such as accumulo, or parquet 
data from HBase.

> Support avro data stored in HBase columns
> -----------------------------------------
>
>                 Key: HIVE-6147
>                 URL: https://issues.apache.org/jira/browse/HIVE-6147
>             Project: Hive
>          Issue Type: Bug
>          Components: HBase Handler
>    Affects Versions: 0.12.0
>            Reporter: Swarnim Kulkarni
>            Assignee: Swarnim Kulkarni
>         Attachments: HIVE-6147.1.patch.txt
>
>
> Presently, the HBase Hive integration supports querying only primitive data 
> types in columns. It would be nice to be able to store and query Avro objects 
> in HBase columns by making them visible as structs to Hive. This will allow 
> Hive to perform ad hoc analysis of HBase data which can be deeply structured.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Reply via email to