[ 
https://issues.apache.org/jira/browse/HIVE-2365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13890259#comment-13890259
 ] 

Nick Dimiduk commented on HIVE-2365:
------------------------------------

After much fighting with input data and ordering, I have my first little 
improvement. I've started a [WIP 
branch|https://github.com/ndimiduk/hive/tree/2365-sql-support-hbase-bulkloads] 
over on GitHub. I will regularly rewrite its history, but if you'd like to 
follow along, I'll take comments as they come. Once things take shape, I'll 
squash into a patch and attach here.

The patch posted supports generating HFiles from a table defined using the 
HBaseStorageHandler. The next improvement is to rewrite the plan to introduce 
a step that invokes LoadIncrementalHFiles. After that, we can drop the need to 
specify hfile.family.path: detect the column family from the mapping attribute 
and write the HFiles to a temporary location before loading.
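
A minimal sketch of the column-family detection described above, not code 
from the branch. It assumes the mapping attribute is the comma-separated 
hbase.columns.mapping string (for example ":key,cf:a,cf:b") and that a bulk 
load targets exactly one column family. The class and method names are 
hypothetical.

```java
import java.util.LinkedHashSet;
import java.util.Set;

// Hypothetical helper, for illustration only; not part of Hive or the WIP branch.
class FamilyFromMapping {

    /**
     * Returns the single column family named in an hbase.columns.mapping
     * value, or throws if the mapping names zero or several families.
     */
    static String detectFamily(String mapping) {
        Set<String> families = new LinkedHashSet<>();
        for (String entry : mapping.split(",")) {
            String spec = entry.trim();
            // Entries that start with ':' (such as ":key") are not column families.
            if (spec.isEmpty() || spec.startsWith(":")) {
                continue;
            }
            // "cf:qualifier" and "cf:" both name the family "cf".
            int colon = spec.indexOf(':');
            families.add(colon < 0 ? spec : spec.substring(0, colon));
        }
        if (families.size() != 1) {
            // Assumption: one family per bulk load, so anything else is an error.
            throw new IllegalArgumentException(
                "expected exactly one column family, found " + families);
        }
        return families.iterator().next();
    }

    public static void main(String[] args) {
        System.out.println(detectFamily(":key,cf:a,cf:b"));
    }
}
```

With a family name in hand, the HFile output directory could be derived under 
a temporary location instead of being supplied through hfile.family.path.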

> SQL support for bulk load into HBase
> ------------------------------------
>
>                 Key: HIVE-2365
>                 URL: https://issues.apache.org/jira/browse/HIVE-2365
>             Project: Hive
>          Issue Type: Improvement
>          Components: HBase Handler
>            Reporter: John Sichi
>            Assignee: Nick Dimiduk
>
> Support the "as simple as this" SQL for bulk load from Hive into HBase.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)
