Nick Dimiduk created HIVE-7572:
----------------------------------

             Summary: Enable LOAD DATA into StorageHandler tables
                 Key: HIVE-7572
                 URL: https://issues.apache.org/jira/browse/HIVE-7572
             Project: Hive
          Issue Type: Improvement
          Components: StorageHandler
            Reporter: Nick Dimiduk


One annoyance when working with HBaseStorageHandler is that it cannot load 
local data directly. Populating an HBase table from local test data, for 
instance, is a multi-step process:

{noformat}
# create a hive table you HAVE to populate
> CREATE TABLE src(key int, value string);
# populate the intermediate hive table
> LOAD DATA LOCAL INPATH '/path/to/hive/data/files/kv1.txt' OVERWRITE INTO TABLE src;
# create the hbase table you WANT to populate
> CREATE TABLE hbase_src(key INT, value STRING)
  STORED BY 'org.apache.hadoop.hive.hbase.HBaseStorageHandler'
  WITH SERDEPROPERTIES ('hbase.columns.mapping' = ':key,cf:val')
  TBLPROPERTIES ('hbase.table.name' = 'hbase_src');
# copy data into hbase
> INSERT OVERWRITE TABLE hbase_src SELECT * FROM src;
{noformat}

This process could be collapsed into a single LOAD DATA statement that targets 
the StorageHandler table directly. The problem isn't limited to 
HBaseStorageHandler; any StorageHandler implementation suffers from it.
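
As a rough sketch of the desired end state, assuming LOAD DATA were allowed to 
target a StorageHandler-backed table, the example above might reduce to the 
following. The LOAD DATA statement here is hypothetical: Hive does not 
currently accept it for StorageHandler tables, and the exact syntax and 
OVERWRITE semantics are an assumption, not a settled design.

{noformat}
# create the hbase table you WANT to populate
> CREATE TABLE hbase_src(key INT, value STRING)
  STORED BY 'org.apache.hadoop.hive.hbase.HBaseStorageHandler'
  WITH SERDEPROPERTIES ('hbase.columns.mapping' = ':key,cf:val')
  TBLPROPERTIES ('hbase.table.name' = 'hbase_src');
# HYPOTHETICAL: load local data directly, with no intermediate hive table
> LOAD DATA LOCAL INPATH '/path/to/hive/data/files/kv1.txt' INTO TABLE hbase_src;
{noformat}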



--
This message was sent by Atlassian JIRA
(v6.2#6252)
