Sergey Soldatov created PHOENIX-2723:
----------------------------------------

             Summary: Make BulkLoad able to load several tables at once
                 Key: PHOENIX-2723
                 URL: https://issues.apache.org/jira/browse/PHOENIX-2723
             Project: Phoenix
          Issue Type: Improvement
            Reporter: Sergey Soldatov


It comes that usually bulk load is required for more than one table and usually 
it's done by running jobs one by one. The idea is to provide lists of tables 
and corresponding input sources to the MR BulkLoad job. Syntax can be something 
like :
yarn ... CsvBulkLoadTool -t table1,table2,table3 --input input1,input2,input3
Having map tableName => input during map phase we can determine to which table 
the current split belongs to and produce necessary tableRowKeyPair. 

Any thoughts, suggestions?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to