Hari Sekhon created SQOOP-1055:
----------------------------------

             Summary: Add Sqoop export --query option
                 Key: SQOOP-1055
                 URL: https://issues.apache.org/jira/browse/SQOOP-1055
             Project: Sqoop
          Issue Type: Improvement
            Reporter: Hari Sekhon


Sqoop currently has a --query option for import but not for export.

It would be nice if the export --query option supporting HiveQL could be added 
as users currently have to create a temporary table and then export that as a 
two step process with a full disk re-write of all the to-be-exported data to a 
new table before the sqoop export command is started.

Since Sqoop executes a distributed map-only job, I believe certain queries such 
as joins that have to be done via a reduce phase will yield little performance 
improvement due to the map->reduce intermediate writes needing to be written 
anyway. However we could save on the final reduce phase writes and also turn 
this in to a more convenient one step instead two step process.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to