[ https://issues.apache.org/jira/browse/SQOOP-1532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15057582#comment-15057582 ]
ASF GitHub Bot commented on SQOOP-1532: --------------------------------------- Github user eruizgar commented on the pull request: https://github.com/apache/sqoop/pull/11#issuecomment-164680816 Hi jarcec, enclosed you can find the link of the complete PR patch. Hope this helps: https://patch-diff.githubusercontent.com/raw/apache/sqoop/pull/11.diff > Sqoop2: Support Sqoop on Spark Execution Engine > ----------------------------------------------- > > Key: SQOOP-1532 > URL: https://issues.apache.org/jira/browse/SQOOP-1532 > Project: Sqoop > Issue Type: Improvement > Reporter: Veena Basavaraj > Assignee: Veena Basavaraj > Fix For: 2.0.0 > > > The current execution engine supported in sqoop is MR. > The goal if this ticket is to support sqoop jobs ( map only and map+reduce ) > to run on spark environment. > It should at the minimum support running on the standalone spark cluster and > then subsequently work with YARN/mesos. > High level goals > 1. Hook up with the connector apis to provide the basic load/ extract to the > spark RDD. > 2. Implementation of the Sqoop RDD to support extraction from different data > sources . The design proposal will discuss the alternatives on how this can > be achieved. > 3. Optimizing the loading/writing ( re-use/ refactor the consumer thread code > to be agnostic of the hadoop output format) -- This message was sent by Atlassian JIRA (v6.3.4#6332)