C Scyphers created SQOOP-3281:
---------------------------------

             Summary: Support for Hive UDFs on import
                 Key: SQOOP-3281
                 URL: https://issues.apache.org/jira/browse/SQOOP-3281
             Project: Sqoop
          Issue Type: Improvement
          Components: hive-integration
    Affects Versions: 1.4.6
            Reporter: C Scyphers


Many companies use UDFs to apply column-level encryption at write time, so 
Sqoop should support applying such a UDF while it writes imported data.  
This would extend the --map-column-hive functionality so that the value 
handled by parseColumnMapping accepts a UDF where a Hive type would normally go:

{{sqoop import --verbose --connect "jdbcconnectionstring" --username user 
--password password --hive-import --hive-database hiveschematest 
--map-column-hive 
"emptest.id=int,emptest.name=varchar(100),emptest.ssn=UDF_ENCRYPT()" -m 1}}

With this approach, the data never has to be written to HDFS in the clear.  
The same mechanism could naturally be extended to other UDFs.
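
As a rough illustration of the parsing change only (not Sqoop's actual code, 
which is Java), the sketch below shows one way a column mapping string could be 
split so that UDF entries are told apart from plain Hive types. The function 
names and the rule that an entry ending in "()" denotes a UDF are assumptions 
made for this example; the issue does not define the exact syntax.

```python
from typing import Dict, List, Tuple


def split_top_level(mapping: str) -> List[str]:
    """Split on commas that are not inside parentheses, so a type such as
    decimal(10,2) stays in one piece."""
    parts: List[str] = []
    current: List[str] = []
    depth = 0
    for ch in mapping:
        if ch == "(":
            depth += 1
        elif ch == ")":
            depth -= 1
        if ch == "," and depth == 0:
            parts.append("".join(current).strip())
            current = []
        else:
            current.append(ch)
    if current:
        parts.append("".join(current).strip())
    return parts


def parse_column_mapping(mapping: str) -> Dict[str, Tuple[str, str]]:
    """Return {column: (kind, value)} where kind is "type" or "udf".

    Assumption for this sketch: a value ending in "()" names a UDF to apply
    to the column; anything else is treated as a Hive type.
    """
    result: Dict[str, Tuple[str, str]] = {}
    for entry in split_top_level(mapping):
        column, sep, value = entry.partition("=")
        if not sep or not column.strip() or not value.strip():
            raise ValueError("Malformed mapping entry: %r" % entry)
        value = value.strip()
        if value.endswith("()"):
            result[column.strip()] = ("udf", value[:-2])
        else:
            result[column.strip()] = ("type", value)
    return result


if __name__ == "__main__":
    example = "emptest.id=int,emptest.name=varchar(100),emptest.ssn=UDF_ENCRYPT()"
    for column, (kind, value) in parse_column_mapping(example).items():
        print(column, kind, value)
```

A real implementation would still need to decide how the UDF is resolved and 
invoked during the write, which this sketch does not attempt.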


--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
