[ 
https://issues.apache.org/jira/browse/HIVE-17289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sankar Hariappan updated HIVE-17289:
------------------------------------
    Description: 
Currently, EXPORT uses distcp to dump data files to the dump directory, and
IMPORT uses distcp to copy large files, or a large number of files, from the
dump directory to the table staging directory. However, this copy fails
because distcp is always run as the doAs user specified in
hive.distcp.privileged.doAs, which is "hdfs" by default.
We need to remove the use of the doAs user when running distcp from the
EXPORT/IMPORT flow. Privileged-user distcp should be done only for the
REPL DUMP/LOAD commands.
Also, the default value of hive.distcp.privileged.doAs needs to be set to
"hive", as the "hdfs" super-user is never allowed.
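
The intended behaviour can be sketched roughly as below. This is only an
illustration under stated assumptions, not the actual patch: the class name,
the Command enum and the distCpDoAsUser helper are hypothetical, and a plain
Map stands in for the Hive configuration. Only the property name and the
proposed "hive" default come from the description above.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Optional;

public class DistCpUserSketch {
    /** Hypothetical command categories; these are not Hive's own types. */
    enum Command { EXPORT, IMPORT, REPL_DUMP, REPL_LOAD }

    /** Property name as given in the issue description. */
    static final String DOAS_KEY = "hive.distcp.privileged.doAs";

    /** Default value proposed in the issue description. */
    static final String DEFAULT_DOAS_USER = "hive";

    /**
     * Returns the privileged user distcp should run as, or empty when
     * distcp should run without any doAs user.
     */
    static Optional<String> distCpDoAsUser(Command cmd, Map<String, String> conf) {
        switch (cmd) {
            case REPL_DUMP:
            case REPL_LOAD:
                // Only the REPL commands use the privileged doAs user.
                return Optional.of(conf.getOrDefault(DOAS_KEY, DEFAULT_DOAS_USER));
            default:
                // EXPORT and IMPORT never use the privileged doAs user.
                return Optional.empty();
        }
    }

    public static void main(String[] args) {
        Map<String, String> conf = new HashMap<>();
        for (Command cmd : Command.values()) {
            System.out.println(cmd + " -> " + distCpDoAsUser(cmd, conf));
        }
    }
}
```

Returning an empty Optional for EXPORT/IMPORT keeps the "no doAs" case
explicit, so a caller cannot fall back to the privileged user by accident.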

  was:
Currently, IMPORT uses distcp to copy large files, or a large number of
files, from the dump directory to the table staging directory. However, this
copy fails because distcp is always run as the doAs user specified in
hive.distcp.privileged.doAs, which is "hdfs" by default.
We need to remove the use of the doAs user when running distcp from the
IMPORT flow.
Also, the default value of hive.distcp.privileged.doAs needs to be set to
"hive", as the "hdfs" super-user is never allowed.


> EXPORT and IMPORT shouldn't perform distcp with doAs privileged user.
> ---------------------------------------------------------------------
>
>                 Key: HIVE-17289
>                 URL: https://issues.apache.org/jira/browse/HIVE-17289
>             Project: Hive
>          Issue Type: Sub-task
>          Components: HiveServer2, repl
>    Affects Versions: 3.0.0
>            Reporter: Sankar Hariappan
>            Assignee: Sankar Hariappan
>              Labels: DR, Import, replication
>             Fix For: 3.0.0
>
>
> Currently, EXPORT uses distcp to dump data files to the dump directory, and
> IMPORT uses distcp to copy large files, or a large number of files, from the
> dump directory to the table staging directory. However, this copy fails
> because distcp is always run as the doAs user specified in
> hive.distcp.privileged.doAs, which is "hdfs" by default.
> We need to remove the use of the doAs user when running distcp from the
> EXPORT/IMPORT flow. Privileged-user distcp should be done only for the
> REPL DUMP/LOAD commands.
> Also, the default value of hive.distcp.privileged.doAs needs to be set to
> "hive", as the "hdfs" super-user is never allowed.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
