[ https://issues.apache.org/jira/browse/SPARK-33826?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17251513#comment-17251513 ]
Hyukjin Kwon commented on SPARK-33826: -------------------------------------- [~AlberyZJG] are you able to show the self-contained reproducer so people can verify easily? > InsertIntoHiveTable generate HDFS file with invalid user > -------------------------------------------------------- > > Key: SPARK-33826 > URL: https://issues.apache.org/jira/browse/SPARK-33826 > Project: Spark > Issue Type: Bug > Components: Spark Core > Affects Versions: 2.3.2, 3.0.0 > Reporter: Zhang Jianguo > Priority: Minor > > *Arch:* Hive on Spark. > > *Version:* Spark 2.3.2 > > *Conf:* > Enable user impersonation > hive.server2.enable.doAs=true > > *Scenario:* > Thriftserver is running with loginUser A, and Task run as User A too. > Client execute SQL with user B > > Data generated by sql "insert into TABLE \[tbl\] select XXX form XXXX." is > written to HDFS on executor, executor doesn't know B. > > *{color:#de350b}So the user file written to HDFS will be user A which should > be B.{color}* > > I also check the inplementation of Spark 3.0.0, It could have the same issue. -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org