[ 
https://issues.apache.org/jira/browse/FLINK-23169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17377350#comment-17377350
 ] 

jinfeng edited comment on FLINK-23169 at 7/8/21, 12:22 PM:
-----------------------------------------------------------

[~fly_in_gis]  thanks for  your reply.

When we use a Flink job management platform to submit jobs from different 
users, we config the 'yarn.staging-directory' ,for example. /flink/staging/

1. if user1 submit the job, and user1 will create the 
/flink/staging/.flink/application_xxxx user1 supergroup drwx--x--x
 2. if user2 submit the job, and user2 can't use /flink/staging/.flink , becase 
directory `.flink` belongs to user1.
 3. if we create /flink/staging/.flink first, there is also another problem, 
user1, user2, user3.. 's application staging directory will be created in the 
same directory .

The reason why we don't use the default home directory is that we want to 
provided a common directory to different users. If not, every new user need to 
create their own home directory and apply quota.
 I notice that the spark also provided a config to specific the yarn staging 
directory, and it will be divided by user directory by default.


was (Author: hackergin):
[~fly_in_gis]  thank for  you reply.

When we use a Flink job management platform to submit jobs from different 
users, we config the 'yarn.staging-directory' ,for example. /flink/staging/

1. if user1 submit the job, and user1 will create the 
/flink/staging/.flink/application_xxxx user1 supergroup drwx--x--x
2. if user2 submit the job, and user2 can't use /flink/staging/.flink , becase 
directory `.flink` belongs to user1.
3. if we create /flink/staging/.flink first, there is also another problem, 
user1, user2, user3.. 's application staging directory will be created in the 
same directory .

The reason why we don't use the default home directory is that we want to 
provided a common directory to different users. If not, every new user need to 
create their own home directory and apply quota.
I notice that the spark also provided a config to specific the yarn staging 
directory, and it will be divided by user directory by default.

> Support user-level app staging directory when yarn.staging-directory is 
> specified
> ---------------------------------------------------------------------------------
>
>                 Key: FLINK-23169
>                 URL: https://issues.apache.org/jira/browse/FLINK-23169
>             Project: Flink
>          Issue Type: Improvement
>          Components: Deployment / YARN
>            Reporter: jinfeng
>            Priority: Major
>
> When yarn.staging-directory is specified,  different users will use the same 
> directory as the staging directory.   It may not friendly for a job platform 
> to submit job for different users.  I propose to use the user-level directory 
> by default when yarn.staging-directory is specified.  We only need to make 
> small changes  for `getStagingDir` function in 
> YarnClusterDescriptor 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to