[ 
https://issues.apache.org/jira/browse/YARN-5765?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15665443#comment-15665443
 ] 

Miklos Szegedi commented on YARN-5765:
--------------------------------------

Thank you, [~Naganarasimha] for the reply!
Just to finish this discusssion before you resolve this bug.
"What i want to understand is what impact would it have if we do it always ? As 
we never run the container-executor.c binary with root user refer (set_user -> 
check_user) and would it be sufficient to reset the umask after mkdir ?"
I was more concerned about any future changes, who will just call mkdirs 
without knowing that it actually changes umask. Resetting the umask mitigates 
possible side effects. All in all, if the umask change is chosen, I think it is 
best to put it into create_container_directories.
"One more question is if we reset the umask after mkdir then will the container 
logs created will be accessible to the NM because of restrictive rights ? would 
be ideal to set default ACL for the folders created and reset the umask so that 
files created by the user under these directories have the rightful 
permissions?"
I think a more future proof approach to default acls and umask could be a 
feature like the current bin/container-executor --checksetup, when specifying 
the user and root dir tells the administrator, if the user/dir has the right 
settings to run containers. It gives an advice what needs to be set if it does 
not. This has a different philosophy, I admit.
Does this answer your questions?



> LinuxContainerExecutor creates appcache and its subdirectories with wrong 
> group owner.
> --------------------------------------------------------------------------------------
>
>                 Key: YARN-5765
>                 URL: https://issues.apache.org/jira/browse/YARN-5765
>             Project: Hadoop YARN
>          Issue Type: Bug
>    Affects Versions: 2.8.0, 3.0.0-alpha1
>            Reporter: Haibo Chen
>            Assignee: Naganarasimha G R
>            Priority: Blocker
>         Attachments: YARN-5765.001.patch
>
>
> LinuxContainerExecutor creates usercache/\{userId\}/appcache/\{appId\} with 
> wrong group owner, causing Log aggregation and ShuffleHandler to fail because 
> node manager process does not have permission to read the files under the 
> directory.
> This can be easily reproduced by enabling LCE and submitting a MR example job 
> as a user that does not belong to the same group that NM process belongs to. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to