[jira] [Commented] (YARN-3962) If we change node manager identity to run as virtual account, then resource localization service fails to start with incorrect permission
[ https://issues.apache.org/jira/browse/YARN-3962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16448775#comment-16448775 ] genericqa commented on YARN-3962: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem || Runtime || Comment || | {color:blue}0{color} | {color:blue} reexec {color} | {color:blue} 0m 25s{color} | {color:blue} Docker mode activated. {color} | || || || || {color:brown} Prechecks {color} || | {color:green}+1{color} | {color:green} @author {color} | {color:green} 0m 0s{color} | {color:green} The patch does not contain any @author tags. {color} | | {color:red}-1{color} | {color:red} test4tests {color} | {color:red} 0m 0s{color} | {color:red} The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch. {color} | || || || || {color:brown} trunk Compile Tests {color} || | {color:green}+1{color} | {color:green} mvninstall {color} | {color:green} 26m 4s{color} | {color:green} trunk passed {color} | | {color:green}+1{color} | {color:green} compile {color} | {color:green} 0m 51s{color} | {color:green} trunk passed {color} | | {color:green}+1{color} | {color:green} checkstyle {color} | {color:green} 0m 24s{color} | {color:green} trunk passed {color} | | {color:green}+1{color} | {color:green} mvnsite {color} | {color:green} 0m 34s{color} | {color:green} trunk passed {color} | | {color:green}+1{color} | {color:green} shadedclient {color} | {color:green} 11m 3s{color} | {color:green} branch has no errors when building and testing our client artifacts. {color} | | {color:green}+1{color} | {color:green} findbugs {color} | {color:green} 0m 50s{color} | {color:green} trunk passed {color} | | {color:green}+1{color} | {color:green} javadoc {color} | {color:green} 0m 22s{color} | {color:green} trunk passed {color} | || || || || {color:brown} Patch Compile Tests {color} || | {color:green}+1{color} | {color:green} mvninstall {color} | {color:green} 0m 31s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} compile {color} | {color:green} 0m 47s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} javac {color} | {color:green} 0m 47s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} checkstyle {color} | {color:green} 0m 20s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} mvnsite {color} | {color:green} 0m 30s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} whitespace {color} | {color:green} 0m 0s{color} | {color:green} The patch has no whitespace issues. {color} | | {color:green}+1{color} | {color:green} shadedclient {color} | {color:green} 11m 8s{color} | {color:green} patch has no errors when building and testing our client artifacts. {color} | | {color:green}+1{color} | {color:green} findbugs {color} | {color:green} 0m 56s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} javadoc {color} | {color:green} 0m 19s{color} | {color:green} the patch passed {color} | || || || || {color:brown} Other Tests {color} || | {color:green}+1{color} | {color:green} unit {color} | {color:green} 19m 1s{color} | {color:green} hadoop-yarn-server-nodemanager in the patch passed. {color} | | {color:green}+1{color} | {color:green} asflicense {color} | {color:green} 0m 22s{color} | {color:green} The patch does not generate ASF License warnings. {color} | | {color:black}{color} | {color:black} {color} | {color:black} 74m 26s{color} | {color:black} {color} | \\ \\ || Subsystem || Report/Notes || | Docker | Client=17.05.0-ce Server=17.05.0-ce Image:yetus/hadoop:8620d2b | | JIRA Issue | YARN-3962 | | JIRA Patch URL | https://issues.apache.org/jira/secure/attachment/12919681/YARN-3962.003.patch | | Optional Tests | asflicense compile javac javadoc mvninstall mvnsite unit shadedclient findbugs checkstyle | | uname | Linux 17ef74905c4c 3.13.0-137-generic #186-Ubuntu SMP Mon Dec 4 19:09:19 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux | | Build tool | maven | | Personality | /testptch/patchprocess/precommit/personality/provided.sh | | git revision | trunk / f411de6 | | maven | version: Apache Maven 3.3.9 | | Default Java | 1.8.0_162 | | findbugs | v3.1.0-RC1 | | Test Results | https://builds.apache.org/job/PreCommit-YARN-Build/20445/testReport/ | | Max. process+thread count | 337 (vs. ulimit of 1) | | modules | C: hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager U: hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager | | Console output | https://builds.apache.org/job/PreCommit-YARN-Build/20445/console | | Powered by | Apache Yetus 0.8.0-SNAPSH
[jira] [Commented] (YARN-3962) If we change node manager identity to run as virtual account, then resource localization service fails to start with incorrect permission
[ https://issues.apache.org/jira/browse/YARN-3962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16443103#comment-16443103 ] Íñigo Goiri commented on YARN-3962: --- We've been using this internally for a while and when testing 2.9 we hit the same issue. I'd like to try this out again. > If we change node manager identity to run as virtual account, then resource > localization service fails to start with incorrect permission > - > > Key: YARN-3962 > URL: https://issues.apache.org/jira/browse/YARN-3962 > Project: Hadoop YARN > Issue Type: Bug > Components: nodemanager >Affects Versions: 2.6.0 >Reporter: madhumita chakraborty >Priority: Major > Labels: oct16-easy > Attachments: YARN-3962-002.patch, Yarn-3962.001.patch > > > For azure hdinsight we need to change node manager to run as virtual account > instead of use account. Else after azure reimage, it wont be able to access > the map output data of the running job in that node. But when we changed the > nodemanager to run as virtual account we got this error, > 2015-06-02 06:11:45,281 INFO > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Writing credentials to the nmPrivate file > c:/apps1/temp/hdfs/nm-local-dir/nmPrivate/container_1433128260970_0007_01_01.tokens. > Credentials list: > 2015-06-02 06:11:45,313 WARN > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Permissions incorrectly set for dir > c:/apps1/temp/hdfs/nm-local-dir/usercache, should be rwxr-xr-x, actual value > = rwxrwxr-x > 2015-06-02 06:11:45,313 INFO > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Attempting to initialize c:/apps1/temp/hdfs/nm-local-dir > 2015-06-02 06:11:45,375 WARN > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Permissions incorrectly set for dir > c:/apps1/temp/hdfs/nm-local-dir/usercache, should be rwxr-xr-x, actual value > = rwxrwxr-x > 2015-06-02 06:11:45,375 WARN > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Failed to setup local dir c:/apps1/temp/hdfs/nm-local-dir, which was marked > as good. > org.apache.hadoop.yarn.exceptions.YarnRuntimeException: Permissions > incorrectly set for dir c:/apps1/temp/hdfs/nm-local-dir/usercache, should be > rwxr-xr-x, actual value = rwxrwxr-x > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.checkLocalDir(ResourceLocalizationService.java:1400) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.getInitializedLocalDirs(ResourceLocalizationService.java:1367) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.access$900(ResourceLocalizationService.java:137) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerRunner.run(ResourceLocalizationService.java:1085) > 2015-06-02 06:11:45,375 INFO > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Localizer failed > org.apache.hadoop.yarn.exceptions.YarnRuntimeException: Failed to setup > local dir c:/apps1/temp/hdfs/nm-local-dir, which was marked as good. > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.getInitializedLocalDirs(ResourceLocalizationService.java:1372) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.access$900(ResourceLocalizationService.java:137) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerRunner.run(ResourceLocalizationService.java:1085) > Caused by: org.apache.hadoop.yarn.exceptions.YarnRuntimeException: > Permissions incorrectly set for dir > c:/apps1/temp/hdfs/nm-local-dir/usercache, should be rwxr-xr-x, actual value > = rwxrwxr-x > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.checkLocalDir(ResourceLocalizationService.java:1400) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.getInitializedLocalDirs(ResourceLocalizationService.java:1367) > Fix - When node manager runs as virtual account, the resourcelocalization > service fails to come. It checks for the permission of usercache and file > cache to be 755 and nmPrivate to be 700. But in windows, for virtual account, >
[jira] [Commented] (YARN-3962) If we change node manager identity to run as virtual account, then resource localization service fails to start with incorrect permission
[ https://issues.apache.org/jira/browse/YARN-3962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14661004#comment-14661004 ] Chris Nauroth commented on YARN-3962: - This looks good to me. I agree with Xuan that it would be good to find a way to add unit tests. Thank you, Madhumita! > If we change node manager identity to run as virtual account, then resource > localization service fails to start with incorrect permission > - > > Key: YARN-3962 > URL: https://issues.apache.org/jira/browse/YARN-3962 > Project: Hadoop YARN > Issue Type: Bug > Components: nodemanager >Affects Versions: 2.6.0 >Reporter: madhumita chakraborty > Attachments: YARN-3962-002.patch, Yarn-3962.001.patch > > > For azure hdinsight we need to change node manager to run as virtual account > instead of use account. Else after azure reimage, it wont be able to access > the map output data of the running job in that node. But when we changed the > nodemanager to run as virtual account we got this error, > 2015-06-02 06:11:45,281 INFO > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Writing credentials to the nmPrivate file > c:/apps1/temp/hdfs/nm-local-dir/nmPrivate/container_1433128260970_0007_01_01.tokens. > Credentials list: > 2015-06-02 06:11:45,313 WARN > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Permissions incorrectly set for dir > c:/apps1/temp/hdfs/nm-local-dir/usercache, should be rwxr-xr-x, actual value > = rwxrwxr-x > 2015-06-02 06:11:45,313 INFO > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Attempting to initialize c:/apps1/temp/hdfs/nm-local-dir > 2015-06-02 06:11:45,375 WARN > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Permissions incorrectly set for dir > c:/apps1/temp/hdfs/nm-local-dir/usercache, should be rwxr-xr-x, actual value > = rwxrwxr-x > 2015-06-02 06:11:45,375 WARN > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Failed to setup local dir c:/apps1/temp/hdfs/nm-local-dir, which was marked > as good. > org.apache.hadoop.yarn.exceptions.YarnRuntimeException: Permissions > incorrectly set for dir c:/apps1/temp/hdfs/nm-local-dir/usercache, should be > rwxr-xr-x, actual value = rwxrwxr-x > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.checkLocalDir(ResourceLocalizationService.java:1400) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.getInitializedLocalDirs(ResourceLocalizationService.java:1367) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.access$900(ResourceLocalizationService.java:137) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerRunner.run(ResourceLocalizationService.java:1085) > 2015-06-02 06:11:45,375 INFO > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Localizer failed > org.apache.hadoop.yarn.exceptions.YarnRuntimeException: Failed to setup > local dir c:/apps1/temp/hdfs/nm-local-dir, which was marked as good. > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.getInitializedLocalDirs(ResourceLocalizationService.java:1372) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.access$900(ResourceLocalizationService.java:137) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerRunner.run(ResourceLocalizationService.java:1085) > Caused by: org.apache.hadoop.yarn.exceptions.YarnRuntimeException: > Permissions incorrectly set for dir > c:/apps1/temp/hdfs/nm-local-dir/usercache, should be rwxr-xr-x, actual value > = rwxrwxr-x > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.checkLocalDir(ResourceLocalizationService.java:1400) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.getInitializedLocalDirs(ResourceLocalizationService.java:1367) > Fix - When node manager runs as virtual account, the resourcelocalization > service fails to come. It checks for the permission of usercache and file > cache to be 755 and nmPrivate to be 700. But in windows, for virtual account, > the owner and group is same. So this pemrission check fails.
[jira] [Commented] (YARN-3962) If we change node manager identity to run as virtual account, then resource localization service fails to start with incorrect permission
[ https://issues.apache.org/jira/browse/YARN-3962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14643784#comment-14643784 ] Xuan Gong commented on YARN-3962: - The patch looks fine. Is it possible to add some testcases to verify this ? [~cnauroth] Could you review this, too ? > If we change node manager identity to run as virtual account, then resource > localization service fails to start with incorrect permission > - > > Key: YARN-3962 > URL: https://issues.apache.org/jira/browse/YARN-3962 > Project: Hadoop YARN > Issue Type: Bug > Components: nodemanager >Affects Versions: 2.6.0 >Reporter: madhumita chakraborty > Attachments: YARN-3962-002.patch, Yarn-3962.001.patch > > > For azure hdinsight we need to change node manager to run as virtual account > instead of use account. Else after azure reimage, it wont be able to access > the map output data of the running job in that node. But when we changed the > nodemanager to run as virtual account we got this error, > 2015-06-02 06:11:45,281 INFO > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Writing credentials to the nmPrivate file > c:/apps1/temp/hdfs/nm-local-dir/nmPrivate/container_1433128260970_0007_01_01.tokens. > Credentials list: > 2015-06-02 06:11:45,313 WARN > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Permissions incorrectly set for dir > c:/apps1/temp/hdfs/nm-local-dir/usercache, should be rwxr-xr-x, actual value > = rwxrwxr-x > 2015-06-02 06:11:45,313 INFO > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Attempting to initialize c:/apps1/temp/hdfs/nm-local-dir > 2015-06-02 06:11:45,375 WARN > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Permissions incorrectly set for dir > c:/apps1/temp/hdfs/nm-local-dir/usercache, should be rwxr-xr-x, actual value > = rwxrwxr-x > 2015-06-02 06:11:45,375 WARN > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Failed to setup local dir c:/apps1/temp/hdfs/nm-local-dir, which was marked > as good. > org.apache.hadoop.yarn.exceptions.YarnRuntimeException: Permissions > incorrectly set for dir c:/apps1/temp/hdfs/nm-local-dir/usercache, should be > rwxr-xr-x, actual value = rwxrwxr-x > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.checkLocalDir(ResourceLocalizationService.java:1400) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.getInitializedLocalDirs(ResourceLocalizationService.java:1367) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.access$900(ResourceLocalizationService.java:137) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerRunner.run(ResourceLocalizationService.java:1085) > 2015-06-02 06:11:45,375 INFO > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Localizer failed > org.apache.hadoop.yarn.exceptions.YarnRuntimeException: Failed to setup > local dir c:/apps1/temp/hdfs/nm-local-dir, which was marked as good. > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.getInitializedLocalDirs(ResourceLocalizationService.java:1372) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.access$900(ResourceLocalizationService.java:137) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerRunner.run(ResourceLocalizationService.java:1085) > Caused by: org.apache.hadoop.yarn.exceptions.YarnRuntimeException: > Permissions incorrectly set for dir > c:/apps1/temp/hdfs/nm-local-dir/usercache, should be rwxr-xr-x, actual value > = rwxrwxr-x > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.checkLocalDir(ResourceLocalizationService.java:1400) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.getInitializedLocalDirs(ResourceLocalizationService.java:1367) > Fix - When node manager runs as virtual account, the resourcelocalization > service fails to come. It checks for the permission of usercache and file > cache to be 755 and nmPrivate to be 700. But in windows, for virtual account, > the owner and group is same. So this pemrission check fails. So added a
[jira] [Commented] (YARN-3962) If we change node manager identity to run as virtual account, then resource localization service fails to start with incorrect permission
[ https://issues.apache.org/jira/browse/YARN-3962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14641381#comment-14641381 ] Hadoop QA commented on YARN-3962: - \\ \\ | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem || Runtime || Comment || | {color:blue}0{color} | pre-patch | 16m 0s | Pre-patch trunk compilation is healthy. | | {color:green}+1{color} | @author | 0m 0s | The patch does not contain any @author tags. | | {color:red}-1{color} | tests included | 0m 0s | The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch. | | {color:green}+1{color} | javac | 7m 49s | There were no new javac warning messages. | | {color:green}+1{color} | javadoc | 9m 42s | There were no new javadoc warning messages. | | {color:green}+1{color} | release audit | 0m 22s | The applied patch does not increase the total number of release audit warnings. | | {color:green}+1{color} | checkstyle | 0m 38s | There were no new checkstyle issues. | | {color:green}+1{color} | whitespace | 0m 0s | The patch has no lines that end in whitespace. | | {color:green}+1{color} | install | 1m 22s | mvn install still works. | | {color:green}+1{color} | eclipse:eclipse | 0m 34s | The patch built with eclipse:eclipse. | | {color:green}+1{color} | findbugs | 1m 12s | The patch does not introduce any new Findbugs (version 3.0.0) warnings. | | {color:green}+1{color} | yarn tests | 6m 18s | Tests passed in hadoop-yarn-server-nodemanager. | | | | 44m 1s | | \\ \\ || Subsystem || Report/Notes || | Patch URL | http://issues.apache.org/jira/secure/attachment/12747151/YARN-3962-002.patch | | Optional Tests | javadoc javac unit findbugs checkstyle | | git revision | trunk / adcf5dd | | hadoop-yarn-server-nodemanager test log | https://builds.apache.org/job/PreCommit-YARN-Build/8665/artifact/patchprocess/testrun_hadoop-yarn-server-nodemanager.txt | | Test Results | https://builds.apache.org/job/PreCommit-YARN-Build/8665/testReport/ | | Java | 1.7.0_55 | | uname | Linux asf905.gq1.ygridcore.net 3.13.0-36-lowlatency #63-Ubuntu SMP PREEMPT Wed Sep 3 21:56:12 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux | | Console output | https://builds.apache.org/job/PreCommit-YARN-Build/8665/console | This message was automatically generated. > If we change node manager identity to run as virtual account, then resource > localization service fails to start with incorrect permission > - > > Key: YARN-3962 > URL: https://issues.apache.org/jira/browse/YARN-3962 > Project: Hadoop YARN > Issue Type: Bug > Components: nodemanager >Affects Versions: 2.6.0 >Reporter: madhumita chakraborty > Attachments: YARN-3962-002.patch, Yarn-3962.001.patch > > > For azure hdinsight we need to change node manager to run as virtual account > instead of use account. Else after azure reimage, it wont be able to access > the map output data of the running job in that node. But when we changed the > nodemanager to run as virtual account we got this error, > 2015-06-02 06:11:45,281 INFO > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Writing credentials to the nmPrivate file > c:/apps1/temp/hdfs/nm-local-dir/nmPrivate/container_1433128260970_0007_01_01.tokens. > Credentials list: > 2015-06-02 06:11:45,313 WARN > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Permissions incorrectly set for dir > c:/apps1/temp/hdfs/nm-local-dir/usercache, should be rwxr-xr-x, actual value > = rwxrwxr-x > 2015-06-02 06:11:45,313 INFO > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Attempting to initialize c:/apps1/temp/hdfs/nm-local-dir > 2015-06-02 06:11:45,375 WARN > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Permissions incorrectly set for dir > c:/apps1/temp/hdfs/nm-local-dir/usercache, should be rwxr-xr-x, actual value > = rwxrwxr-x > 2015-06-02 06:11:45,375 WARN > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Failed to setup local dir c:/apps1/temp/hdfs/nm-local-dir, which was marked > as good. > org.apache.hadoop.yarn.exceptions.YarnRuntimeException: Permissions > incorrectly set for dir c:/apps1/temp/hdfs/nm-local-dir/usercache, should be > rwxr-xr-x, actual value = rwxrwxr-x > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.checkLocalDir(ResourceLocalizationServi
[jira] [Commented] (YARN-3962) If we change node manager identity to run as virtual account, then resource localization service fails to start with incorrect permission
[ https://issues.apache.org/jira/browse/YARN-3962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14641364#comment-14641364 ] Brahma Reddy Battula commented on YARN-3962: [~madhuch-ms] thanks for reporting for this and worked on this..Nicework..Verified the fix, it's working fine..Attached the patch behalf of [~madhuch-ms].. > If we change node manager identity to run as virtual account, then resource > localization service fails to start with incorrect permission > - > > Key: YARN-3962 > URL: https://issues.apache.org/jira/browse/YARN-3962 > Project: Hadoop YARN > Issue Type: Bug > Components: nodemanager >Affects Versions: 2.6.0 >Reporter: madhumita chakraborty > Attachments: YARN-3962-002.patch, Yarn-3962.001.patch > > > For azure hdinsight we need to change node manager to run as virtual account > instead of use account. Else after azure reimage, it wont be able to access > the map output data of the running job in that node. But when we changed the > nodemanager to run as virtual account we got this error, > 2015-06-02 06:11:45,281 INFO > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Writing credentials to the nmPrivate file > c:/apps1/temp/hdfs/nm-local-dir/nmPrivate/container_1433128260970_0007_01_01.tokens. > Credentials list: > 2015-06-02 06:11:45,313 WARN > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Permissions incorrectly set for dir > c:/apps1/temp/hdfs/nm-local-dir/usercache, should be rwxr-xr-x, actual value > = rwxrwxr-x > 2015-06-02 06:11:45,313 INFO > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Attempting to initialize c:/apps1/temp/hdfs/nm-local-dir > 2015-06-02 06:11:45,375 WARN > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Permissions incorrectly set for dir > c:/apps1/temp/hdfs/nm-local-dir/usercache, should be rwxr-xr-x, actual value > = rwxrwxr-x > 2015-06-02 06:11:45,375 WARN > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Failed to setup local dir c:/apps1/temp/hdfs/nm-local-dir, which was marked > as good. > org.apache.hadoop.yarn.exceptions.YarnRuntimeException: Permissions > incorrectly set for dir c:/apps1/temp/hdfs/nm-local-dir/usercache, should be > rwxr-xr-x, actual value = rwxrwxr-x > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.checkLocalDir(ResourceLocalizationService.java:1400) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.getInitializedLocalDirs(ResourceLocalizationService.java:1367) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.access$900(ResourceLocalizationService.java:137) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerRunner.run(ResourceLocalizationService.java:1085) > 2015-06-02 06:11:45,375 INFO > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Localizer failed > org.apache.hadoop.yarn.exceptions.YarnRuntimeException: Failed to setup > local dir c:/apps1/temp/hdfs/nm-local-dir, which was marked as good. > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.getInitializedLocalDirs(ResourceLocalizationService.java:1372) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.access$900(ResourceLocalizationService.java:137) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerRunner.run(ResourceLocalizationService.java:1085) > Caused by: org.apache.hadoop.yarn.exceptions.YarnRuntimeException: > Permissions incorrectly set for dir > c:/apps1/temp/hdfs/nm-local-dir/usercache, should be rwxr-xr-x, actual value > = rwxrwxr-x > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.checkLocalDir(ResourceLocalizationService.java:1400) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.getInitializedLocalDirs(ResourceLocalizationService.java:1367) > Fix - When node manager runs as virtual account, the resourcelocalization > service fails to come. It checks for the permission of usercache and file > cache to be 755 and nmPrivate to be 700. But in windows, for virtual account, > the owner
[jira] [Commented] (YARN-3962) If we change node manager identity to run as virtual account, then resource localization service fails to start with incorrect permission
[ https://issues.apache.org/jira/browse/YARN-3962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14639288#comment-14639288 ] Varun Vasudev commented on YARN-3962: - [~madhuch-ms] - looks like the patch doesn't apply cleanly. Can you attach the output from git diff? Thanks! > If we change node manager identity to run as virtual account, then resource > localization service fails to start with incorrect permission > - > > Key: YARN-3962 > URL: https://issues.apache.org/jira/browse/YARN-3962 > Project: Hadoop YARN > Issue Type: Bug > Components: nodemanager >Affects Versions: 2.6.0 >Reporter: madhumita chakraborty > Attachments: Yarn-3962.001.patch > > > For azure hdinsight we need to change node manager to run as virtual account > instead of use account. Else after azure reimage, it wont be able to access > the map output data of the running job in that node. But when we changed the > nodemanager to run as virtual account we got this error, > 2015-06-02 06:11:45,281 INFO > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Writing credentials to the nmPrivate file > c:/apps1/temp/hdfs/nm-local-dir/nmPrivate/container_1433128260970_0007_01_01.tokens. > Credentials list: > 2015-06-02 06:11:45,313 WARN > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Permissions incorrectly set for dir > c:/apps1/temp/hdfs/nm-local-dir/usercache, should be rwxr-xr-x, actual value > = rwxrwxr-x > 2015-06-02 06:11:45,313 INFO > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Attempting to initialize c:/apps1/temp/hdfs/nm-local-dir > 2015-06-02 06:11:45,375 WARN > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Permissions incorrectly set for dir > c:/apps1/temp/hdfs/nm-local-dir/usercache, should be rwxr-xr-x, actual value > = rwxrwxr-x > 2015-06-02 06:11:45,375 WARN > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Failed to setup local dir c:/apps1/temp/hdfs/nm-local-dir, which was marked > as good. > org.apache.hadoop.yarn.exceptions.YarnRuntimeException: Permissions > incorrectly set for dir c:/apps1/temp/hdfs/nm-local-dir/usercache, should be > rwxr-xr-x, actual value = rwxrwxr-x > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.checkLocalDir(ResourceLocalizationService.java:1400) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.getInitializedLocalDirs(ResourceLocalizationService.java:1367) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.access$900(ResourceLocalizationService.java:137) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerRunner.run(ResourceLocalizationService.java:1085) > 2015-06-02 06:11:45,375 INFO > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Localizer failed > org.apache.hadoop.yarn.exceptions.YarnRuntimeException: Failed to setup > local dir c:/apps1/temp/hdfs/nm-local-dir, which was marked as good. > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.getInitializedLocalDirs(ResourceLocalizationService.java:1372) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.access$900(ResourceLocalizationService.java:137) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerRunner.run(ResourceLocalizationService.java:1085) > Caused by: org.apache.hadoop.yarn.exceptions.YarnRuntimeException: > Permissions incorrectly set for dir > c:/apps1/temp/hdfs/nm-local-dir/usercache, should be rwxr-xr-x, actual value > = rwxrwxr-x > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.checkLocalDir(ResourceLocalizationService.java:1400) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.getInitializedLocalDirs(ResourceLocalizationService.java:1367) > Fix - When node manager runs as virtual account, the resourcelocalization > service fails to come. It checks for the permission of usercache and file > cache to be 755 and nmPrivate to be 700. But in windows, for virtual account, > the owner and group is same. So this pemrission check fails. So added a check > that is use
[jira] [Commented] (YARN-3962) If we change node manager identity to run as virtual account, then resource localization service fails to start with incorrect permission
[ https://issues.apache.org/jira/browse/YARN-3962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14638158#comment-14638158 ] Hadoop QA commented on YARN-3962: - \\ \\ | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem || Runtime || Comment || | {color:red}-1{color} | patch | 0m 0s | The patch command could not apply the patch during dryrun. | \\ \\ || Subsystem || Report/Notes || | Patch URL | http://issues.apache.org/jira/secure/attachment/12746718/Yarn-3962.001.patch | | Optional Tests | | | git revision | trunk / ee98d63 | | Console output | https://builds.apache.org/job/PreCommit-YARN-Build/8633/console | This message was automatically generated. > If we change node manager identity to run as virtual account, then resource > localization service fails to start with incorrect permission > - > > Key: YARN-3962 > URL: https://issues.apache.org/jira/browse/YARN-3962 > Project: Hadoop YARN > Issue Type: Bug > Components: nodemanager >Affects Versions: 2.6.0 >Reporter: madhumita chakraborty > Attachments: Yarn-3962.001.patch > > > For azure hdinsight we need to change node manager to run as virtual account > instead of use account. Else after azure reimage, it wont be able to access > the map output data of the running job in that node. But when we changed the > nodemanager to run as virtual account we got this error, > 2015-06-02 06:11:45,281 INFO > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Writing credentials to the nmPrivate file > c:/apps1/temp/hdfs/nm-local-dir/nmPrivate/container_1433128260970_0007_01_01.tokens. > Credentials list: > 2015-06-02 06:11:45,313 WARN > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Permissions incorrectly set for dir > c:/apps1/temp/hdfs/nm-local-dir/usercache, should be rwxr-xr-x, actual value > = rwxrwxr-x > 2015-06-02 06:11:45,313 INFO > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Attempting to initialize c:/apps1/temp/hdfs/nm-local-dir > 2015-06-02 06:11:45,375 WARN > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Permissions incorrectly set for dir > c:/apps1/temp/hdfs/nm-local-dir/usercache, should be rwxr-xr-x, actual value > = rwxrwxr-x > 2015-06-02 06:11:45,375 WARN > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Failed to setup local dir c:/apps1/temp/hdfs/nm-local-dir, which was marked > as good. > org.apache.hadoop.yarn.exceptions.YarnRuntimeException: Permissions > incorrectly set for dir c:/apps1/temp/hdfs/nm-local-dir/usercache, should be > rwxr-xr-x, actual value = rwxrwxr-x > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.checkLocalDir(ResourceLocalizationService.java:1400) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.getInitializedLocalDirs(ResourceLocalizationService.java:1367) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.access$900(ResourceLocalizationService.java:137) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerRunner.run(ResourceLocalizationService.java:1085) > 2015-06-02 06:11:45,375 INFO > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: > Localizer failed > org.apache.hadoop.yarn.exceptions.YarnRuntimeException: Failed to setup > local dir c:/apps1/temp/hdfs/nm-local-dir, which was marked as good. > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.getInitializedLocalDirs(ResourceLocalizationService.java:1372) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.access$900(ResourceLocalizationService.java:137) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerRunner.run(ResourceLocalizationService.java:1085) > Caused by: org.apache.hadoop.yarn.exceptions.YarnRuntimeException: > Permissions incorrectly set for dir > c:/apps1/temp/hdfs/nm-local-dir/usercache, should be rwxr-xr-x, actual value > = rwxrwxr-x > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.checkLocalDir(ResourceLocalizationService.java:1400) > at > org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocaliza