Bob created YARN-4508:
-------------------------

             Summary: The mount_cgroup method in container-executor.c should enhance mount check when mount the request cgroup controller.
                 Key: YARN-4508
                 URL: https://issues.apache.org/jira/browse/YARN-4508
             Project: Hadoop YARN
          Issue Type: Bug
          Components: yarn
    Affects Versions: 2.7.1, 2.6.1
            Reporter: Bob
            Priority: Minor

In one scenario, mount_cgroup can return success even though mounting the requested cgroup controller actually failed. The condition check in the code below should be enhanced:
{code}
    } else {
      fprintf(LOGFILE, "Failed to mount cgroup controller %s at %s - %s\n",
                controller, mount_path, strerror(errno));
      // if controller is already mounted, don't stop trying to mount others
      if (errno != EBUSY) {
        result = -1;
      }
    }
{code}

The issue can be reproduced with the following steps:
1. Start the NM; it mounts the cgroups normally.
2. Manually unmount the cgroups used by the NM.
3. Restart the NM. The NM starts successfully, but containers cannot be started because the cgroups were not mounted successfully.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
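
One possible way to tighten the check, offered only as a sketch and not as the fix adopted for this issue: when mount() fails with EBUSY, confirm that the controller really is mounted at mount_path before treating the failure as harmless. The helper names is_controller_mounted and result_after_mount_failure are hypothetical and do not exist in container-executor.c. The sketch assumes a mounts table in the /proc/mounts format (device, mount point, type, comma-separated options) and does not decode escaped characters such as \040 in paths.

```c
#include <assert.h>
#include <errno.h>
#include <stdio.h>
#include <string.h>

/* Hypothetical helper (not part of container-executor.c).
 * Returns 1 if mounts_file lists a "cgroup" filesystem mounted at
 * mount_path whose options include controller, 0 if it does not,
 * and -1 if mounts_file cannot be opened. */
int is_controller_mounted(const char *mounts_file, const char *controller,
                          const char *mount_path) {
  assert(controller != NULL && mount_path != NULL);
  FILE *fp = fopen(mounts_file, "r");
  if (fp == NULL) {
    return -1;
  }
  char line[4096];
  int found = 0;
  while (!found && fgets(line, sizeof(line), fp) != NULL) {
    char dev[256], dir[1024], type[64], opts[2048];
    /* Expected layout: device mount_point fs_type options ... */
    if (sscanf(line, "%255s %1023s %63s %2047s", dev, dir, type, opts) != 4) {
      continue;
    }
    if (strcmp(type, "cgroup") != 0 || strcmp(dir, mount_path) != 0) {
      continue;
    }
    /* Compare whole option tokens so "cpu" does not match "cpuacct". */
    for (char *tok = strtok(opts, ","); tok != NULL; tok = strtok(NULL, ",")) {
      if (strcmp(tok, controller) == 0) {
        found = 1;
        break;
      }
    }
  }
  fclose(fp);
  return found;
}

/* Hypothetical replacement for the bare "errno != EBUSY" test.
 * mount_errno is the errno value saved right after mount() failed.
 * Returns 0 only when the failure was EBUSY and the controller is
 * verifiably mounted at mount_path; returns -1 otherwise. */
int result_after_mount_failure(int mount_errno, const char *mounts_file,
                               const char *controller,
                               const char *mount_path) {
  if (mount_errno == EBUSY &&
      is_controller_mounted(mounts_file, controller, mount_path) == 1) {
    return 0;
  }
  return -1;
}
```

In the else branch quoted above, errno would be saved into a local variable immediately after mount() fails and passed in as mount_errno, with "/proc/mounts" as mounts_file. Taking the mounts file as a parameter is a design choice that lets the check be exercised against a test file.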