[jira] [Commented] (YARN-8679) [ATSv2] If HBase cluster is down for long time, high chances that NM ContainerManager dispatcher get blocked

2018-08-17 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16584655#comment-16584655 ] Hudson commented on YARN-8679: -- SUCCESS: Integrated in Jenkins build Hadoop-trunk-Commit #14800 (See

[jira] [Updated] (YARN-7835) [Atsv2] Race condition in NM while publishing events if second attempt is launched on the same node

2018-08-17 Thread Rohith Sharma K S (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith Sharma K S updated YARN-7835: Fix Version/s: 3.0.4 > [Atsv2] Race condition in NM while publishing events if second

[jira] [Commented] (YARN-7835) [Atsv2] Race condition in NM while publishing events if second attempt is launched on the same node

2018-08-17 Thread Rohith Sharma K S (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7835?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16584647#comment-16584647 ] Rohith Sharma K S commented on YARN-7835: - This commit was missing in branch-3.0.. I back ported

[jira] [Commented] (YARN-8679) [ATSv2] If HBase cluster is down for long time, high chances that NM ContainerManager dispatcher get blocked

2018-08-17 Thread Rohith Sharma K S (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16584634#comment-16584634 ] Rohith Sharma K S commented on YARN-8679: - +1 lgtm..committing shortly. I ran tests and it seems

[jira] [Commented] (YARN-8648) Container cgroups are leaked when using docker

2018-08-17 Thread Eric Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16584600#comment-16584600 ] Eric Yang commented on YARN-8648: - I am in favor of minimal fix at this time. Let docker be docker seems

[jira] [Commented] (YARN-8648) Container cgroups are leaked when using docker

2018-08-17 Thread Jim Brennan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16584479#comment-16584479 ] Jim Brennan commented on YARN-8648: --- I have been experimenting with the following incomplete approach:

[jira] [Commented] (YARN-8242) YARN NM: OOM error while reading back the state store on recovery

2018-08-17 Thread genericqa (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16584475#comment-16584475 ] genericqa commented on YARN-8242: - | (/) *{color:green}+1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-8242) YARN NM: OOM error while reading back the state store on recovery

2018-08-17 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16584463#comment-16584463 ] Jason Lowe commented on YARN-8242: -- Thanks for updating the patch! +1 lgtm pending Jenkins. > YARN NM:

[jira] [Commented] (YARN-8679) [ATSv2] If HBase cluster is down for long time, high chances that NM ContainerManager dispatcher get blocked

2018-08-17 Thread genericqa (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16584461#comment-16584461 ] genericqa commented on YARN-8679: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-7018) Interface for adding extra behavior to node heartbeats

2018-08-17 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16584453#comment-16584453 ] Jason Lowe commented on YARN-7018: -- Thanks for the POC patch! At a very high level it's along the lines

[jira] [Commented] (YARN-7863) Modify placement constraints to support node attributes

2018-08-17 Thread genericqa (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16584441#comment-16584441 ] genericqa commented on YARN-7863: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Updated] (YARN-8242) YARN NM: OOM error while reading back the state store on recovery

2018-08-17 Thread Pradeep Ambati (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pradeep Ambati updated YARN-8242: - Attachment: YARN-8242.008.patch > YARN NM: OOM error while reading back the state store on

[jira] [Commented] (YARN-8679) [ATSv2] If HBase cluster is down for long time, high chances that NM ContainerManager dispatcher get blocked

2018-08-17 Thread Eric Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16584379#comment-16584379 ] Eric Yang commented on YARN-8679: - +1 Patch 2, pending Jenkins results. > [ATSv2] If HBase cluster is

[jira] [Commented] (YARN-8673) [AMRMProxy] More robust responseId resync after an YarnRM master slave switch

2018-08-17 Thread genericqa (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16584354#comment-16584354 ] genericqa commented on YARN-8673: - | (/) *{color:green}+1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Created] (YARN-8681) Wrong error message in RM placement constraints check

2018-08-17 Thread Daniel Templeton (JIRA)
Daniel Templeton created YARN-8681: -- Summary: Wrong error message in RM placement constraints check Key: YARN-8681 URL: https://issues.apache.org/jira/browse/YARN-8681 Project: Hadoop YARN

[jira] [Assigned] (YARN-8679) [ATSv2] If HBase cluster is down for long time, high chances that NM ContainerManager dispatcher get blocked

2018-08-17 Thread Wangda Tan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wangda Tan reassigned YARN-8679: Assignee: Wangda Tan (was: Rohith Sharma K S) > [ATSv2] If HBase cluster is down for long time,

[jira] [Comment Edited] (YARN-8448) AM HTTPS Support

2018-08-17 Thread Szilard Nemeth (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8448?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16582321#comment-16582321 ] Szilard Nemeth edited comment on YARN-8448 at 8/17/18 8:16 PM: --- Hey

[jira] [Commented] (YARN-8640) Restore previous state in container-executor after failure

2018-08-17 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16584338#comment-16584338 ] Jason Lowe commented on YARN-8640: -- Thanks for updating the patches! +1 for the branch-2.8 and

[jira] [Commented] (YARN-8621) Add REST API tests for Resource Types fields for the apps/ endpoint

2018-08-17 Thread Daniel Templeton (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16584336#comment-16584336 ] Daniel Templeton commented on YARN-8621: I don't think we need to return the resource requests

[jira] [Commented] (YARN-8679) [ATSv2] If HBase cluster is down for long time, high chances that NM ContainerManager dispatcher get blocked

2018-08-17 Thread Wangda Tan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16584322#comment-16584322 ] Wangda Tan commented on YARN-8679: -- [~rohithsharma], thanks for the patch. I'm a bit worried about the

[jira] [Updated] (YARN-8679) [ATSv2] If HBase cluster is down for long time, high chances that NM ContainerManager dispatcher get blocked

2018-08-17 Thread Wangda Tan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wangda Tan updated YARN-8679: - Attachment: YARN-8679.02.patch > [ATSv2] If HBase cluster is down for long time, high chances that NM >

[jira] [Commented] (YARN-7863) Modify placement constraints to support node attributes

2018-08-17 Thread Sunil Govindan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16584281#comment-16584281 ] Sunil Govindan commented on YARN-7863: -- Updating a refined patch. Thanks [~cheersyang] for quick

[jira] [Updated] (YARN-7863) Modify placement constraints to support node attributes

2018-08-17 Thread Sunil Govindan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sunil Govindan updated YARN-7863: - Attachment: YARN-7863-YARN-3409.006.patch > Modify placement constraints to support node

[jira] [Commented] (YARN-8242) YARN NM: OOM error while reading back the state store on recovery

2018-08-17 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16584261#comment-16584261 ] Jason Lowe commented on YARN-8242: -- Thanks for updating the patch! When translating DBException into

[jira] [Updated] (YARN-8678) Queue Management API - rephrase error messages

2018-08-17 Thread Sunil Govindan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8678?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sunil Govindan updated YARN-8678: - Issue Type: Sub-task (was: Bug) Parent: YARN-5734 > Queue Management API - rephrase

[jira] [Updated] (YARN-8677) Queue Management API - no errors thrown for wrong properties

2018-08-17 Thread Sunil Govindan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8677?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sunil Govindan updated YARN-8677: - Issue Type: Sub-task (was: Bug) Parent: YARN-5734 > Queue Management API - no errors

[jira] [Commented] (YARN-8677) Queue Management API - no errors thrown for wrong properties

2018-08-17 Thread Wangda Tan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8677?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16584242#comment-16584242 ] Wangda Tan commented on YARN-8677: -- [~akhilpb], could u move these issues to sub jira of YARN-5734 for

[jira] [Commented] (YARN-8657) User limit calculation should be read-lock-protected within LeafQueue

2018-08-17 Thread Wangda Tan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16584238#comment-16584238 ] Wangda Tan commented on YARN-8657: -- [~sunilg], I'm not quite sure if the patch changed locking scope of

[jira] [Updated] (YARN-8680) YARN NM: Implement Iterable Abstraction for LocalResourceTrackerstate

2018-08-17 Thread Pradeep Ambati (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8680?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pradeep Ambati updated YARN-8680: - Description: Similar to YARN-8242, implement iterable abstraction for LocalResourceTrackerState

[jira] [Created] (YARN-8680) YARN NM: Implement Iterable Abstraction for LocalResourceTrackerstate

2018-08-17 Thread Pradeep Ambati (JIRA)
Pradeep Ambati created YARN-8680: Summary: YARN NM: Implement Iterable Abstraction for LocalResourceTrackerstate Key: YARN-8680 URL: https://issues.apache.org/jira/browse/YARN-8680 Project: Hadoop

[jira] [Updated] (YARN-8673) [AMRMProxy] More robust responseId resync after an YarnRM master slave switch

2018-08-17 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8673?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-8673: --- Attachment: YARN-8673.v2.patch > [AMRMProxy] More robust responseId resync after an YarnRM master

[jira] [Commented] (YARN-8640) Restore previous state in container-executor after failure

2018-08-17 Thread genericqa (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16584059#comment-16584059 ] genericqa commented on YARN-8640: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-6495) check docker container's exit code when writing to cgroup task files

2018-08-17 Thread Jim Brennan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6495?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16584038#comment-16584038 ] Jim Brennan commented on YARN-6495: --- YARN-8656 removed the code that this Jira was fixing. I think we

[jira] [Updated] (YARN-8640) Restore previous state in container-executor after failure

2018-08-17 Thread Jim Brennan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Brennan updated YARN-8640: -- Attachment: YARN-8640-branch-2.8.002.patch YARN-8640-branch-2.7.002.patch > Restore

[jira] [Commented] (YARN-8640) Restore previous state in container-executor after failure

2018-08-17 Thread Jim Brennan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16584024#comment-16584024 ] Jim Brennan commented on YARN-8640: --- [~jlowe], thanks for the review!  I have removed the changes to

[jira] [Updated] (YARN-5738) Allow services to release/kill specific containers

2018-08-17 Thread Gour Saha (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gour Saha updated YARN-5738: Target Version/s: 3.0.3 Component/s: yarn-native-services > Allow services to release/kill

[jira] [Commented] (YARN-8679) [ATSv2] If HBase cluster is down for long time, high chances that NM ContainerManager dispatcher get blocked

2018-08-17 Thread genericqa (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16583988#comment-16583988 ] genericqa commented on YARN-8679: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Updated] (YARN-8679) [ATSv2] If HBase cluster is down for long time, high chances that NM ContainerManager dispatcher get blocked

2018-08-17 Thread Rohith Sharma K S (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith Sharma K S updated YARN-8679: Summary: [ATSv2] If HBase cluster is down for long time, high chances that NM

[jira] [Commented] (YARN-8679) [ATSv2] If HBase cluster is down, high chances that NM ContainerManager dispatcher get blocked

2018-08-17 Thread Rohith Sharma K S (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16583923#comment-16583923 ] Rohith Sharma K S commented on YARN-8679: - Attaching a patch which relaxes a strict

[jira] [Updated] (YARN-8679) [ATSv2] If HBase cluster is down, high chances that NM ContainerManager dispatcher get blocked

2018-08-17 Thread Rohith Sharma K S (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith Sharma K S updated YARN-8679: Attachment: YARN-8679.01.patch > [ATSv2] If HBase cluster is down, high chances that NM

[jira] [Commented] (YARN-8679) [ATSv2] If HBase cluster is down, high chances that NM ContainerManager dispatcher get blocked

2018-08-17 Thread Rohith Sharma K S (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16583915#comment-16583915 ] Rohith Sharma K S commented on YARN-8679: - In one of our cluster we see that NM jvm events are

[jira] [Assigned] (YARN-8679) [ATSv2] If HBase cluster is down, high chances that NM ContainerManager dispatcher get blocked

2018-08-17 Thread Rohith Sharma K S (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith Sharma K S reassigned YARN-8679: --- Assignee: Rohith Sharma K S > [ATSv2] If HBase cluster is down, high chances that NM

[jira] [Created] (YARN-8679) [ATSv2] If HBase cluster is down, high chances that NM ContainerManager dispatcher get blocked

2018-08-17 Thread Rohith Sharma K S (JIRA)
Rohith Sharma K S created YARN-8679: --- Summary: [ATSv2] If HBase cluster is down, high chances that NM ContainerManager dispatcher get blocked Key: YARN-8679 URL: https://issues.apache.org/jira/browse/YARN-8679

[jira] [Commented] (YARN-8632) No data in file realtimetrack.json after running SchedulerLoadSimulator

2018-08-17 Thread genericqa (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16583824#comment-16583824 ] genericqa commented on YARN-8632: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Updated] (YARN-8632) No data in file realtimetrack.json after running SchedulerLoadSimulator

2018-08-17 Thread Xianghao Lu (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xianghao Lu updated YARN-8632: -- Attachment: YARN-8632.002.patch > No data in file realtimetrack.json after running

[jira] [Commented] (YARN-8632) No data in file realtimetrack.json after running SchedulerLoadSimulator

2018-08-17 Thread Xianghao Lu (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16583766#comment-16583766 ] Xianghao Lu commented on YARN-8632: --- Thanks for your review! {quote} It is not a good practice to catch

[jira] [Created] (YARN-8678) Queue Management API - rephrase error messages

2018-08-17 Thread Akhil PB (JIRA)
Akhil PB created YARN-8678: -- Summary: Queue Management API - rephrase error messages Key: YARN-8678 URL: https://issues.apache.org/jira/browse/YARN-8678 Project: Hadoop YARN Issue Type: Bug

[jira] [Created] (YARN-8677) Queue Management API - no errors thrown for wrong properties

2018-08-17 Thread Akhil PB (JIRA)
Akhil PB created YARN-8677: -- Summary: Queue Management API - no errors thrown for wrong properties Key: YARN-8677 URL: https://issues.apache.org/jira/browse/YARN-8677 Project: Hadoop YARN Issue

[jira] [Commented] (YARN-7863) Modify placement constraints to support node attributes

2018-08-17 Thread genericqa (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16583613#comment-16583613 ] genericqa commented on YARN-7863: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-8676) Incorrect progress index in old yarn UI

2018-08-17 Thread genericqa (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16583481#comment-16583481 ] genericqa commented on YARN-8676: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-3611) Support Docker Containers In LinuxContainerExecutor

2018-08-17 Thread JackZhou (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16583470#comment-16583470 ] JackZhou commented on YARN-3611: Hi, [~shaneku...@gmail.com]  I would like to use this feature in our

[jira] [Commented] (YARN-8676) Incorrect progress index in old yarn UI

2018-08-17 Thread Yeliang Cang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16583427#comment-16583427 ] Yeliang Cang commented on YARN-8676: Seams related to YARN-7088 > Incorrect progress index in old

[jira] [Updated] (YARN-8676) Incorrect progress index in old yarn UI

2018-08-17 Thread Yeliang Cang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yeliang Cang updated YARN-8676: --- Priority: Critical (was: Major) > Incorrect progress index in old yarn UI >