[jira] [Updated] (YARN-6959) RM may allocate wrong AM Container for new attempt

2017-08-07 Thread Yuqi Wang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuqi Wang updated YARN-6959: Attachment: (was: YARN-6959.004.patch) > RM may allocate wrong AM Container for new attempt >

[jira] [Updated] (YARN-6959) RM may allocate wrong AM Container for new attempt

2017-08-07 Thread Yuqi Wang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuqi Wang updated YARN-6959: Attachment: YARN-6959.004.patch > RM may allocate wrong AM Container for new attempt >

[jira] [Commented] (YARN-6903) Yarn-native-service framework core rewrite

2017-08-07 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6903?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117851#comment-16117851 ] Hadoop QA commented on YARN-6903: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-6959) RM may allocate wrong AM Container for new attempt

2017-08-07 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6959?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117829#comment-16117829 ] Hadoop QA commented on YARN-6959: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-6965) Duplicate instantiation in FairSchedulerQueueInfo

2017-08-07 Thread Masahiro Tanaka (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117824#comment-16117824 ] Masahiro Tanaka commented on YARN-6965: --- I would like to fix this duplication. Could anyone assign me

[jira] [Created] (YARN-6965) Duplicate instantiation in FairSchedulerQueueInfo

2017-08-07 Thread Masahiro Tanaka (JIRA)
Masahiro Tanaka created YARN-6965: - Summary: Duplicate instantiation in FairSchedulerQueueInfo Key: YARN-6965 URL: https://issues.apache.org/jira/browse/YARN-6965 Project: Hadoop YARN Issue

[jira] [Commented] (YARN-6940) FairScheduler: Enable Container update CodePaths and container resize testcase

2017-08-07 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117821#comment-16117821 ] Hadoop QA commented on YARN-6940: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Updated] (YARN-6959) RM may allocate wrong AM Container for new attempt

2017-08-07 Thread Yuqi Wang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuqi Wang updated YARN-6959: Component/s: fairscheduler capacity scheduler > RM may allocate wrong AM Container for new

[jira] [Commented] (YARN-6959) RM may allocate wrong AM Container for new attempt

2017-08-07 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6959?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117796#comment-16117796 ] Hadoop QA commented on YARN-6959: - | (/) *{color:green}+1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-6811) [ATS1.5] All history logs should be kept under its own User Directory.

2017-08-07 Thread Rohith Sharma K S (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117790#comment-16117790 ] Rohith Sharma K S commented on YARN-6811: - cc :/ [~djp] Updated the patch for branch-2 > [ATS1.5]

[jira] [Updated] (YARN-6959) RM may allocate wrong AM Container for new attempt

2017-08-07 Thread Yuqi Wang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuqi Wang updated YARN-6959: Attachment: YARN-6959.004.patch Adjust Style > RM may allocate wrong AM Container for new attempt >

[jira] [Commented] (YARN-6956) preemption may only consider resource requests for one node

2017-08-07 Thread Karthik Kambatla (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6956?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117772#comment-16117772 ] Karthik Kambatla commented on YARN-6956: There are a series of follow-up changes to be made: #

[jira] [Commented] (YARN-6852) [YARN-6223] Native code changes to support isolate GPU devices by using CGroups

2017-08-07 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117773#comment-16117773 ] Zhankun Tang commented on YARN-6852: [~miklos.szeg...@cloudera.com], [~wangda], Thanks for the good

[jira] [Commented] (YARN-6240) TestCapacityScheduler.testRefreshQueuesWithQueueDelete fails randomly

2017-08-07 Thread Naganarasimha G R (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117769#comment-16117769 ] Naganarasimha G R commented on YARN-6240: - Thanks [~wangda], seems like YARN-6741 will go in

[jira] [Commented] (YARN-6890) If UI is not secured, we allow user to kill other users' job even yarn cluster is secured.

2017-08-07 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117763#comment-16117763 ] Hadoop QA commented on YARN-6890: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-6964) Fair scheduler misuses Resources operations

2017-08-07 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6964?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117761#comment-16117761 ] Hadoop QA commented on YARN-6964: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Updated] (YARN-6959) RM may allocate wrong AM Container for new attempt

2017-08-07 Thread Yuqi Wang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuqi Wang updated YARN-6959: Attachment: YARN-6959.003.patch Re-trigger QA use the same patch as 002 > RM may allocate wrong AM

[jira] [Commented] (YARN-6885) AllocationFileLoaderService.loadQueue() should use a switch statement in the main tag parsing loop instead of the if/else-if/...

2017-08-07 Thread Yu-Tang Lin (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117743#comment-16117743 ] Yu-Tang Lin commented on YARN-6885: --- Thanks to Daniel for filling this form, I would like to take this

[jira] [Comment Edited] (YARN-6212) NodeManager metrics returning wrong negative values

2017-08-07 Thread Yang Wang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6212?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16116452#comment-16116452 ] Yang Wang edited comment on YARN-6212 at 8/8/17 2:25 AM: - Hi,

[jira] [Commented] (YARN-6940) FairScheduler: Enable Container update CodePaths and container resize testcase

2017-08-07 Thread Arun Suresh (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117732#comment-16117732 ] Arun Suresh commented on YARN-6940: --- ping [~templedf] / [~kasha] ? > FairScheduler: Enable Container

[jira] [Updated] (YARN-6940) FairScheduler: Enable Container update CodePaths and container resize testcase

2017-08-07 Thread Arun Suresh (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Arun Suresh updated YARN-6940: -- Summary: FairScheduler: Enable Container update CodePaths and container resize testcase (was:

[jira] [Updated] (YARN-6940) FairScheduler: Enable Container update CodePath and container resize testcase

2017-08-07 Thread Arun Suresh (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Arun Suresh updated YARN-6940: -- Summary: FairScheduler: Enable Container update CodePath and container resize testcase (was: Enable

[jira] [Updated] (YARN-6920) Fix resource leak that happens during container re-initialization.

2017-08-07 Thread Arun Suresh (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Arun Suresh updated YARN-6920: -- Fix Version/s: 3.0.0-beta1 2.9.0 > Fix resource leak that happens during container

[jira] [Updated] (YARN-6920) Fix resource leak that happens during container re-initialization.

2017-08-07 Thread Arun Suresh (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Arun Suresh updated YARN-6920: -- Component/s: nodemanager > Fix resource leak that happens during container re-initialization. >

[jira] [Updated] (YARN-6920) Fix resource leak that happens during container re-initialization.

2017-08-07 Thread Arun Suresh (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Arun Suresh updated YARN-6920: -- Summary: Fix resource leak that happens during container re-initialization. (was: Fix TestNMClient

[jira] [Updated] (YARN-6920) Fix TestNMClient failure due to YARN-6706

2017-08-07 Thread Arun Suresh (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Arun Suresh updated YARN-6920: -- Description: Looks like {{TestNMClient}} has been failing for a while. Opening this JIRA to track the

[jira] [Commented] (YARN-6668) Use cgroup to get container resource utilization

2017-08-07 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117708#comment-16117708 ] Hadoop QA commented on YARN-6668: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-6852) [YARN-6223] Native code changes to support isolate GPU devices by using CGroups

2017-08-07 Thread Miklos Szegedi (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117702#comment-16117702 ] Miklos Szegedi commented on YARN-6852: -- No comments on the design, thanks for the explanation. We can

[jira] [Commented] (YARN-5536) Multiple format support (JSON, etc.) for exclude node file in NM graceful decommission with timeout

2017-08-07 Thread Junping Du (JIRA)
[ https://issues-test.apache.org/jira/browse/YARN-5536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16090129#comment-16090129 ] Junping Du commented on YARN-5536: -- Oh. I see your point. Yes. If we can make this JIRA happen before

[jira] [Updated] (YARN-5536) Multiple format support (JSON, etc.) for exclude node file in NM graceful decommission with timeout

2017-08-07 Thread Junping Du (JIRA)
[ https://issues-test.apache.org/jira/browse/YARN-5536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Junping Du updated YARN-5536: - Priority: Critical (was: Major) > Multiple format support (JSON, etc.) for exclude node file in NM

[jira] [Commented] (YARN-6964) Fair scheduler misuses Resources operations

2017-08-07 Thread Miklos Szegedi (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6964?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117694#comment-16117694 ] Miklos Szegedi commented on YARN-6964: -- Thanks for the patch [~templedf]. {code} 566 Resource

[jira] [Commented] (YARN-6890) If UI is not secured, we allow user to kill other users' job even yarn cluster is secured.

2017-08-07 Thread Junping Du (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117678#comment-16117678 ] Junping Du commented on YARN-6890: -- Thanks for review, Jian! v3 patch should incorporate your comments. >

[jira] [Updated] (YARN-6890) If UI is not secured, we allow user to kill other users' job even yarn cluster is secured.

2017-08-07 Thread Junping Du (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Junping Du updated YARN-6890: - Attachment: YARN-6890-v3.patch > If UI is not secured, we allow user to kill other users' job even yarn >

[jira] [Commented] (YARN-6033) Add support for sections in container-executor configuration file

2017-08-07 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6033?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117677#comment-16117677 ] Hadoop QA commented on YARN-6033: - | (/) *{color:green}+1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-5536) Multiple format support (JSON, etc.) for exclude node file in NM graceful decommission with timeout

2017-08-07 Thread Ming Ma (JIRA)
[ https://issues-test.apache.org/jira/browse/YARN-5536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16090128#comment-16090128 ] Ming Ma commented on YARN-5536: --- I am not suggesting removing the previous format support (without

[jira] [Commented] (YARN-6033) Add support for sections in container-executor configuration file

2017-08-07 Thread Miklos Szegedi (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6033?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117675#comment-16117675 ] Miklos Szegedi commented on YARN-6033: -- Thank you for the patch [~wangda]. The last comments are style

[jira] [Commented] (YARN-6920) Fix TestNMClient failure due to YARN-6706

2017-08-07 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6920?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117670#comment-16117670 ] Hadoop QA commented on YARN-6920: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-5536) Multiple format support (JSON, etc.) for exclude node file in NM graceful decommission with timeout

2017-08-07 Thread Junping Du (JIRA)
[ https://issues-test.apache.org/jira/browse/YARN-5536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16090127#comment-16090127 ] Junping Du commented on YARN-5536: -- Hi [~mingma], I am more conservative on removing previous format

[jira] [Updated] (YARN-6668) Use cgroup to get container resource utilization

2017-08-07 Thread Miklos Szegedi (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Miklos Szegedi updated YARN-6668: - Attachment: YARN-6668.004.patch > Use cgroup to get container resource utilization >

[jira] [Updated] (YARN-6964) Fair scheduler misuses Resources operations

2017-08-07 Thread Daniel Templeton (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Templeton updated YARN-6964: --- Attachment: YARN-6964.001.patch > Fair scheduler misuses Resources operations >

[jira] [Created] (YARN-6964) Fair scheduler misuses Resources operations

2017-08-07 Thread Daniel Templeton (JIRA)
Daniel Templeton created YARN-6964: -- Summary: Fair scheduler misuses Resources operations Key: YARN-6964 URL: https://issues.apache.org/jira/browse/YARN-6964 Project: Hadoop YARN Issue

[jira] [Updated] (YARN-6033) Add support for sections in container-executor configuration file

2017-08-07 Thread Wangda Tan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wangda Tan updated YARN-6033: - Attachment: YARN-6033.012.patch Attached ver.012 patch, should addressed all your comments. bq. I already

[jira] [Commented] (YARN-5536) Multiple format support (JSON, etc.) for exclude node file in NM graceful decommission with timeout

2017-08-07 Thread Ming Ma (JIRA)
[ https://issues-test.apache.org/jira/browse/YARN-5536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16090126#comment-16090126 ] Ming Ma commented on YARN-5536: --- Per discussion in YARN-4676, current timeout config support is via

[jira] [Resolved] (YARN-1038) LocalizationProtocolPBClientImpl RPC failing

2017-08-07 Thread Junping Du (JIRA)
[ https://issues-test.apache.org/jira/browse/YARN-1038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Junping Du resolved YARN-1038. -- Resolution: Cannot Reproduce I don't think trunk branch has this problem now, just resolve as

[jira] [Comment Edited] (YARN-6852) [YARN-6223] Native code changes to support isolate GPU devices by using CGroups

2017-08-07 Thread Wangda Tan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117607#comment-16117607 ] Wangda Tan edited comment on YARN-6852 at 8/8/17 12:19 AM: --- Again, thanks

[jira] [Updated] (YARN-6852) [YARN-6223] Native code changes to support isolate GPU devices by using CGroups

2017-08-07 Thread Wangda Tan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wangda Tan updated YARN-6852: - Attachment: YARN-6852.004.patch Again, thanks [~miklos.szeg...@cloudera.com] for your reviews. bq. could

[jira] [Commented] (YARN-6033) Add support for sections in container-executor configuration file

2017-08-07 Thread Miklos Szegedi (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6033?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117604#comment-16117604 ] Miklos Szegedi commented on YARN-6033: -- {code} 485 cfg->size = 0; 486 conf_file =

[jira] [Assigned] (YARN-6910) Increase RM audit log coverage

2017-08-07 Thread zhenzhao wang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhenzhao wang reassigned YARN-6910: --- Assignee: zhenzhao wang > Increase RM audit log coverage > -- > >

[jira] [Commented] (YARN-6955) Handle concurrent register AM requests in FederationInterceptor

2017-08-07 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117596#comment-16117596 ] Botong Huang commented on YARN-6955: Thanks [~subru]! > Handle concurrent register AM requests in

[jira] [Commented] (YARN-1038) LocalizationProtocolPBClientImpl RPC failing

2017-08-07 Thread Ming Ma (JIRA)
[ https://issues-test.apache.org/jira/browse/YARN-1038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16090124#comment-16090124 ] Ming Ma commented on YARN-1038: --- Given this blocker was opened many years ago and there isn't any recent

[jira] [Commented] (YARN-6668) Use cgroup to get container resource utilization

2017-08-07 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117587#comment-16117587 ] Haibo Chen commented on YARN-6668: -- Two more comments 1) all the statements,

[jira] [Commented] (YARN-6668) Use cgroup to get container resource utilization

2017-08-07 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117585#comment-16117585 ] Hadoop QA commented on YARN-6668: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Updated] (YARN-6903) Yarn-native-service framework core rewrite

2017-08-07 Thread Jian He (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jian He updated YARN-6903: -- Attachment: YARN-6903.yarn-native-services.03.patch > Yarn-native-service framework core rewrite >

[jira] [Commented] (YARN-6917) Queue path is recomputed from scratch on every allocation

2017-08-07 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117563#comment-16117563 ] Hadoop QA commented on YARN-6917: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-6962) Add support for updateContainers when allocating using FederationInterceptor

2017-08-07 Thread Subru Krishnan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117553#comment-16117553 ] Subru Krishnan commented on YARN-6962: -- Thanks [~botong] for working on this. The patch itself LGTM,

[jira] [Commented] (YARN-6920) Fix TestNMClient failure due to YARN-6706

2017-08-07 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6920?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117547#comment-16117547 ] Hadoop QA commented on YARN-6920: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Updated] (YARN-6962) Add support for updateContainers when allocating using FederationInterceptor

2017-08-07 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-6962: --- Description: Container update is introduced in YARN-5221. Federation Interceptor needs to support it

[jira] [Updated] (YARN-6962) Add support for updateContainers when allocating using FederationInterceptor

2017-08-07 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-6962: --- Summary: Add support for updateContainers when allocating using FederationInterceptor (was:

[jira] [Updated] (YARN-6955) Handle concurrent register AM requests in FederationInterceptor

2017-08-07 Thread Subru Krishnan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6955?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Subru Krishnan updated YARN-6955: - Summary: Handle concurrent register AM requests in FederationInterceptor (was: Concurrent

[jira] [Commented] (YARN-6930) Admins should be able to explicitly enable specific LinuxContainerRuntime in the NodeManager

2017-08-07 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6930?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117508#comment-16117508 ] Hadoop QA commented on YARN-6930: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-6890) If UI is not secured, we allow user to kill other users' job even yarn cluster is secured.

2017-08-07 Thread Jian He (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117505#comment-16117505 ] Jian He commented on YARN-6890: --- - Looks good overall, minor comment, there's an

[jira] [Commented] (YARN-6033) Add support for sections in container-executor configuration file

2017-08-07 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6033?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117495#comment-16117495 ] Hadoop QA commented on YARN-6033: - | (/) *{color:green}+1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Updated] (YARN-6668) Use cgroup to get container resource utilization

2017-08-07 Thread Miklos Szegedi (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Miklos Szegedi updated YARN-6668: - Attachment: YARN-6668.003.patch Thank you for the comments [~haibochen], I updated the patch. >

[jira] [Commented] (YARN-6668) Use cgroup to get container resource utilization

2017-08-07 Thread Miklos Szegedi (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117467#comment-16117467 ] Miklos Szegedi commented on YARN-6668: -- bq. 3) The CGroupsResoureCalculator now takes precedence over

[jira] [Commented] (YARN-6668) Use cgroup to get container resource utilization

2017-08-07 Thread Miklos Szegedi (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117464#comment-16117464 ] Miklos Szegedi commented on YARN-6668: -- bq. 5) ContainersMonitorImpl is missing an import of

[jira] [Commented] (YARN-6668) Use cgroup to get container resource utilization

2017-08-07 Thread Miklos Szegedi (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117454#comment-16117454 ] Miklos Szegedi commented on YARN-6668: -- bq. 9) We call setCGroupFilePaths() in constructor which

[jira] [Commented] (YARN-6668) Use cgroup to get container resource utilization

2017-08-07 Thread Miklos Szegedi (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117445#comment-16117445 ] Miklos Szegedi commented on YARN-6668: -- bq. the lock around firstError is unnecessary if we make

[jira] [Updated] (YARN-6917) Queue path is recomputed from scratch on every allocation

2017-08-07 Thread Eric Payne (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Payne updated YARN-6917: - Attachment: YARN-6917.001.patch The queue path is only ever set during queue initialization or

[jira] [Commented] (YARN-3254) HealthReport should include disk full information

2017-08-07 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117436#comment-16117436 ] Hadoop QA commented on YARN-3254: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-6668) Use cgroup to get container resource utilization

2017-08-07 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117430#comment-16117430 ] Hadoop QA commented on YARN-6668: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-6900) ZooKeeper based implementation of the FederationStateStore

2017-08-07 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117414#comment-16117414 ] Hadoop QA commented on YARN-6900: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Updated] (YARN-6033) Add support for sections in container-executor configuration file

2017-08-07 Thread Wangda Tan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wangda Tan updated YARN-6033: - Attachment: YARN-6033.011.patch Attached ver.11 patch, cc: [~miklos.szeg...@cloudera.com] > Add support

[jira] [Commented] (YARN-6900) ZooKeeper based implementation of the FederationStateStore

2017-08-07 Thread JIRA
[ https://issues.apache.org/jira/browse/YARN-6900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117410#comment-16117410 ] Íñigo Goiri commented on YARN-6900: --- Pushed the refactoring to HADOOP-14741. Once that's done I'll rebase

[jira] [Commented] (YARN-6930) Admins should be able to explicitly enable specific LinuxContainerRuntime in the NodeManager

2017-08-07 Thread Miklos Szegedi (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6930?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117380#comment-16117380 ] Miklos Szegedi commented on YARN-6930: -- Thank you for the patch, [~shaneku...@gmail.com]. Would you

[jira] [Commented] (YARN-5464) Server-Side NM Graceful Decommissioning with RM HA

2017-08-07 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117374#comment-16117374 ] Robert Kanter commented on YARN-5464: - I don't currently have the bandwidth. Go ahead [~djp].

[jira] [Updated] (YARN-3254) HealthReport should include disk full information

2017-08-07 Thread Suma Shivaprasad (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suma Shivaprasad updated YARN-3254: --- Attachment: YARN-3254-006.patch Thanks [~sunilg]. Attached patched with review comments

[jira] [Comment Edited] (YARN-6920) Fix TestNMClient failure due to YARN-6706

2017-08-07 Thread Arun Suresh (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6920?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117367#comment-16117367 ] Arun Suresh edited comment on YARN-6920 at 8/7/17 9:48 PM: --- Kicked Jenkins off

[jira] [Commented] (YARN-6920) Fix TestNMClient failure due to YARN-6706

2017-08-07 Thread Arun Suresh (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6920?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117367#comment-16117367 ] Arun Suresh commented on YARN-6920: --- Kicked Jenkins off again for this patch - to verify that the last

[jira] [Updated] (YARN-6930) Admins should be able to explicitly enable specific LinuxContainerRuntime in the NodeManager

2017-08-07 Thread Shane Kumpf (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6930?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shane Kumpf updated YARN-6930: -- Attachment: YARN-6930.001.patch > Admins should be able to explicitly enable specific

[jira] [Commented] (YARN-6033) Add support for sections in container-executor configuration file

2017-08-07 Thread Wangda Tan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6033?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117351#comment-16117351 ] Wangda Tan commented on YARN-6033: -- Thanks [~miklos.szeg...@cloudera.com], will update patch shortly. >

[jira] [Commented] (YARN-6920) Fix TestNMClient failure due to YARN-6706

2017-08-07 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6920?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117336#comment-16117336 ] Haibo Chen commented on YARN-6920: -- [~asuresh] Seems like I am hitting another issue (due to race

[jira] [Commented] (YARN-6920) Fix TestNMClient failure due to YARN-6706

2017-08-07 Thread Jian He (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6920?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117334#comment-16117334 ] Jian He commented on YARN-6920: --- yep, the patch lgtm > Fix TestNMClient failure due to YARN-6706 >

[jira] [Commented] (YARN-6920) Fix TestNMClient failure due to YARN-6706

2017-08-07 Thread Arun Suresh (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6920?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117322#comment-16117322 ] Arun Suresh commented on YARN-6920: --- Sure.. Raised YARN-6963 to track that. Let me know if this patch is

[jira] [Updated] (YARN-6963) Prevent other containers from staring when a container is re-initializing

2017-08-07 Thread Arun Suresh (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6963?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Arun Suresh updated YARN-6963: -- Issue Type: Improvement (was: Bug) > Prevent other containers from staring when a container is

[jira] [Created] (YARN-6963) Prevent other containers from staring when a container is re-initializing

2017-08-07 Thread Arun Suresh (JIRA)
Arun Suresh created YARN-6963: - Summary: Prevent other containers from staring when a container is re-initializing Key: YARN-6963 URL: https://issues.apache.org/jira/browse/YARN-6963 Project: Hadoop YARN

[jira] [Commented] (YARN-4161) Capacity Scheduler : Assign single or multiple containers per heart beat driven by configuration

2017-08-07 Thread Wangda Tan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-4161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117289#comment-16117289 ] Wangda Tan commented on YARN-4161: -- Test failure is not related, pushing this to branch-2. > Capacity

[jira] [Commented] (YARN-6240) TestCapacityScheduler.testRefreshQueuesWithQueueDelete fails randomly

2017-08-07 Thread Wangda Tan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117288#comment-16117288 ] Wangda Tan commented on YARN-6240: -- +1 to push this patch to code base. I saw this UT failure occurred

[jira] [Commented] (YARN-4161) Capacity Scheduler : Assign single or multiple containers per heart beat driven by configuration

2017-08-07 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-4161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117270#comment-16117270 ] Hadoop QA commented on YARN-4161: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-6668) Use cgroup to get container resource utilization

2017-08-07 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117264#comment-16117264 ] Haibo Chen commented on YARN-6668: -- Thanks [~miklos.szeg...@cloudera.com] for the update! I have a few

[jira] [Commented] (YARN-6920) Fix TestNMClient failure due to YARN-6706

2017-08-07 Thread Jian He (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6920?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117260#comment-16117260 ] Jian He commented on YARN-6920: --- [~asuresh], yep, I'm fine to fix it in a separate jira. I would think the

[jira] [Commented] (YARN-6033) Add support for sections in container-executor configuration file

2017-08-07 Thread Miklos Szegedi (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6033?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117247#comment-16117247 ] Miklos Szegedi commented on YARN-6033: -- [~wangda], sorry there is an issue in the latest change:

[jira] [Commented] (YARN-6900) ZooKeeper based implementation of the FederationStateStore

2017-08-07 Thread Subru Krishnan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117220#comment-16117220 ] Subru Krishnan commented on YARN-6900: -- [~elgoiri], thanks for refactoring, it looks good. Since it's

[jira] [Commented] (YARN-6920) Fix TestNMClient failure due to YARN-6706

2017-08-07 Thread Arun Suresh (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6920?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117208#comment-16117208 ] Arun Suresh commented on YARN-6920: --- Ping [~haibo.chen] / [~jianhe]. Do you guys want me to make the

[jira] [Commented] (YARN-6897) Refactoring RMWebServices by moving some util methods to RMWebAppUtil

2017-08-07 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117188#comment-16117188 ] Hadoop QA commented on YARN-6897: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-6852) [YARN-6223] Native code changes to support isolate GPU devices by using CGroups

2017-08-07 Thread Miklos Szegedi (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117175#comment-16117175 ] Miklos Szegedi commented on YARN-6852: -- Thank you [~wangda], I have a second batch. I could not find a

[jira] [Commented] (YARN-65) Reduce RM app memory footprint once app has completed

2017-08-07 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-65?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117174#comment-16117174 ] Hadoop QA commented on YARN-65: --- | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem || Runtime

[jira] [Updated] (YARN-6900) ZooKeeper based implementation of the FederationStateStore

2017-08-07 Thread Inigo Goiri (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Inigo Goiri updated YARN-6900: -- Attachment: YARN-6900-003.patch Created {{ZKManager}}. Let me know what are the thought on the

[jira] [Assigned] (YARN-6879) TestLeafQueue.testDRFUserLimits() has commented out code

2017-08-07 Thread Daniel Templeton (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Templeton reassigned YARN-6879: -- Assignee: Angela Wang (was: Daniel Templeton) > TestLeafQueue.testDRFUserLimits() has

[jira] [Commented] (YARN-6875) New aggregated log file format for YARN log aggregation.

2017-08-07 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6875?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117123#comment-16117123 ] Jason Lowe commented on YARN-6875: -- I'm not clear on how the local index file would work in practice. One

[jira] [Commented] (YARN-6033) Add support for sections in container-executor configuration file

2017-08-07 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6033?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117084#comment-16117084 ] Hadoop QA commented on YARN-6033: - | (/) *{color:green}+1 overall{color}* | \\ \\ || Vote || Subsystem ||

  1   2   >