[jira] [Updated] (YARN-7199) TestAMRMClientContainerRequest.testOpportunisticAndGuaranteedRequests is failing in trunk

2017-09-14 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-7199: --- Attachment: YARN-7199.v1.patch > TestAMRMClientContainerRequest.testOpportunisticAndGuaranteedRequests

[jira] [Assigned] (YARN-7199) TestAMRMClientContainerRequest.testOpportunisticAndGuaranteedRequests is failing in trunk

2017-09-14 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang reassigned YARN-7199: -- Assignee: Botong Huang > TestAMRMClientContainerRequest.testOpportunisticAndGuaranteedRequests

[jira] [Commented] (YARN-7102) NM heartbeat stuck when responseId overflows MAX_INT

2017-09-14 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167000#comment-16167000 ] Botong Huang commented on YARN-7102: After fighting through unit tests... in v6 patch:

[jira] [Created] (YARN-7199) TestAMRMClientContainerRequest.testOpportunisticAndGuaranteedRequests is failing in trunk

2017-09-14 Thread Botong Huang (JIRA)
Botong Huang created YARN-7199: -- Summary: TestAMRMClientContainerRequest.testOpportunisticAndGuaranteedRequests is failing in trunk Key: YARN-7199 URL: https://issues.apache.org/jira/browse/YARN-7199

[jira] [Updated] (YARN-7102) NM heartbeat stuck when responseId overflows MAX_INT

2017-09-14 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-7102: --- Attachment: YARN-7102.v6.patch > NM heartbeat stuck when responseId overflows MAX_INT >

[jira] [Updated] (YARN-7102) NM heartbeat stuck when responseId overflows MAX_INT

2017-09-13 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-7102: --- Attachment: YARN-7102.v5.patch > NM heartbeat stuck when responseId overflows MAX_INT >

[jira] [Updated] (YARN-7102) NM heartbeat stuck when responseId overflows MAX_INT

2017-09-13 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-7102: --- Attachment: YARN-7102.v4.patch > NM heartbeat stuck when responseId overflows MAX_INT >

[jira] [Comment Edited] (YARN-7102) NM heartbeat stuck when responseId overflows MAX_INT

2017-09-11 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16161623#comment-16161623 ] Botong Huang edited comment on YARN-7102 at 9/11/17 5:23 PM: - V3 updated, fix

[jira] [Updated] (YARN-7102) NM heartbeat stuck when responseId overflows MAX_INT

2017-09-11 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-7102: --- Attachment: YARN-7102.v3.patch V3 updated, fix more unit test failures around {{MiniYarnCluster}} >

[jira] [Updated] (YARN-7102) NM heartbeat stuck when responseId overflows MAX_INT

2017-09-07 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-7102: --- Attachment: YARN-7102.v2.patch Some explanations since v2 patch is much bigger. This change revealed

[jira] [Updated] (YARN-7102) NM heartbeat stuck when responseId overflows MAX_INT

2017-08-31 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-7102: --- Attachment: YARN-7102.v1.patch > NM heartbeat stuck when responseId overflows MAX_INT >

[jira] [Commented] (YARN-6640) AM heartbeat stuck when responseId overflows MAX_INT

2017-08-25 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16142312#comment-16142312 ] Botong Huang commented on YARN-6640: Hi [~vinodkv], yes, the same issue exists in the NM->RM heartbeat as well.

[jira] [Created] (YARN-7102) NM heartbeat stuck when responseId overflows MAX_INT

2017-08-25 Thread Botong Huang (JIRA)
Botong Huang created YARN-7102: -- Summary: NM heartbeat stuck when responseId overflows MAX_INT Key: YARN-7102 URL: https://issues.apache.org/jira/browse/YARN-7102 Project: Hadoop YARN Issue

[jira] [Commented] (YARN-6640) AM heartbeat stuck when responseId overflows MAX_INT

2017-08-25 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16141785#comment-16141785 ] Botong Huang commented on YARN-6640: Great, thanks [~jlowe], [~leftnoteasy] for the review and advice!

[jira] [Commented] (YARN-7074) Fix NM state store update comment

2017-08-24 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7074?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16140175#comment-16140175 ] Botong Huang commented on YARN-7074: Cool, thanks [~bibinchundatt] and [~kasha]! > Fix NM state store

[jira] [Commented] (YARN-6640) AM heartbeat stuck when responseId overflows MAX_INT

2017-08-23 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16139055#comment-16139055 ] Botong Huang commented on YARN-6640: Unit test failure is irrelevant and being tracked under YARN-7044.

[jira] [Commented] (YARN-7074) Fix NM state store update comment

2017-08-23 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7074?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16139038#comment-16139038 ] Botong Huang commented on YARN-7074: Hi [~kasha], can you help me commit this one? Thanks in advance!

[jira] [Commented] (YARN-6640) AM heartbeat stuck when responseId overflows MAX_INT

2017-08-23 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16138600#comment-16138600 ] Botong Huang commented on YARN-6640: Sure, v2 patch uploaded. Thanks! > AM heartbeat stuck when

[jira] [Updated] (YARN-6640) AM heartbeat stuck when responseId overflows MAX_INT

2017-08-23 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-6640: --- Attachment: YARN-6640.v2.patch > AM heartbeat stuck when responseId overflows MAX_INT >

[jira] [Commented] (YARN-6640) AM heartbeat stuck when responseId overflows MAX_INT

2017-08-22 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16137825#comment-16137825 ] Botong Huang commented on YARN-6640: Hi [~wangda], I think [~jlowe]'s approach already handles the case

[jira] [Comment Edited] (YARN-6798) Fix NM startup failure with old state store due to version mismatch

2017-08-22 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16137679#comment-16137679 ] Botong Huang edited comment on YARN-6798 at 8/23/17 12:35 AM: -- Thanks [~kasha]

[jira] [Commented] (YARN-6798) Fix NM startup failure with old state store due to version mismatch

2017-08-22 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16137679#comment-16137679 ] Botong Huang commented on YARN-6798: Thanks [~kasha] for catching it. I've created YARN-7074 to fix the

[jira] [Updated] (YARN-7074) Fix NM state store update comment

2017-08-22 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-7074: --- Description: A follow up of YARN-6798 to fix a typo. (was: A follow up of YARN-6798. ) > Fix NM

[jira] [Updated] (YARN-7074) Fix NM state store update comment

2017-08-22 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-7074: --- Attachment: YARN-7074.v1.patch > Fix NM state store update comment > -

[jira] [Updated] (YARN-7074) Fix NM state store update comment

2017-08-22 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-7074: --- Component/s: nodemanager > Fix NM state store update comment > - > >

[jira] [Updated] (YARN-7074) Fix NM state store update comment

2017-08-22 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-7074: --- Description: A follow up of YARN-6798. > Fix NM state store update comment >

[jira] [Created] (YARN-7074) Fix NM state store update comment

2017-08-22 Thread Botong Huang (JIRA)
Botong Huang created YARN-7074: -- Summary: Fix NM state store update comment Key: YARN-7074 URL: https://issues.apache.org/jira/browse/YARN-7074 Project: Hadoop YARN Issue Type: Bug

[jira] [Commented] (YARN-6640) AM heartbeat stuck when responseId overflows MAX_INT

2017-08-22 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16137448#comment-16137448 ] Botong Huang commented on YARN-6640: [~wangda] ([~jianhe] and [~asuresh]), can you please take a look

[jira] [Updated] (YARN-6704) Add Federation Interceptor restart when work preserving NM is enabled

2017-08-11 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-6704: --- Attachment: YARN-6704.v4.patch > Add Federation Interceptor restart when work preserving NM is enabled

[jira] [Updated] (YARN-6704) Add Federation Interceptor restart when work preserving NM is enabled

2017-08-11 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-6704: --- Attachment: YARN-6704.v3.patch v3 patch update: separate out reAttachUAM from launchUAM. Addressed

[jira] [Commented] (YARN-6955) Handle concurrent register AM requests in FederationInterceptor

2017-08-07 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16117596#comment-16117596 ] Botong Huang commented on YARN-6955: Thanks [~subru]! > Handle concurrent register AM requests in

[jira] [Updated] (YARN-6962) Add support for updateContainers when allocating using FederationInterceptor

2017-08-07 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-6962: --- Description: Container update is introduced in YARN-5221. Federation Interceptor needs to support it

[jira] [Updated] (YARN-6962) Add support for updateContainers when allocating using FederationInterceptor

2017-08-07 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-6962: --- Summary: Add support for updateContainers when allocating using FederationInterceptor (was:

[jira] [Commented] (YARN-6955) Concurrent registerAM thread in Federation Interceptor

2017-08-07 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16116965#comment-16116965 ] Botong Huang commented on YARN-6955: The unit test failures are irrelevant. > Concurrent registerAM

[jira] [Updated] (YARN-6962) Federation interceptor should support full allocate request/response api

2017-08-07 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-6962: --- Attachment: YARN-6962.v1.patch > Federation interceptor should support full allocate request/response

[jira] [Updated] (YARN-6962) Federation interceptor should support full allocate request/response api

2017-08-07 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-6962: --- Issue Type: Sub-task (was: Bug) Parent: YARN-5597 > Federation interceptor should support

[jira] [Created] (YARN-6962) Federation interceptor should support full allocate request/response api

2017-08-07 Thread Botong Huang (JIRA)
Botong Huang created YARN-6962: -- Summary: Federation interceptor should support full allocate request/response api Key: YARN-6962 URL: https://issues.apache.org/jira/browse/YARN-6962 Project: Hadoop

[jira] [Updated] (YARN-6955) Concurrent registerAM thread in Federation Interceptor

2017-08-05 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6955?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-6955: --- Attachment: YARN-6955.v2.patch > Concurrent registerAM thread in Federation Interceptor >

[jira] [Updated] (YARN-6955) Concurrent registerAM thread in Federation Interceptor

2017-08-04 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6955?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-6955: --- Attachment: YARN-6955.v1.patch > Concurrent registerAM thread in Federation Interceptor >

[jira] [Updated] (YARN-6955) Concurrent registerAM thread in Federation Interceptor

2017-08-04 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6955?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-6955: --- Attachment: (was: YARN-6955.v1.patch) > Concurrent registerAM thread in Federation Interceptor >

[jira] [Updated] (YARN-6955) Concurrent registerAM thread in Federation Interceptor

2017-08-04 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6955?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-6955: --- Attachment: YARN-6955.v1.patch > Concurrent registerAM thread in Federation Interceptor >

[jira] [Created] (YARN-6955) Concurrent registerAM thread in Federation Interceptor

2017-08-04 Thread Botong Huang (JIRA)
Botong Huang created YARN-6955: -- Summary: Concurrent registerAM thread in Federation Interceptor Key: YARN-6955 URL: https://issues.apache.org/jira/browse/YARN-6955 Project: Hadoop YARN Issue

[jira] [Commented] (YARN-6932) Fix TestFederationRMFailoverProxyProvider test case

2017-08-03 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6932?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16112817#comment-16112817 ] Botong Huang commented on YARN-6932: Thanks [~subru] for the patch. I've investigated a bit, it is

[jira] [Assigned] (YARN-6924) Metrics for Federation AMRMProxy

2017-08-01 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6924?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang reassigned YARN-6924: -- Assignee: Botong Huang > Metrics for Federation AMRMProxy > >

[jira] [Commented] (YARN-6853) Add MySql Scripts for FederationStateStore

2017-07-31 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6853?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16107841#comment-16107841 ] Botong Huang commented on YARN-6853: Thanks for the patch, I've tested the v4 patch on a small

[jira] [Commented] (YARN-6853) Add MySql Scripts for FederationStateStore

2017-07-31 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6853?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16107817#comment-16107817 ] Botong Huang commented on YARN-6853: Thanks [~giovanni.fumarola] for the patch! Two quick questions:

[jira] [Commented] (YARN-6902) Update Microsoft JDBC Driver for SQL Server version in License.txt

2017-07-28 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6902?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16106010#comment-16106010 ] Botong Huang commented on YARN-6902: Thanks [~subru]! > Update Microsoft JDBC Driver for SQL Server

[jira] [Updated] (YARN-6902) Update SQL server note in License.txt

2017-07-28 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6902?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-6902: --- Attachment: (was: YARN-6902-YARN-2915.patch) > Update SQL server note in License.txt >

[jira] [Updated] (YARN-6902) Update SQL server note in License.txt

2017-07-28 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6902?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-6902: --- Attachment: YARN-6902-YARN-2915.v1.patch > Update SQL server note in License.txt >

[jira] [Updated] (YARN-6902) Update SQL server note in License.txt

2017-07-28 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6902?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-6902: --- Attachment: YARN-6902-YARN-2915.patch > Update SQL server note in License.txt >

[jira] [Updated] (YARN-6902) Update SQL server note in License.txt

2017-07-28 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6902?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-6902: --- Issue Type: Sub-task (was: Task) Parent: YARN-2915 > Update SQL server note in License.txt >

[jira] [Created] (YARN-6902) Update SQL server note in License.txt

2017-07-28 Thread Botong Huang (JIRA)
Botong Huang created YARN-6902: -- Summary: Update SQL server note in License.txt Key: YARN-6902 URL: https://issues.apache.org/jira/browse/YARN-6902 Project: Hadoop YARN Issue Type: Task

[jira] [Commented] (YARN-6866) Minor clean-up and fixes in anticipation of YARN-2915 merge with trunk

2017-07-26 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6866?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16101931#comment-16101931 ] Botong Huang commented on YARN-6866: Thanks [~subru] and [~curino] for the editing and review! I have

[jira] [Updated] (YARN-6866) Minor clean-up and fixes in anticipation of merge with trunk

2017-07-25 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6866?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-6866: --- Attachment: YARN-6866-YARN-2915.v2.patch > Minor clean-up and fixes in anticipation of merge with

[jira] [Updated] (YARN-6866) Minor clean-up and fixes in anticipation of merge with trunk

2017-07-25 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6866?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-6866: --- Attachment: YARN-6866-YARN-2915.v1.patch > Minor clean-up and fixes in anticipation of merge with

[jira] [Commented] (YARN-6798) Fix NM startup failure with old state store due to version mismatch

2017-07-18 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16092308#comment-16092308 ] Botong Huang commented on YARN-6798: Thanks [~rchiang]! > Fix NM startup failure with old state store

[jira] [Commented] (YARN-6704) Add Federation Interceptor restart when work preserving NM is enabled

2017-07-16 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6704?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16089203#comment-16089203 ] Botong Huang commented on YARN-6704: Thanks [~subru] for the review! Please see below. 1. The reason

[jira] [Commented] (YARN-6798) NM startup failure with old state store due to version mismatch

2017-07-14 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16088088#comment-16088088 ] Botong Huang commented on YARN-6798: Sounds good, thx! > NM startup failure with old state store due

[jira] [Updated] (YARN-6704) Add Federation Interceptor restart when work preserving NM is enabled

2017-07-12 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-6704: --- Attachment: YARN-6704-YARN-2915.v2.patch > Add Federation Interceptor restart when work preserving NM

[jira] [Commented] (YARN-6798) NM startup failure with old state store due to version mismatch

2017-07-12 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16084596#comment-16084596 ] Botong Huang commented on YARN-6798: Yeah, I guess we need to decide to go with 1.1 or 2.1. > NM

[jira] [Updated] (YARN-6798) NM startup failure with old state store due to version mismatch

2017-07-11 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6798?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-6798: --- Attachment: YARN-6798.v1.patch v1 patch uploaded that rolls back the version to 1.1, with added notes. What

[jira] [Updated] (YARN-6704) Add Federation Interceptor restart when work preserving NM is enabled

2017-07-06 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-6704: --- Attachment: YARN-6704-YARN-2915.v1.patch > Add Federation Interceptor restart when work preserving NM

[jira] [Commented] (YARN-6127) Add support for work preserving NM restart when AMRMProxy is enabled

2017-06-23 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16061173#comment-16061173 ] Botong Huang commented on YARN-6127: Thanks [~asuresh] and [~subru] for the review! > Add support for

[jira] [Updated] (YARN-6127) Add support for work preserving NM restart when AMRMProxy is enabled

2017-06-22 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-6127: --- Attachment: YARN-6127-branch-2.v1.patch > Add support for work preserving NM restart when AMRMProxy is

[jira] [Commented] (YARN-6127) Add support for work preserving NM restart when AMRMProxy is enabled

2017-06-22 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16059809#comment-16059809 ] Botong Huang commented on YARN-6127: Thanks [~asuresh]! YARN-6730 created to follow up. > Add support

[jira] [Created] (YARN-6730) Make sure NM state store is not null consistently

2017-06-22 Thread Botong Huang (JIRA)
Botong Huang created YARN-6730: -- Summary: Make sure NM state store is not null consistently Key: YARN-6730 URL: https://issues.apache.org/jira/browse/YARN-6730 Project: Hadoop YARN Issue Type:

[jira] [Updated] (YARN-6127) Add support for work preserving NM restart when AMRMProxy is enabled

2017-06-21 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-6127: --- Attachment: YARN-6127.v4.patch > Add support for work preserving NM restart when AMRMProxy is enabled

[jira] [Updated] (YARN-6127) Add support for work preserving NM restart when AMRMProxy is enabled

2017-06-21 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-6127: --- Attachment: YARN-6127.v3.patch Thanks [~asuresh] for the comments. v3 uploaded: NMSS main version

[jira] [Updated] (YARN-6704) Add Federation Interceptor restart when work preserving NM is enabled

2017-06-09 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-6704: --- Issue Type: Sub-task (was: Task) Parent: YARN-2915 > Add Federation Interceptor restart when

[jira] [Created] (YARN-6704) Add Federation Interceptor restart when work preserving NM is enabled

2017-06-09 Thread Botong Huang (JIRA)
Botong Huang created YARN-6704: -- Summary: Add Federation Interceptor restart when work preserving NM is enabled Key: YARN-6704 URL: https://issues.apache.org/jira/browse/YARN-6704 Project: Hadoop YARN

[jira] [Commented] (YARN-6127) Add support for work preserving NM restart when AMRMProxy is enabled

2017-06-08 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16043635#comment-16043635 ] Botong Huang commented on YARN-6127: Unit test failures are not related:

[jira] [Comment Edited] (YARN-5655) TestContainerManagerSecurity#testNMTokens is asserting

2017-06-08 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16043620#comment-16043620 ] Botong Huang edited comment on YARN-5655 at 6/8/17 11:29 PM: - Hi [~rkanter] and

[jira] [Commented] (YARN-5655) TestContainerManagerSecurity#testNMTokens is asserting

2017-06-08 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16043620#comment-16043620 ] Botong Huang commented on YARN-5655: Hi [~rkanter] and [~templedf], I am also getting lots of NPE here.

[jira] [Updated] (YARN-6127) Add support for work preserving NM restart when AMRMProxy is enabled

2017-06-08 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-6127: --- Attachment: YARN-6127.v2.patch > Add support for work preserving NM restart when AMRMProxy is enabled

[jira] [Updated] (YARN-6127) Add support for work preserving NM restart when AMRMProxy is enabled

2017-06-07 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-6127: --- Attachment: YARN-6127.v1.patch > Add support for work preserving NM restart when AMRMProxy is enabled

[jira] [Commented] (YARN-6511) Federation: transparently spanning application across multiple sub-clusters

2017-06-07 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16041753#comment-16041753 ] Botong Huang commented on YARN-6511: Great! Thanks [~subru] and [~jianhe] for the review and quick

[jira] [Updated] (YARN-6511) Federation Intercepting and propagating AM-RM communications (part two: secondary subclusters added)

2017-06-06 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-6511: --- Attachment: YARN-6511-YARN-2915.v3.patch Thanks [~subru] for the review! I've addressed most comments

[jira] [Comment Edited] (YARN-6511) Federation Intercepting and propagating AM-RM communications (part two: secondary subclusters added)

2017-06-05 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16037364#comment-16037364 ] Botong Huang edited comment on YARN-6511 at 6/5/17 6:55 PM: Hi [~jianhe],

[jira] [Commented] (YARN-6511) Federation Intercepting and propagating AM-RM communications (part two: secondary subclusters added)

2017-06-05 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16037364#comment-16037364 ] Botong Huang commented on YARN-6511: Hi [~jianhe], thanks for the review! We only register UAM for the

[jira] [Updated] (YARN-6640) AM heartbeat stuck when responseId overflows MAX_INT

2017-06-02 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-6640: --- Attachment: YARN-6640.v1.patch > AM heartbeat stuck when responseId overflows MAX_INT >

[jira] [Updated] (YARN-6511) Federation Intercepting and propagating AM-RM communications (part two: secondary subclusters added)

2017-05-31 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-6511: --- Attachment: YARN-6511-YARN-2915.v2.patch > Federation Intercepting and propagating AM-RM

[jira] [Updated] (YARN-6511) Federation Intercepting and propagating AM-RM communications (part two: secondary subclusters added)

2017-05-31 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-6511: --- Attachment: YARN-6511-YARN-2915.v1.patch > Federation Intercepting and propagating AM-RM

[jira] [Commented] (YARN-3666) Federation Intercepting and propagating AM- home RM communications

2017-05-31 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3666?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16031930#comment-16031930 ] Botong Huang commented on YARN-3666: Great, thanks [~subru] and [~curino] for the feedback! @All

[jira] [Commented] (YARN-3666) Federation Intercepting and propagating AM-RM communications (part one: home RM only)

2017-05-31 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3666?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16031825#comment-16031825 ] Botong Huang commented on YARN-3666: Unit test

[jira] [Updated] (YARN-3666) Federation Intercepting and propagating AM-RM communications (part one: home RM only)

2017-05-31 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-3666: --- Attachment: YARN-3666-YARN-2915.v9.patch The test failure in

[jira] [Updated] (YARN-6667) Handle containerId duplicate without failing the heartbeat in Federation Interceptor

2017-05-30 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-6667: --- Summary: Handle containerId duplicate without failing the heartbeat in Federation Interceptor (was:

[jira] [Updated] (YARN-6667) Handle containerId duplicate without throwing in Federation Interceptor

2017-05-30 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-6667: --- Issue Type: Sub-task (was: Task) Parent: YARN-5597 > Handle containerId duplicate without

[jira] [Updated] (YARN-3666) Federation Intercepting and propagating AM-RM communications (part one: home RM only)

2017-05-30 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-3666: --- Attachment: YARN-3666-YARN-2915.v8.patch v8: comments addressed and rebased after Yarn- >

[jira] [Created] (YARN-6667) Handle containerId duplicate without throwing in Federation Interceptor

2017-05-30 Thread Botong Huang (JIRA)
Botong Huang created YARN-6667: -- Summary: Handle containerId duplicate without throwing in Federation Interceptor Key: YARN-6667 URL: https://issues.apache.org/jira/browse/YARN-6667 Project: Hadoop YARN

[jira] [Commented] (YARN-6666) Fix unit test failure in TestRouterClientRMService

2017-05-30 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16030103#comment-16030103 ] Botong Huang commented on YARN-: Thanks [~subru]! > Fix unit test failure in

[jira] [Updated] (YARN-6666) Fix unit test in TestRouterClientRMService

2017-05-30 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-: --- Attachment: YARN--YARN-2915.v2.patch Thanks [~subru] for the quick review! Fixed the minor check

[jira] [Commented] (YARN-3666) Federation Intercepting and propagating AM-RM communications (part one: home RM only)

2017-05-30 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3666?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16029974#comment-16029974 ] Botong Huang commented on YARN-3666: I've just opened YARN- for this failure:

[jira] [Commented] (YARN-6666) Fix unit test in TestRouterClientRMService

2017-05-30 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16029954#comment-16029954 ] Botong Huang commented on YARN-: I can, but for now it is not needed for unit tests yet. The

[jira] [Updated] (YARN-6666) Fix unit test in TestRouterClientRMService

2017-05-30 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-: --- Attachment: YARN--YARN-2915.v1.patch > Fix unit test in TestRouterClientRMService >

[jira] [Updated] (YARN-6666) Fix unit test in TestRouterClientRMService

2017-05-30 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-: --- Attachment: (was: YARN--YARN-2915.v1.patch) > Fix unit test in TestRouterClientRMService >

[jira] [Updated] (YARN-6666) Fix unit test in TestRouterClientRMService

2017-05-30 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-: --- Attachment: YARN--YARN-2915.v1.patch > Fix unit test in TestRouterClientRMService >

[jira] [Created] (YARN-6666) Fix unit test in TestRouterClientRMService

2017-05-30 Thread Botong Huang (JIRA)
Botong Huang created YARN-: -- Summary: Fix unit test in TestRouterClientRMService Key: YARN- URL: https://issues.apache.org/jira/browse/YARN- Project: Hadoop YARN Issue Type: Bug

[jira] [Updated] (YARN-3666) Federation Intercepting and propagating AM-RM communications (part one: home RM only)

2017-05-29 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-3666: --- Attachment: YARN-3666-YARN-2915.v7.patch > Federation Intercepting and propagating AM-RM

[jira] [Updated] (YARN-3666) Federation Intercepting and propagating AM-RM communications (part one: home RM only)

2017-05-26 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-3666: --- Attachment: YARN-3666-YARN-2915.v6.patch v6 patch uploaded, rebased after UAM patch (YARN-5531), also

[jira] [Commented] (YARN-5531) UnmanagedAM pool manager for federating application across clusters

2017-05-26 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5531?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16027032#comment-16027032 ] Botong Huang commented on YARN-5531: Thanks [~subru] and [~kasha] for all the review and detailed
