[jira] [Comment Edited] (YARN-9948) Remove attempts that are beyond max-attempt limit from RMAppImpl

2019-11-05 Thread Jun Gong (Jira)
[ https://issues.apache.org/jira/browse/YARN-9948?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16967374#comment-16967374 ] Jun Gong edited comment on YARN-9948 at 11/5/19 9:31 AM: - [~ziqian hu] Thanks for

[jira] [Commented] (YARN-9948) Remove attempts that are beyond max-attempt limit from RMAppImpl

2019-11-05 Thread Jun Gong (Jira)
[ https://issues.apache.org/jira/browse/YARN-9948?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16967374#comment-16967374 ] Jun Gong commented on YARN-9948: [~ziqian hu] Thanks for the patch. How much memory does it consume? If

[jira] [Assigned] (YARN-5015) Unify restart policies across AM and container restarts

2018-02-06 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong reassigned YARN-5015: -- Assignee: Chandni Singh (was: Jun Gong) > Unify restart policies across AM and container restarts >

[jira] [Commented] (YARN-5015) Unify restart policies across AM and container restarts

2018-02-06 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16354785#comment-16354785 ] Jun Gong commented on YARN-5015: Hi [~csingh], I am busy with other things, sorry for it, please feel free

[jira] [Updated] (YARN-4122) Add support for GPU as a resource

2017-02-26 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-4122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-4122: --- Issue Type: Sub-task (was: New Feature) Parent: YARN-6223 > Add support for GPU as a resource >

[jira] [Commented] (YARN-4122) Add support for GPU as a resource

2017-02-26 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-4122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15885002#comment-15885002 ] Jun Gong commented on YARN-4122: [~leftnoteasy] It's OK. Move it now. > Add support for GPU as a resource

[jira] [Resolved] (YARN-4770) Auto-restart of containers should work across NM restarts.

2016-11-18 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-4770?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong resolved YARN-4770. Resolution: Not A Bug > Auto-restart of containers should work across NM restarts. >

[jira] [Commented] (YARN-4770) Auto-restart of containers should work across NM restarts.

2016-11-18 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-4770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15676781#comment-15676781 ] Jun Gong commented on YARN-4770: Hi [~jianhe], I just tested again and confirmed it: container would

[jira] [Commented] (YARN-4382) Container hierarchy in cgroup may remain for ever after the container have be terminated

2016-11-16 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-4382?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15670310#comment-15670310 ] Jun Gong commented on YARN-4382: {quote} You want to write the shell command or the script to release_agent

[jira] [Commented] (YARN-3998) Add support in the NodeManager to re-launch containers

2016-08-21 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15430039#comment-15430039 ] Jun Gong commented on YARN-3998: Thanks [~asuresh] for pointing it out. I did not notice it. Yes, the

[jira] [Commented] (YARN-5475) Test failed for TestAggregatedLogFormat on trunk

2016-08-16 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5475?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15423697#comment-15423697 ] Jun Gong commented on YARN-5475: Thanks [~varun_saxena] for the review and commit! Thanks [~djp] for

[jira] [Updated] (YARN-5475) Test failed for TestAggregatedLogFormat on trunk

2016-08-15 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5475?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5475: --- Attachment: YARN-5475.02.patch Fix checkstyle error. > Test failed for TestAggregatedLogFormat on trunk >

[jira] [Assigned] (YARN-5475) Test failed for TestAggregatedLogFormat on trunk

2016-08-15 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5475?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong reassigned YARN-5475: -- Assignee: Jun Gong > Test failed for TestAggregatedLogFormat on trunk >

[jira] [Updated] (YARN-5475) Test failed for TestAggregatedLogFormat on trunk

2016-08-12 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5475?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5475: --- Attachment: YARN-5475.01.patch In the patch, create a new configuration to avoid the change. > Test failed

[jira] [Commented] (YARN-5475) Test failed for TestAggregatedLogFormat on trunk

2016-08-12 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5475?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15419041#comment-15419041 ] Jun Gong commented on YARN-5475: Thanks [~GergelyNovak] for the analysis.

[jira] [Commented] (YARN-4910) Fix incomplete log info in ResourceLocalizationService

2016-08-08 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-4910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15412803#comment-15412803 ] Jun Gong commented on YARN-4910: Thanks [~varun_saxena] for the review and commit! > Fix incomplete log

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-05 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15410467#comment-15410467 ] Jun Gong commented on YARN-5333: Thanks [~rohithsharma], [~jianhe] and [~sunilg] > Some recovered apps

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-05 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15409275#comment-15409275 ] Jun Gong commented on YARN-5333: Test case errors are not related, addressed in YARN-5157 and YARN-5057. >

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-04 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15407442#comment-15407442 ] Jun Gong commented on YARN-5333: Hi [~sunilg], in order to reproduce the error case, we need to create some

[jira] [Updated] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-04 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5333: --- Attachment: YARN-5333.10.patch Attach a new patch 10.patch to address above problem. > Some recovered apps

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-04 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15407374#comment-15407374 ] Jun Gong commented on YARN-5333: Yes, I read comments in YARN-3893 again, I agree with it too. I'll update

[jira] [Updated] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-03 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5333: --- Attachment: YARN-5333.09.patch > Some recovered apps are put into default queue when RM HA >

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-03 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15407189#comment-15407189 ] Jun Gong commented on YARN-5333: Attach a new patch 09.patch. Rename {{refreshXXXWithoutCheck}} to

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-03 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15407041#comment-15407041 ] Jun Gong commented on YARN-5333: Thanks [~rohithsharma] for the review. bq. refreshXXXWithoutCheck does

[jira] [Updated] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-03 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5333: --- Attachment: YARN-5333.08.patch Fix test case error... > Some recovered apps are put into default queue when

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-03 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15406060#comment-15406060 ] Jun Gong commented on YARN-5333: Thanks [~jianhe]. Attach a new patch to address above comments. It also

[jira] [Updated] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-03 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5333: --- Attachment: YARN-5333.07.patch > Some recovered apps are put into default queue when RM HA >

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-03 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15405503#comment-15405503 ] Jun Gong commented on YARN-5333: Hi [~jianhe], I think the

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-02 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15404150#comment-15404150 ] Jun Gong commented on YARN-5333: Attach a new patch. According to the suggestion, I abstracted

[jira] [Updated] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-02 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5333: --- Attachment: YARN-5333.06.patch > Some recovered apps are put into default queue when RM HA >

[jira] [Comment Edited] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-02 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15403741#comment-15403741 ] Jun Gong edited comment on YARN-5333 at 8/2/16 10:43 AM: - Thanks [~rohithsharma],

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-02 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15403741#comment-15403741 ] Jun Gong commented on YARN-5333: Thanks [~rohithsharma], [~jianhe] for the review and comments! bq. 1.

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-01 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15403213#comment-15403213 ] Jun Gong commented on YARN-5333: Attached a new patch to fix the checkstyle error. Test case errors are not

[jira] [Updated] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-01 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5333: --- Attachment: YARN-5333.05.patch > Some recovered apps are put into default queue when RM HA >

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-01 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15402120#comment-15402120 ] Jun Gong commented on YARN-5333: Thanks [~rohithsharma] for verifying it and suggestion! I attached a new

[jira] [Updated] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-01 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5333: --- Attachment: YARN-5333.04.patch > Some recovered apps are put into default queue when RM HA >

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-07-28 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15397352#comment-15397352 ] Jun Gong commented on YARN-5333: Sorry for the late reply. Thanks [~rohithsharma], [~sunilg] and [~jianhe]'s

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-07-21 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15388710#comment-15388710 ] Jun Gong commented on YARN-5333: Thanks [~sunilg]. Yes, fail-fast seems better. {quote} However one

[jira] [Commented] (YARN-5043) TestAMRestart.testRMAppAttemptFailuresValidityInterval random fail

2016-07-21 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15388579#comment-15388579 ] Jun Gong commented on YARN-5043: Thanks [~sandflee]. As mentioned above, we should also wait for

[jira] [Commented] (YARN-5043) TestAMRestart.testRMAppAttemptFailuresValidityInterval random fail

2016-07-21 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15387782#comment-15387782 ] Jun Gong commented on YARN-5043: Thanks [~sunilg] for the review and comments. {quote} If I understood you

[jira] [Updated] (YARN-5043) TestAMRestart.testRMAppAttemptFailuresValidityInterval random fail

2016-07-21 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5043: --- Attachment: YARN-5043.02.patch > TestAMRestart.testRMAppAttemptFailuresValidityInterval random fail >

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-07-21 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15387649#comment-15387649 ] Jun Gong commented on YARN-5333: {{refreshQueues}} will cause StandbyException, however

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-07-21 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15387533#comment-15387533 ] Jun Gong commented on YARN-5333: {quote}Could you also please confirm that whether you have added new queue

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-07-21 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15387462#comment-15387462 ] Jun Gong commented on YARN-5333: Thanks [~sunilg] for review and comments. I tested with normal config

[jira] [Commented] (YARN-5043) TestAMRestart.testRMAppAttemptFailuresValidityInterval random fail

2016-07-20 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15387192#comment-15387192 ] Jun Gong commented on YARN-5043: The whole process is as follows: the app attempt's status becomes FAILED =>

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-07-20 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15386003#comment-15386003 ] Jun Gong commented on YARN-5333: Attach a new patch 03.patch to fix the test case error. Could someone

[jira] [Updated] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-07-20 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5333: --- Attachment: YARN-5333.03.patch > Some recovered apps are put into default queue when RM HA >

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-07-20 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15385978#comment-15385978 ] Jun Gong commented on YARN-5333: I verified it for CapacityScheduler: 1. Without the patch, apps that

[jira] [Commented] (YARN-5043) TestAMRestart.testRMAppAttemptFailuresValidityInterval random fail

2016-07-14 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15377184#comment-15377184 ] Jun Gong commented on YARN-5043: Attach a patch to fix the problem and delete unnecessary sleeps. >

[jira] [Updated] (YARN-5043) TestAMRestart.testRMAppAttemptFailuresValidityInterval random fail

2016-07-14 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5043: --- Attachment: YARN-5043.01.patch > TestAMRestart.testRMAppAttemptFailuresValidityInterval random fail >

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-07-13 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15376219#comment-15376219 ] Jun Gong commented on YARN-5333: The reason for test case errors in TestRMWebServicesAppsModification(e.g.

[jira] [Resolved] (YARN-5372) TestRMWebServicesAppsModification fails in trunk

2016-07-13 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong resolved YARN-5372. Resolution: Not A Problem > TestRMWebServicesAppsModification fails in trunk >

[jira] [Created] (YARN-5372) TestRMWebServicesAppsModification fails in trunk

2016-07-13 Thread Jun Gong (JIRA)
Jun Gong created YARN-5372: -- Summary: TestRMWebServicesAppsModification fails in trunk Key: YARN-5372 URL: https://issues.apache.org/jira/browse/YARN-5372 Project: Hadoop YARN Issue Type: Test

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-07-13 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15375019#comment-15375019 ] Jun Gong commented on YARN-5333: Add a test case in the new patch to reproduce the problem. > Some

[jira] [Updated] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-07-13 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5333: --- Attachment: YARN-5333.02.patch > Some recovered apps are put into default queue when RM HA >

[jira] [Updated] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-07-12 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5333: --- Summary: Some recovered apps are put into default queue when RM HA (was: Recovered apps are rejected when RM

[jira] [Updated] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-07-12 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5333: --- Description: Enable RM HA and use FairScheduler, {{yarn.scheduler.fair.allow-undeclared-pools}} is set to

[jira] [Commented] (YARN-5333) Recovered apps are rejected when RM HA

2016-07-12 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15373126#comment-15373126 ] Jun Gong commented on YARN-5333: Sorry for my mistakes: 1. We changed some code in our code, so that apps

[jira] [Updated] (YARN-5333) Recovered apps are rejected when RM HA

2016-07-12 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5333: --- Attachment: YARN-5333.01.patch > Recovered apps are rejected when RM HA >

[jira] [Assigned] (YARN-5043) TestAMRestart.testRMAppAttemptFailuresValidityInterval random fail

2016-07-09 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong reassigned YARN-5043: -- Assignee: Jun Gong > TestAMRestart.testRMAppAttemptFailuresValidityInterval random fail >

[jira] [Commented] (YARN-5318) TestRMAdminService#testRefreshNodesResourceWithFileSystemBasedConfigurationProvider fails intermittently.

2016-07-09 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15368975#comment-15368975 ] Jun Gong commented on YARN-5318: Thanks [~varun_saxena] for the review and commit! >

[jira] [Commented] (YARN-5318) testRefreshNodesResourceWithFileSystemBasedConfigurationProvider may fail

2016-07-08 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15367826#comment-15367826 ] Jun Gong commented on YARN-5318: I reproduced the issue by adding the following change; the test will fail

[jira] [Updated] (YARN-5318) testRefreshNodesResourceWithFileSystemBasedConfigurationProvider may fail

2016-07-08 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5318: --- Attachment: YARN-5318.01.patch > testRefreshNodesResourceWithFileSystemBasedConfigurationProvider may fail >

[jira] [Assigned] (YARN-5318) testRefreshNodesResourceWithFileSystemBasedConfigurationProvider may fail

2016-07-08 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong reassigned YARN-5318: -- Assignee: Jun Gong > testRefreshNodesResourceWithFileSystemBasedConfigurationProvider may fail >

[jira] [Commented] (YARN-5318) testRefreshNodesResourceWithFileSystemBasedConfigurationProvider may fail

2016-07-08 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15367806#comment-15367806 ] Jun Gong commented on YARN-5318: Thanks [~sandflee] for reporting the issue. I also saw this problem before

[jira] [Commented] (YARN-5333) Recovered apps are rejected when RM HA

2016-07-08 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15367714#comment-15367714 ] Jun Gong commented on YARN-5333: Thanks [~vinodkv] for the comments. I checked the code, IIUC

[jira] [Updated] (YARN-5333) Recovered apps are rejected when RM HA

2016-07-07 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5333: --- Description: Enable RM HA and use FairScheduler, {{yarn.scheduler.fair.allow-undeclared-pools}} is set to

[jira] [Updated] (YARN-5333) Recovered apps are rejected when RM HA

2016-07-07 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5333: --- Description: Enable RM HA and use FairScheduler, {{yarn.scheduler.fair.allow-undeclared-pools}} is set to

[jira] [Updated] (YARN-5333) Recovered apps are rejected when RM HA

2016-07-07 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5333: --- Summary: Recovered apps are rejected when RM HA (was: apps are rejected when RM HA) > Recovered apps are

[jira] [Created] (YARN-5333) apps are rejected when RM HA

2016-07-07 Thread Jun Gong (JIRA)
Jun Gong created YARN-5333: -- Summary: apps are rejected when RM HA Key: YARN-5333 URL: https://issues.apache.org/jira/browse/YARN-5333 Project: Hadoop YARN Issue Type: Bug Reporter: Jun

[jira] [Commented] (YARN-5276) print more info when event queue is blocked

2016-07-06 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15363852#comment-15363852 ] Jun Gong commented on YARN-5276: I think it might be enough to add some debug information. When the size of

[jira] [Commented] (YARN-5286) Add RPC port info in RM web service's response when getting app status

2016-07-05 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15363590#comment-15363590 ] Jun Gong commented on YARN-5286: Thanks [~varun_saxena] for the review, suggestions and commit! I think it

[jira] [Commented] (YARN-5286) Add RPC port info in RM web service's response when getting app status

2016-07-04 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15361896#comment-15361896 ] Jun Gong commented on YARN-5286: Thanks [~varun_saxena] for the review and comments. Attach a new patch to

[jira] [Updated] (YARN-5286) Add RPC port info in RM web service's response when getting app status

2016-07-04 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5286: --- Attachment: YARN-5286.05.patch > Add RPC port info in RM web service's response when getting app status >

[jira] [Commented] (YARN-5286) Add RPC port info in RM web service's response when getting app status

2016-07-04 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15360899#comment-15360899 ] Jun Gong commented on YARN-5286: Test case errors are not related. > Add RPC port info in RM web

[jira] [Updated] (YARN-5286) Add RPC port info in RM web service's response when getting app status

2016-07-03 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5286: --- Attachment: YARN-5286.04.patch Fix checkstyle error and test case error. > Add RPC port info in RM web

[jira] [Commented] (YARN-5286) Add RPC port info in RM web service's response when getting app status

2016-07-02 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15360042#comment-15360042 ] Jun Gong commented on YARN-5286: Thanks [~varun_saxena] for the review and comments! Attach a new patch to

[jira] [Updated] (YARN-5286) Add RPC port info in RM web service's response when getting app status

2016-07-02 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5286: --- Attachment: YARN-5286.03.patch > Add RPC port info in RM web service's response when getting app status >

[jira] [Commented] (YARN-5286) Add RPC port info in RM web service's response when getting app status

2016-06-28 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15354088#comment-15354088 ] Jun Gong commented on YARN-5286: Test case error is not related and addressed in YARN-5240. Checkstyle is

[jira] [Commented] (YARN-5286) Add RPC port info in RM web service's response when getting app status

2016-06-28 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15353118#comment-15353118 ] Jun Gong commented on YARN-5286: Attach a new patch to fix test case error. > Add RPC port info in RM web

[jira] [Updated] (YARN-5286) Add RPC port info in RM web service's response when getting app status

2016-06-28 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5286: --- Attachment: YARN-5286.02.patch > Add RPC port info in RM web service's response when getting app status >

[jira] [Commented] (YARN-4148) When killing app, RM releases app's resource before they are released by NM

2016-06-28 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-4148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15352762#comment-15352762 ] Jun Gong commented on YARN-4148: Sorry for the late reply. Thanks [~jlowe] for your patch; the patch is more

[jira] [Updated] (YARN-4148) When killing app, RM releases app's resource before they are released by NM

2016-06-28 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-4148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-4148: --- Assignee: Jason Lowe (was: Jun Gong) > When killing app, RM releases app's resource before they are released

[jira] [Commented] (YARN-5168) Add port mapping handling when docker container use bridge network

2016-06-23 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15346309#comment-15346309 ] Jun Gong commented on YARN-5168: Thanks [~sidharta-s] for the comments. I agree that it will be a large

[jira] [Commented] (YARN-5290) ResourceManager can place more containers on a node than the node size allows

2016-06-23 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15346297#comment-15346297 ] Jun Gong commented on YARN-5290: Thanks [~jlowe] for reporting the issue! We came across the issue some

[jira] [Updated] (YARN-5286) Add RPC port info in RM web service's response when getting app status

2016-06-22 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5286: --- Attachment: YARN-5286.01.patch Attach a patch to fix it. > Add RPC port info in RM web service's response

[jira] [Created] (YARN-5286) Add RPC port info in RM web service's response when getting app status

2016-06-21 Thread Jun Gong (JIRA)
Jun Gong created YARN-5286: -- Summary: Add RPC port info in RM web service's response when getting app status Key: YARN-5286 URL: https://issues.apache.org/jira/browse/YARN-5286 Project: Hadoop YARN

[jira] [Commented] (YARN-5168) Add port mapping handling when docker container use bridge network

2016-05-30 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15306589#comment-15306589 ] Jun Gong commented on YARN-5168: {quote} What will be the additional info sent in the context? {quote}

[jira] [Commented] (YARN-5168) Add port mapping handling when docker container use bridge network

2016-05-30 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15306580#comment-15306580 ] Jun Gong commented on YARN-5168: It might be good to create some sub tasks, however I'd like to hear more

[jira] [Commented] (YARN-5178) yarn application never can be killed when failover resource manager

2016-05-28 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305329#comment-15305329 ] Jun Gong commented on YARN-5178: Thanks [~tuyuri] for sharing the logs. I analyzed rs2 log, and found it

[jira] [Commented] (YARN-4007) Add support for different network setups when launching the docker container

2016-05-26 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-4007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15303500#comment-15303500 ] Jun Gong commented on YARN-4007: As discussed in YARN-5168, for host network, port mapping might conflict

[jira] [Commented] (YARN-5168) Add port mapping handling when docker container use bridge network

2016-05-26 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15303493#comment-15303493 ] Jun Gong commented on YARN-5168: {quote} 1. For item3, do you mean adding DNS like service registry in YARN

[jira] [Commented] (YARN-5116) Failed to execute "yarn application"

2016-05-26 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15303390#comment-15303390 ] Jun Gong commented on YARN-5116: Thanks [~eepayne] for the review and commit! Thanks [~aw] for the review!

[jira] [Commented] (YARN-4007) Add support for different network setups when launching the docker container

2016-05-26 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-4007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15303385#comment-15303385 ] Jun Gong commented on YARN-4007: Yes, we faced same problems and solved it by "-p", we could discuss more

[jira] [Commented] (YARN-4007) Add support for different network setups when launching the docker container

2016-05-26 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-4007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15303384#comment-15303384 ] Jun Gong commented on YARN-4007: Filed YARN-5168 to address it. > Add support for different network setups

[jira] [Updated] (YARN-5168) Add port mapping handling when docker container use bridge network

2016-05-26 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5168?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5168: --- Parent Issue: YARN-3611 (was: YARN-5079) > Add port mapping handling when docker container use bridge network

[jira] [Created] (YARN-5168) Add port mapping handling when docker container use bridge network

2016-05-26 Thread Jun Gong (JIRA)
Jun Gong created YARN-5168: -- Summary: Add port mapping handling when docker container use bridge network Key: YARN-5168 URL: https://issues.apache.org/jira/browse/YARN-5168 Project: Hadoop YARN

[jira] [Commented] (YARN-4007) Add support for different network setups when launching the docker container

2016-05-26 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-4007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15302243#comment-15302243 ] Jun Gong commented on YARN-4007: Thanks [~sidharta-s] for the patch and [~vvasudev]'s review! When we use

[jira] [Assigned] (YARN-5154) DelayedProcessKiller can kill the wrong process if pid is recycled

2016-05-26 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong reassigned YARN-5154: -- Assignee: Jun Gong > DelayedProcessKiller can kill the wrong process if pid is recycled >

[jira] [Commented] (YARN-4459) container-executor should only kill process groups

2016-05-25 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-4459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15301522#comment-15301522 ] Jun Gong commented on YARN-4459: Thanks [~jlowe] for the patch, review and commit! Thanks [~Naganarasimha],
