[jira] [Commented] (YARN-3480) Recovery may get very slow with lots of services with lots of app-attempts

2015-05-08 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14536112#comment-14536112 ] Jun Gong commented on YARN-3480: [~vinodkv] Thanks for the suggestions. {quote} Part of wh

[jira] [Commented] (YARN-3480) Recovery may get very slow with lots of services with lots of app-attempts

2015-05-20 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553451#comment-14553451 ] Jun Gong commented on YARN-3480: {quote} Without doing this, we will unnecessarily be forci

[jira] [Created] (YARN-3712) ContainersLauncher: handle event CLEANUP_CONTAINER asynchronously

2015-05-25 Thread Jun Gong (JIRA)
Jun Gong created YARN-3712: -- Summary: ContainersLauncher: handle event CLEANUP_CONTAINER asynchronously Key: YARN-3712 URL: https://issues.apache.org/jira/browse/YARN-3712 Project: Hadoop YARN Issu

[jira] [Updated] (YARN-3712) ContainersLauncher: handle event CLEANUP_CONTAINER asynchronously

2015-05-25 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-3712: --- Attachment: YARN-3712.01.patch > ContainersLauncher: handle event CLEANUP_CONTAINER asynchronously > --

[jira] [Updated] (YARN-3712) ContainersLauncher: handle event CLEANUP_CONTAINER asynchronously

2015-05-26 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-3712: --- Attachment: YARN-3712.02.patch Fix checkstyle warnings. > ContainersLauncher: handle event CLEANUP_CONTAINER a

[jira] [Commented] (YARN-3712) ContainersLauncher: handle event CLEANUP_CONTAINER asynchronously

2015-05-27 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3712?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14560632#comment-14560632 ] Jun Gong commented on YARN-3712: [~sidharta-s] [~ashahab] Thanks for the suggestion. I am

[jira] [Commented] (YARN-3712) ContainersLauncher: handle event CLEANUP_CONTAINER asynchronously

2015-05-27 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3712?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14560679#comment-14560679 ] Jun Gong commented on YARN-3712: [~vinodkv] Our case: NM receives a event SHUTDOWN, and s

[jira] [Updated] (YARN-3712) ContainersLauncher: handle event CLEANUP_CONTAINER asynchronously

2015-05-27 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-3712: --- Priority: Minor (was: Major) > ContainersLauncher: handle event CLEANUP_CONTAINER asynchronously > ---

[jira] [Assigned] (YARN-3644) Node manager shuts down if unable to connect with RM

2015-05-27 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong reassigned YARN-3644: -- Assignee: Jun Gong (was: Raju Bairishetti) > Node manager shuts down if unable to connect with RM > ---

[jira] [Commented] (YARN-3644) Node manager shuts down if unable to connect with RM

2015-05-27 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14562209#comment-14562209 ] Jun Gong commented on YARN-3644: Sorry, by mistake... > Node manager shuts down if unable

[jira] [Commented] (YARN-3644) Node manager shuts down if unable to connect with RM

2015-05-27 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14562210#comment-14562210 ] Jun Gong commented on YARN-3644: Sorry, by mistake... > Node manager shuts down if unable

[jira] [Updated] (YARN-3644) Node manager shuts down if unable to connect with RM

2015-05-27 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-3644: --- Assignee: Raju Bairishetti (was: Jun Gong) > Node manager shuts down if unable to connect with RM > --

[jira] [Created] (YARN-3809) Failed to launch new attempts because ApplicationMasterLauncher's threads all hang

2015-06-15 Thread Jun Gong (JIRA)
Jun Gong created YARN-3809: -- Summary: Failed to launch new attempts because ApplicationMasterLauncher's threads all hang Key: YARN-3809 URL: https://issues.apache.org/jira/browse/YARN-3809 Project: Hadoop YA

[jira] [Commented] (YARN-3809) Failed to launch new attempts because ApplicationMasterLauncher's threads all hang

2015-06-15 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14587474#comment-14587474 ] Jun Gong commented on YARN-3809: How about setting thread pool size in ApplicationMasterLau

[jira] [Commented] (YARN-3809) Failed to launch new attempts because ApplicationMasterLauncher's threads all hang

2015-06-15 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14587555#comment-14587555 ] Jun Gong commented on YARN-3809: The stack is as following: {noformat} 2015-06-15 11:16:35,

[jira] [Updated] (YARN-3809) Failed to launch new attempts because ApplicationMasterLauncher's threads all hang

2015-06-16 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-3809: --- Attachment: YARN-3809.01.patch Attach a patch. Make thread pool size configurable, and default size is 50. > F

[jira] [Commented] (YARN-3809) Failed to launch new attempts because ApplicationMasterLauncher's threads all hang

2015-06-16 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14589164#comment-14589164 ] Jun Gong commented on YARN-3809: The checkstyle error is : YarnConfiguration.java: File len

[jira] [Commented] (YARN-3809) Failed to launch new attempts because ApplicationMasterLauncher's threads all hang

2015-06-17 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14591093#comment-14591093 ] Jun Gong commented on YARN-3809: [~devaraj.k] and [~kasha], thank you for the comments and

[jira] [Created] (YARN-3831) Localization failed when a local disk turns from bad to good without NM initializes it

2015-06-19 Thread Jun Gong (JIRA)
Jun Gong created YARN-3831: -- Summary: Localization failed when a local disk turns from bad to good without NM initializes it Key: YARN-3831 URL: https://issues.apache.org/jira/browse/YARN-3831 Project: Hadoo

[jira] [Updated] (YARN-3809) Failed to launch new attempts because ApplicationMasterLauncher's threads all hang

2015-06-19 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-3809: --- Attachment: YARN-3809.02.patch > Failed to launch new attempts because ApplicationMasterLauncher's threads all

[jira] [Commented] (YARN-3809) Failed to launch new attempts because ApplicationMasterLauncher's threads all hang

2015-06-19 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14593452#comment-14593452 ] Jun Gong commented on YARN-3809: [~jlowe] Thanks for explanation and suggestions. I misunde

[jira] [Created] (YARN-3833) TestWorkPreservingRMRestart#testSchedulerRecovery fails in trunk

2015-06-19 Thread Jun Gong (JIRA)
Jun Gong created YARN-3833: -- Summary: TestWorkPreservingRMRestart#testSchedulerRecovery fails in trunk Key: YARN-3833 URL: https://issues.apache.org/jira/browse/YARN-3833 Project: Hadoop YARN Issue

[jira] [Resolved] (YARN-3833) TestWorkPreservingRMRestart#testSchedulerRecovery fails in trunk

2015-06-19 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3833?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong resolved YARN-3833. Resolution: Duplicate > TestWorkPreservingRMRestart#testSchedulerRecovery fails in trunk > --

[jira] [Commented] (YARN-3833) TestWorkPreservingRMRestart#testSchedulerRecovery fails in trunk

2015-06-19 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3833?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14593476#comment-14593476 ] Jun Gong commented on YARN-3833: [~brahmareddy] Thank you. Closing it now. > TestWorkPrese

[jira] [Commented] (YARN-3809) Failed to launch new attempts because ApplicationMasterLauncher's threads all hang

2015-06-19 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14594300#comment-14594300 ] Jun Gong commented on YARN-3809: [~jlowe] Thank you for the very detailed suggestions. It h

[jira] [Updated] (YARN-3809) Failed to launch new attempts because ApplicationMasterLauncher's threads all hang

2015-06-19 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-3809: --- Attachment: YARN-3809.03.patch > Failed to launch new attempts because ApplicationMasterLauncher's threads all

[jira] [Commented] (YARN-3809) Failed to launch new attempts because ApplicationMasterLauncher's threads all hang

2015-06-19 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14594424#comment-14594424 ] Jun Gong commented on YARN-3809: Attach a new patch to address [~jlowe] 's suggestions. Tha

[jira] [Commented] (YARN-3831) Localization failed when a local disk turns from bad to good without NM initializes it

2015-06-23 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3831?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14597616#comment-14597616 ] Jun Gong commented on YARN-3831: [~zxu], thank you for the remind. Sorry for late reply. T

[jira] [Resolved] (YARN-3831) Localization failed when a local disk turns from bad to good without NM initializes it

2015-06-23 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong resolved YARN-3831. Resolution: Not A Problem > Localization failed when a local disk turns from bad to good without NM > initia

[jira] [Commented] (YARN-3809) Failed to launch new attempts because ApplicationMasterLauncher's threads all hang

2015-06-23 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14597622#comment-14597622 ] Jun Gong commented on YARN-3809: Same as previous explanation, checkstyle and test case err

[jira] [Commented] (YARN-3809) Failed to launch new attempts because ApplicationMasterLauncher's threads all hang

2015-06-24 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14600489#comment-14600489 ] Jun Gong commented on YARN-3809: Thanks [~rohithsharma], [~devaraj.k] and [~kasha] for comm

[jira] [Created] (YARN-5286) Add RPC port info in RM web service's response when getting app status

2016-06-21 Thread Jun Gong (JIRA)
Jun Gong created YARN-5286: -- Summary: Add RPC port info in RM web service's response when getting app status Key: YARN-5286 URL: https://issues.apache.org/jira/browse/YARN-5286 Project: Hadoop YARN

[jira] [Updated] (YARN-5286) Add RPC port info in RM web service's response when getting app status

2016-06-22 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5286: --- Attachment: YARN-5286.01.patch Attach a patch to fix it. > Add RPC port info in RM web service's response when

[jira] [Commented] (YARN-5290) ResourceManager can place more containers on a node than the node size allows

2016-06-23 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15346297#comment-15346297 ] Jun Gong commented on YARN-5290: Thanks [~jlowe] for reporting the issue! We came across t

[jira] [Commented] (YARN-5168) Add port mapping handling when docker container use bridge network

2016-06-23 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15346309#comment-15346309 ] Jun Gong commented on YARN-5168: Thanks [~sidharta-s] for the comments. Agree with that it

[jira] [Updated] (YARN-4148) When killing app, RM releases app's resource before they are released by NM

2016-06-28 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-4148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-4148: --- Assignee: Jason Lowe (was: Jun Gong) > When killing app, RM releases app's resource before they are released b

[jira] [Commented] (YARN-4148) When killing app, RM releases app's resource before they are released by NM

2016-06-28 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-4148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15352762#comment-15352762 ] Jun Gong commented on YARN-4148: Sorry for late. Thanks [~jlowe] for your patch, the patch

[jira] [Updated] (YARN-5286) Add RPC port info in RM web service's response when getting app status

2016-06-28 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5286: --- Attachment: YARN-5286.02.patch > Add RPC port info in RM web service's response when getting app status > -

[jira] [Commented] (YARN-5286) Add RPC port info in RM web service's response when getting app status

2016-06-28 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15353118#comment-15353118 ] Jun Gong commented on YARN-5286: Attach a new patch to fix test case error. > Add RPC port

[jira] [Commented] (YARN-5286) Add RPC port info in RM web service's response when getting app status

2016-06-28 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15354088#comment-15354088 ] Jun Gong commented on YARN-5286: Test case error is not related and addressed in YARN-5240.

[jira] [Updated] (YARN-5286) Add RPC port info in RM web service's response when getting app status

2016-07-02 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5286: --- Attachment: YARN-5286.03.patch > Add RPC port info in RM web service's response when getting app status > -

[jira] [Commented] (YARN-5286) Add RPC port info in RM web service's response when getting app status

2016-07-02 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15360042#comment-15360042 ] Jun Gong commented on YARN-5286: Thanks [~varun_saxena] for the review and comments! Attac

[jira] [Updated] (YARN-5286) Add RPC port info in RM web service's response when getting app status

2016-07-03 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5286: --- Attachment: YARN-5286.04.patch Fix checkstyle error and test case error. > Add RPC port info in RM web service

[jira] [Commented] (YARN-5286) Add RPC port info in RM web service's response when getting app status

2016-07-03 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15360899#comment-15360899 ] Jun Gong commented on YARN-5286: Test cases errors are not related. > Add RPC port info in

[jira] [Updated] (YARN-5286) Add RPC port info in RM web service's response when getting app status

2016-07-04 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5286: --- Attachment: YARN-5286.05.patch > Add RPC port info in RM web service's response when getting app status > -

[jira] [Commented] (YARN-5286) Add RPC port info in RM web service's response when getting app status

2016-07-04 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15361896#comment-15361896 ] Jun Gong commented on YARN-5286: Thanks [~varun_saxena] for the review and comments. Attac

[jira] [Commented] (YARN-5286) Add RPC port info in RM web service's response when getting app status

2016-07-05 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15363590#comment-15363590 ] Jun Gong commented on YARN-5286: Thanks [~varun_saxena] for the review, suggestions and com

[jira] [Commented] (YARN-5276) print more info when event queue is blocked

2016-07-05 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15363852#comment-15363852 ] Jun Gong commented on YARN-5276: I think it might be enough to add some debug information.

[jira] [Created] (YARN-5333) apps are rejected when RM HA

2016-07-07 Thread Jun Gong (JIRA)
Jun Gong created YARN-5333: -- Summary: apps are rejected when RM HA Key: YARN-5333 URL: https://issues.apache.org/jira/browse/YARN-5333 Project: Hadoop YARN Issue Type: Bug Reporter: Jun

[jira] [Updated] (YARN-5333) Recovered apps are rejected when RM HA

2016-07-07 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5333: --- Summary: Recovered apps are rejected when RM HA (was: apps are rejected when RM HA) > Recovered apps are reje

[jira] [Updated] (YARN-5333) Recovered apps are rejected when RM HA

2016-07-07 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5333: --- Description: Enable RM HA and use FairScheduler, {{yarn.scheduler.fair.allow-undeclared-pools}} is set to fals

[jira] [Updated] (YARN-5333) Recovered apps are rejected when RM HA

2016-07-07 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5333: --- Description: Enable RM HA and use FairScheduler, {{yarn.scheduler.fair.allow-undeclared-pools}} is set to fals

[jira] [Commented] (YARN-5333) Recovered apps are rejected when RM HA

2016-07-08 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15367714#comment-15367714 ] Jun Gong commented on YARN-5333: Thanks [~vinodkv] for the comments. I checked the code,

[jira] [Commented] (YARN-5318) testRefreshNodesResourceWithFileSystemBasedConfigurationProvider may fail

2016-07-08 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15367806#comment-15367806 ] Jun Gong commented on YARN-5318: Thanks [~sandflee] for reporting the issue. I also saw thi

[jira] [Assigned] (YARN-5318) testRefreshNodesResourceWithFileSystemBasedConfigurationProvider may fail

2016-07-08 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong reassigned YARN-5318: -- Assignee: Jun Gong > testRefreshNodesResourceWithFileSystemBasedConfigurationProvider may fail > ---

[jira] [Updated] (YARN-5318) testRefreshNodesResourceWithFileSystemBasedConfigurationProvider may fail

2016-07-08 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5318: --- Attachment: YARN-5318.01.patch > testRefreshNodesResourceWithFileSystemBasedConfigurationProvider may fail > --

[jira] [Commented] (YARN-5318) testRefreshNodesResourceWithFileSystemBasedConfigurationProvider may fail

2016-07-08 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15367826#comment-15367826 ] Jun Gong commented on YARN-5318: I reproduced the issue with adding following change, the t

[jira] [Commented] (YARN-5318) TestRMAdminService#testRefreshNodesResourceWithFileSystemBasedConfigurationProvider fails intermittently.

2016-07-09 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15368975#comment-15368975 ] Jun Gong commented on YARN-5318: Thanks [~varun_saxena] for the review and commit! > TestR

[jira] [Assigned] (YARN-5043) TestAMRestart.testRMAppAttemptFailuresValidityInterval random fail

2016-07-09 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong reassigned YARN-5043: -- Assignee: Jun Gong > TestAMRestart.testRMAppAttemptFailuresValidityInterval random fail > --

[jira] [Updated] (YARN-5333) Recovered apps are rejected when RM HA

2016-07-12 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5333: --- Attachment: YARN-5333.01.patch > Recovered apps are rejected when RM HA > -

[jira] [Commented] (YARN-5333) Recovered apps are rejected when RM HA

2016-07-12 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15373126#comment-15373126 ] Jun Gong commented on YARN-5333: Sorry for my mistakes: 1. We changed some code in our code

[jira] [Updated] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-07-12 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5333: --- Description: Enable RM HA and use FairScheduler, {{yarn.scheduler.fair.allow-undeclared-pools}} is set to fals

[jira] [Updated] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-07-12 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5333: --- Summary: Some recovered apps are put into default queue when RM HA (was: Recovered apps are rejected when RM H

[jira] [Updated] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-07-13 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5333: --- Attachment: YARN-5333.02.patch > Some recovered apps are put into default queue when RM HA > --

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-07-13 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15375019#comment-15375019 ] Jun Gong commented on YARN-5333: Add a test case in the new patch to reproduce the problem.

[jira] [Created] (YARN-5372) TestRMWebServicesAppsModification fails in trunk

2016-07-13 Thread Jun Gong (JIRA)
Jun Gong created YARN-5372: -- Summary: TestRMWebServicesAppsModification fails in trunk Key: YARN-5372 URL: https://issues.apache.org/jira/browse/YARN-5372 Project: Hadoop YARN Issue Type: Test

[jira] [Resolved] (YARN-5372) TestRMWebServicesAppsModification fails in trunk

2016-07-13 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong resolved YARN-5372. Resolution: Not A Problem > TestRMWebServicesAppsModification fails in trunk > --

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-07-13 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15376219#comment-15376219 ] Jun Gong commented on YARN-5333: The reason for test case errors in TestRMWebServicesAppsMo

[jira] [Updated] (YARN-5043) TestAMRestart.testRMAppAttemptFailuresValidityInterval random fail

2016-07-14 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5043: --- Attachment: YARN-5043.01.patch > TestAMRestart.testRMAppAttemptFailuresValidityInterval random fail > -

[jira] [Commented] (YARN-5043) TestAMRestart.testRMAppAttemptFailuresValidityInterval random fail

2016-07-14 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15377184#comment-15377184 ] Jun Gong commented on YARN-5043: Attach a patch to fix the problem and delete unnecessary s

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-07-20 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15385978#comment-15385978 ] Jun Gong commented on YARN-5333: I verified it for CapacityScheduler: 1. Without the patch,

[jira] [Updated] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-07-20 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5333: --- Attachment: YARN-5333.03.patch > Some recovered apps are put into default queue when RM HA > --

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-07-20 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15386003#comment-15386003 ] Jun Gong commented on YARN-5333: Attach a new patch 03.patch to fix the test case error. C

[jira] [Commented] (YARN-5043) TestAMRestart.testRMAppAttemptFailuresValidityInterval random fail

2016-07-20 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15387192#comment-15387192 ] Jun Gong commented on YARN-5043: The whole process is as following: app attempt's status be

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-07-21 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15387462#comment-15387462 ] Jun Gong commented on YARN-5333: Thanks [~sunilg] for review and comments. I tested with n

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-07-21 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15387533#comment-15387533 ] Jun Gong commented on YARN-5333: {quote}Could you also please confirm that whether you have

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-07-21 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15387649#comment-15387649 ] Jun Gong commented on YARN-5333: {{refreshQueues}} will cause StandbyException, however {{

[jira] [Updated] (YARN-5043) TestAMRestart.testRMAppAttemptFailuresValidityInterval random fail

2016-07-21 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5043: --- Attachment: YARN-5043.02.patch > TestAMRestart.testRMAppAttemptFailuresValidityInterval random fail > -

[jira] [Commented] (YARN-5043) TestAMRestart.testRMAppAttemptFailuresValidityInterval random fail

2016-07-21 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15387782#comment-15387782 ] Jun Gong commented on YARN-5043: Thanks [~sunilg] for the review and comments. {quote} If

[jira] [Commented] (YARN-5043) TestAMRestart.testRMAppAttemptFailuresValidityInterval random fail

2016-07-21 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15388579#comment-15388579 ] Jun Gong commented on YARN-5043: Thanks [~sandflee]. As mentioned above, we should also wa

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-07-21 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15388710#comment-15388710 ] Jun Gong commented on YARN-5333: Thanks [~sunilg]. Yes, fail-fast seems better. {quote}

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-07-28 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15397352#comment-15397352 ] Jun Gong commented on YARN-5333: Sorry for late reply. Thanks [~rohithsharma], [~sunilg] an

[jira] [Updated] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-01 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5333: --- Attachment: YARN-5333.04.patch > Some recovered apps are put into default queue when RM HA > --

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-01 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15402120#comment-15402120 ] Jun Gong commented on YARN-5333: Thanks [~rohithsharma] for verifying it and suggestion! I

[jira] [Updated] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-01 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5333: --- Attachment: YARN-5333.05.patch > Some recovered apps are put into default queue when RM HA > --

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-01 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15403213#comment-15403213 ] Jun Gong commented on YARN-5333: Attach a new patch to fix checkstyle error. Test cases err

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-02 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15403741#comment-15403741 ] Jun Gong commented on YARN-5333: Thanks [~rohithsharma], [~jianhe] for the review and comme

[jira] [Comment Edited] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-02 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15403741#comment-15403741 ] Jun Gong edited comment on YARN-5333 at 8/2/16 10:43 AM: - Thanks [~

[jira] [Updated] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-02 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5333: --- Attachment: YARN-5333.06.patch > Some recovered apps are put into default queue when RM HA > --

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-02 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15404150#comment-15404150 ] Jun Gong commented on YARN-5333: Attach a new patch. According to the suggestion, I abstra

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-03 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15405503#comment-15405503 ] Jun Gong commented on YARN-5333: Hi [~jianhe], I think the [comment|https://issues.apache.

[jira] [Updated] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-03 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5333: --- Attachment: YARN-5333.07.patch > Some recovered apps are put into default queue when RM HA > --

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-03 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15406060#comment-15406060 ] Jun Gong commented on YARN-5333: Thanks [~jianhe]. Attach a new patch to address above com

[jira] [Updated] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-03 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5333: --- Attachment: YARN-5333.08.patch Fix test case error... > Some recovered apps are put into default queue when RM

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-03 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15407041#comment-15407041 ] Jun Gong commented on YARN-5333: Thanks [~rohithsharma] for the review. bq. refreshXXXWith

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-03 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15407189#comment-15407189 ] Jun Gong commented on YARN-5333: Attach a new patch 09.patch. Rename {{refreshXXXWithoutCh

[jira] [Updated] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-03 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5333: --- Attachment: YARN-5333.09.patch > Some recovered apps are put into default queue when RM HA > --

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-04 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15407374#comment-15407374 ] Jun Gong commented on YARN-5333: Yes, I read comments in YARN-3893 again, I agree with it t

[jira] [Updated] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-04 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jun Gong updated YARN-5333: --- Attachment: YARN-5333.10.patch Attach a new patch 10.patch to address above problem. > Some recovered apps are

[jira] [Commented] (YARN-5333) Some recovered apps are put into default queue when RM HA

2016-08-04 Thread Jun Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-5333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15407442#comment-15407442 ] Jun Gong commented on YARN-5333: Hi [~sunilg], in order to reproduce the error case, we nee

<    1   2   3   4   5   >