[jira] [Commented] (YARN-9435) Add Opportunistic Scheduler metrics in ResourceManager.

2019-04-11 Thread Abhishek Modi (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16815133#comment-16815133 ] Abhishek Modi commented on YARN-9435: - Thanks [~giovanni.fumarola] for reviewing this. I have attached

[jira] [Updated] (YARN-9435) Add Opportunistic Scheduler metrics in ResourceManager.

2019-04-11 Thread Abhishek Modi (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Abhishek Modi updated YARN-9435: Attachment: YARN-9435.004.patch > Add Opportunistic Scheduler metrics in ResourceManager. >

[jira] [Commented] (YARN-9472) Add multi-thread asynchronous scheduling to fair scheduler

2019-04-11 Thread zhuqi (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9472?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16815238#comment-16815238 ] zhuqi commented on YARN-9472: - [~leftnoteasy] [~Tao Yang] Any suggestions are welcome. Thanks. > Add multi-thread

[jira] [Commented] (YARN-9140) Code cleanup in ResourcePluginManager.initialize and in TestResourcePluginManager

2019-04-11 Thread Adam Antal (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16815352#comment-16815352 ] Adam Antal commented on YARN-9140: -- Thanks for the patch [~snemeth], +1 (non-binding). > Code cleanup in

[jira] [Commented] (YARN-9435) Add Opportunistic Scheduler metrics in ResourceManager.

2019-04-11 Thread Abhishek Modi (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16815335#comment-16815335 ] Abhishek Modi commented on YARN-9435: - None of the findbugs warnings is related to the patch. > Add

[jira] [Updated] (YARN-9462) TestResourceTrackerService.testNodeRemovalGracefully fails sporadically

2019-04-11 Thread Prabhu Joseph (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9462?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prabhu Joseph updated YARN-9462: Attachment: YARN-9462-001.patch > TestResourceTrackerService.testNodeRemovalGracefully fails

[jira] [Updated] (YARN-9462) TestResourceTrackerService.testNodeRemovalGracefully fails sporadically

2019-04-11 Thread Prabhu Joseph (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9462?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prabhu Joseph updated YARN-9462: Attachment: (was: YARN-9462-001.patch) > TestResourceTrackerService.testNodeRemovalGracefully

[jira] [Created] (YARN-9472) Add multi-thread asynchronous scheduling to fair scheduler

2019-04-11 Thread zhuqi (JIRA)
zhuqi created YARN-9472: --- Summary: Add multi-thread asynchronous scheduling to fair scheduler Key: YARN-9472 URL: https://issues.apache.org/jira/browse/YARN-9472 Project: Hadoop YARN Issue Type:

[jira] [Commented] (YARN-7319) java.net.UnknownHostException when trying contact node by hostname

2019-04-11 Thread Liu Shaohui (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16815232#comment-16815232 ] Liu Shaohui commented on YARN-7319: --- +1 for encountering the same problem when deploying hadoop on k8s

[jira] [Commented] (YARN-9052) Replace all MockRM submit method definitions with a builder

2019-04-11 Thread Adam Antal (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16815311#comment-16815311 ] Adam Antal commented on YARN-9052: -- Thanks for the patch [~snemeth], massive one! I wanted to have some

[jira] [Commented] (YARN-7721) TestContinuousScheduling fails sporadically with NPE

2019-04-11 Thread Adam Antal (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16815263#comment-16815263 ] Adam Antal commented on YARN-7721: -- {{SchedulerApplicationAttempt#getLastScheduledContainer}} is also

[jira] [Commented] (YARN-9430) Recovering containers does not check available resources on node

2019-04-11 Thread Szilard Nemeth (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16815280#comment-16815280 ] Szilard Nemeth commented on YARN-9430: -- As per our further offline discussion with [~adam.antal], our

[jira] [Commented] (YARN-9435) Add Opportunistic Scheduler metrics in ResourceManager.

2019-04-11 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16815254#comment-16815254 ] Hadoop QA commented on YARN-9435: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-9052) Replace all MockRM submit method definitions with a builder

2019-04-11 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16815321#comment-16815321 ] Hadoop QA commented on YARN-9052: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-9052) Replace all MockRM submit method definitions with a builder

2019-04-11 Thread Szilard Nemeth (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16815322#comment-16815322 ] Szilard Nemeth commented on YARN-9052: -- Hi [~adam.antal]! Thanks for the review! Yep, I agree that

[jira] [Commented] (YARN-9462) TestResourceTrackerService.testNodeRemovalGracefully fails sporadically

2019-04-11 Thread Prabhu Joseph (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9462?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16815413#comment-16815413 ] Prabhu Joseph commented on YARN-9462: - [~giovanni.fumarola] Could you please review this if you have

[jira] [Commented] (YARN-9430) Recovering containers does not check available resources on node

2019-04-11 Thread Szilard Nemeth (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16815477#comment-16815477 ] Szilard Nemeth commented on YARN-9430: -- Nope, I was not assigning this to myself on purpose, anyone

[jira] [Commented] (YARN-9337) GPU auto-discovery script runs even when the resource is given by hand

2019-04-11 Thread Adam Antal (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16815486#comment-16815486 ] Adam Antal commented on YARN-9337: -- Accidentally pushed a wrong patch, uploaded the correct one as v3. >

[jira] [Updated] (YARN-9052) Replace all MockRM submit method definitions with a builder

2019-04-11 Thread Szilard Nemeth (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Szilard Nemeth updated YARN-9052: - Attachment: YARN-9052.003.patch > Replace all MockRM submit method definitions with a builder >

[jira] [Commented] (YARN-7537) [Atsv2] load hbase configuration from filesystem rather than URL

2019-04-11 Thread Prabhu Joseph (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16815513#comment-16815513 ] Prabhu Joseph commented on YARN-7537: - Have used {{FileSystem.newInstance(uri, conf)}} which will

[jira] [Updated] (YARN-9473) [Umbrella] Support Vector Engine ( a new accelerator hardware) based on pluggable device framework

2019-04-11 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9473: --- Summary: [Umbrella] Support Vector Engine ( a new accelerator hardware) based on pluggable device

[jira] [Updated] (YARN-9337) GPU auto-discovery script runs even when the resource is given by hand

2019-04-11 Thread Adam Antal (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Antal updated YARN-9337: - Attachment: YARN-9337.002.patch > GPU auto-discovery script runs even when the resource is given by hand

[jira] [Commented] (YARN-9430) Recovering containers does not check available resources on node

2019-04-11 Thread Adam Antal (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16815422#comment-16815422 ] Adam Antal commented on YARN-9430: -- Thanks [~snemeth] for the recap. I have nothing more to add. I

[jira] [Created] (YARN-9473) [Umbrella] Support Vector Engine ( a new acceleration hardware) based on pluggable device framework

2019-04-11 Thread Zhankun Tang (JIRA)
Zhankun Tang created YARN-9473: -- Summary: [Umbrella] Support Vector Engine ( a new acceleration hardware) based on pluggable device framework Key: YARN-9473 URL: https://issues.apache.org/jira/browse/YARN-9473

[jira] [Updated] (YARN-7537) [Atsv2] load hbase configuration from filesystem rather than URL

2019-04-11 Thread Prabhu Joseph (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prabhu Joseph updated YARN-7537: Attachment: YARN-7537-03.patch > [Atsv2] load hbase configuration from filesystem rather than URL >

[jira] [Commented] (YARN-9462) TestResourceTrackerService.testNodeRemovalGracefully fails sporadically

2019-04-11 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9462?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16815406#comment-16815406 ] Hadoop QA commented on YARN-9462: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-7848) Force removal of docker containers that do not get removed on first try

2019-04-11 Thread Jim Brennan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16815434#comment-16815434 ] Jim Brennan commented on YARN-7848: --- Thanks for updating [~eyang]! lgtm. I am +1 on patch 004

[jira] [Updated] (YARN-9337) GPU auto-discovery script runs even when the resource is given by hand

2019-04-11 Thread Adam Antal (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Antal updated YARN-9337: - Attachment: YARN-9337.003.patch > GPU auto-discovery script runs even when the resource is given by hand

[jira] [Commented] (YARN-6929) yarn.nodemanager.remote-app-log-dir structure is not scalable

2019-04-11 Thread Prabhu Joseph (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6929?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16815939#comment-16815939 ] Prabhu Joseph commented on YARN-6929: - [~eyang] Thanks for reviewing again. Have few doubts with new

[jira] [Commented] (YARN-9453) Clean up code long if-else chain in ApplicationCLI#run

2019-04-11 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16815543#comment-16815543 ] Hadoop QA commented on YARN-9453: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-6929) yarn.nodemanager.remote-app-log-dir structure is not scalable

2019-04-11 Thread Prabhu Joseph (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6929?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16815996#comment-16815996 ] Prabhu Joseph commented on YARN-6929: - This looks good to me. Will change the code accordingly.

[jira] [Commented] (YARN-6929) yarn.nodemanager.remote-app-log-dir structure is not scalable

2019-04-11 Thread Eric Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6929?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16815991#comment-16815991 ] Eric Yang commented on YARN-6929: - {quote}Looks cluster_timestamp is missed, it is also required as the

[jira] [Commented] (YARN-7537) [Atsv2] load hbase configuration from filesystem rather than URL

2019-04-11 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16815581#comment-16815581 ] Hadoop QA commented on YARN-7537: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-2889) Limit the number of opportunistic container allocated per AM heartbeat

2019-04-11 Thread Íñigo Goiri (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2889?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16815648#comment-16815648 ] Íñigo Goiri commented on YARN-2889: --- Thanks [~abmodi] for the patch. A few comments: * Can we use

[jira] [Commented] (YARN-9337) GPU auto-discovery script runs even when the resource is given by hand

2019-04-11 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16815568#comment-16815568 ] Hadoop QA commented on YARN-9337: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-9339) Apps pending metric incorrect after moving app to a new queue

2019-04-11 Thread Íñigo Goiri (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9339?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16815641#comment-16815641 ] Íñigo Goiri commented on YARN-9339: --- Thanks [~abmodi] for the patch. A couple comments: * Why is

[jira] [Updated] (YARN-9435) Add Opportunistic Scheduler metrics in ResourceManager.

2019-04-11 Thread Giovanni Matteo Fumarola (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Giovanni Matteo Fumarola updated YARN-9435: --- Fix Version/s: 3.3.0 > Add Opportunistic Scheduler metrics in

[jira] [Commented] (YARN-9435) Add Opportunistic Scheduler metrics in ResourceManager.

2019-04-11 Thread Giovanni Matteo Fumarola (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16815713#comment-16815713 ] Giovanni Matteo Fumarola commented on YARN-9435: Thanks [~abmodi] for the patch. Committed

[jira] [Commented] (YARN-9435) Add Opportunistic Scheduler metrics in ResourceManager.

2019-04-11 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16815711#comment-16815711 ] Hudson commented on YARN-9435: -- FAILURE: Integrated in Jenkins build Hadoop-trunk-Commit #16384 (See

[jira] [Commented] (YARN-9448) Fix Opportunistic Scheduling for node local allocations.

2019-04-11 Thread Íñigo Goiri (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9448?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16815652#comment-16815652 ] Íñigo Goiri commented on YARN-9448: --- Thanks [~abmodi], what about extracting the {{allocationRequestId}}

[jira] [Commented] (YARN-9462) TestResourceTrackerService.testNodeRemovalGracefully fails sporadically

2019-04-11 Thread Giovanni Matteo Fumarola (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9462?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16815623#comment-16815623 ] Giovanni Matteo Fumarola commented on YARN-9462: Thanks [~Prabhu Joseph] for the patch. I

[jira] [Commented] (YARN-9052) Replace all MockRM submit method definitions with a builder

2019-04-11 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16815645#comment-16815645 ] Hadoop QA commented on YARN-9052: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-9339) Apps pending metric incorrect after moving app to a new queue

2019-04-11 Thread Abhishek Modi (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9339?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16815646#comment-16815646 ] Abhishek Modi commented on YARN-9339: - Thanks [~elgoiri] for review. I will address the comments in

[jira] [Commented] (YARN-7848) Force removal of docker containers that do not get removed on first try

2019-04-11 Thread Eric Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16815709#comment-16815709 ] Eric Yang commented on YARN-7848: - [~Jim_Brennan] Thank you for tips. [~ebadger] can you commit the

[jira] [Commented] (YARN-6929) yarn.nodemanager.remote-app-log-dir structure is not scalable

2019-04-11 Thread Eric Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6929?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16815815#comment-16815815 ] Eric Yang commented on YARN-6929: - [~Prabhu Joseph] {code}{aggregation_log_root} / / bucket_{suffix} /