[jira] [Commented] (YARN-9180) Port YARN-7033 NM recovery of assigned resources to branch-3.0/branch-2

2019-01-31 Thread Jonathan Hung (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9180?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16758042#comment-16758042 ] Jonathan Hung commented on YARN-9180: - YARN-9180-YARN-8200.003 fixes unit test (original port of

[jira] [Comment Edited] (YARN-9180) Port YARN-7033 NM recovery of assigned resources to branch-3.0/branch-2

2019-01-31 Thread Jonathan Hung (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9180?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16758042#comment-16758042 ] Jonathan Hung edited comment on YARN-9180 at 2/1/19 7:28 AM: -

[jira] [Commented] (YARN-9186) [CSI] Upgrade CSI proto to v1.0

2019-01-31 Thread Weiwei Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16758040#comment-16758040 ] Weiwei Yang commented on YARN-9186: --- Hi [~sunilg] Could you help to review the patch. Changes is not as

[jira] [Updated] (YARN-9180) Port YARN-7033 NM recovery of assigned resources to branch-3.0/branch-2

2019-01-31 Thread Jonathan Hung (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Hung updated YARN-9180: Attachment: YARN-9180-YARN-8200.003.patch > Port YARN-7033 NM recovery of assigned resources to

[jira] [Updated] (YARN-9161) Absolute resources of capacity scheduler doesn't support GPU and FPGA

2019-01-31 Thread Zac Zhou (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zac Zhou updated YARN-9161: --- Attachment: YARN-9161.010.patch > Absolute resources of capacity scheduler doesn't support GPU and FPGA >

[jira] [Commented] (YARN-9150) Making TimelineSchemaCreator support different backends for Timeline Schema Creation in ATSv2

2019-01-31 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9150?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16758022#comment-16758022 ] Hadoop QA commented on YARN-9150: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-9180) Port YARN-7033 NM recovery of assigned resources to branch-3.0/branch-2

2019-01-31 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9180?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16758012#comment-16758012 ] Hadoop QA commented on YARN-9180: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-7129) Application Catalog for YARN applications

2019-01-31 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16758004#comment-16758004 ] Hadoop QA commented on YARN-7129: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Assigned] (YARN-6105) Support for new REST end point /clusterids

2019-01-31 Thread Sushil Ks (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sushil Ks reassigned YARN-6105: --- Assignee: Sushil Ks > Support for new REST end point /clusterids >

[jira] [Updated] (YARN-9150) Making TimelineSchemaCreator support different backends for Timeline Schema Creation in ATSv2

2019-01-31 Thread Sushil Ks (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9150?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sushil Ks updated YARN-9150: Attachment: YARN-9150-branch-2.002.patch > Making TimelineSchemaCreator support different backends for

[jira] [Commented] (YARN-9150) Making TimelineSchemaCreator support different backends for Timeline Schema Creation in ATSv2

2019-01-31 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9150?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757991#comment-16757991 ] Hadoop QA commented on YARN-9150: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Updated] (YARN-9150) Making TimelineSchemaCreator support different backends for Timeline Schema Creation in ATSv2

2019-01-31 Thread Sushil Ks (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9150?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sushil Ks updated YARN-9150: Attachment: YARN-9150-branch-2.001.patch > Making TimelineSchemaCreator support different backends for

[jira] [Commented] (YARN-3841) [Storage implementation] Adding retry semantics to HDFS backing storage

2019-01-31 Thread Vrushali C (JIRA)
[ https://issues.apache.org/jira/browse/YARN-3841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757979#comment-16757979 ] Vrushali C commented on YARN-3841: -- Thanks Abhishek for updating the patch after our last call! I think

[jira] [Commented] (YARN-9060) [YARN-8851] Phase 1 - Support device isolation and use the Nvidia GPU plugin as an example

2019-01-31 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9060?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757952#comment-16757952 ] Zhankun Tang commented on YARN-9060: [~sunilg] , Thanks for the review! {quote}In below comments, it's

[jira] [Updated] (YARN-9060) [YARN-8851] Phase 1 - Support device isolation and use the Nvidia GPU plugin as an example

2019-01-31 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9060: --- Attachment: YARN-9060-trunk.016.patch > [YARN-8851] Phase 1 - Support device isolation and use the

[jira] [Commented] (YARN-9180) Port YARN-7033 NM recovery of assigned resources to branch-3.0/branch-2

2019-01-31 Thread Jonathan Hung (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9180?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757920#comment-16757920 ] Jonathan Hung commented on YARN-9180: - Attached YARN-9180-YARN-8200.002 patch for fixing 

[jira] [Commented] (YARN-9161) Absolute resources of capacity scheduler doesn't support GPU and FPGA

2019-01-31 Thread Zac Zhou (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757911#comment-16757911 ] Zac Zhou commented on YARN-9161: Thank you [~sunilg] for triggering Jenkins. Distributedshell is ok now

[jira] [Commented] (YARN-9262) TestRMAppAttemptTransitions is failing with an NPE

2019-01-31 Thread lujie (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757916#comment-16757916 ] lujie commented on YARN-9262: - I am in the airport now, I will give the patch tonight. >

[jira] [Assigned] (YARN-9262) TestRMAppAttemptTransitions is failing with an NPE

2019-01-31 Thread lujie (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9262?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] lujie reassigned YARN-9262: --- Assignee: lujie > TestRMAppAttemptTransitions is failing with an NPE >

[jira] [Commented] (YARN-9206) RMServerUtils does not count SHUTDOWN as an accepted state

2019-01-31 Thread Sunil Govindan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757915#comment-16757915 ] Sunil Govindan commented on YARN-9206: -- +1. Test case failures are not related and tracked via

[jira] [Commented] (YARN-9194) Invalid event: REGISTERED and LAUNCH_FAILED at FAILED, and NullPointerException happens in RM while shutdown a NM

2019-01-31 Thread lujie (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9194?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757913#comment-16757913 ] lujie commented on YARN-9194: - Hi: [~leftnoteasy] and [~sunilg] Yeah I have found the error, and give the

[jira] [Updated] (YARN-9180) Port YARN-7033 NM recovery of assigned resources to branch-3.0/branch-2

2019-01-31 Thread Jonathan Hung (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Hung updated YARN-9180: Attachment: YARN-9180-YARN-8200.002.patch > Port YARN-7033 NM recovery of assigned resources to

[jira] [Comment Edited] (YARN-9194) Invalid event: REGISTERED and LAUNCH_FAILED at FAILED, and NullPointerException happens in RM while shutdown a NM

2019-01-31 Thread Sunil Govindan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9194?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757905#comment-16757905 ] Sunil Govindan edited comment on YARN-9194 at 2/1/19 2:46 AM: --

[jira] [Commented] (YARN-9194) Invalid event: REGISTERED and LAUNCH_FAILED at FAILED, and NullPointerException happens in RM while shutdown a NM

2019-01-31 Thread Sunil Govindan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9194?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757905#comment-16757905 ] Sunil Govindan commented on YARN-9194: --

[jira] [Created] (YARN-9262) TestRMAppAttemptTransitions is failing with an NPE

2019-01-31 Thread Sunil Govindan (JIRA)
Sunil Govindan created YARN-9262: Summary: TestRMAppAttemptTransitions is failing with an NPE Key: YARN-9262 URL: https://issues.apache.org/jira/browse/YARN-9262 Project: Hadoop YARN Issue

[jira] [Commented] (YARN-9191) Add cli option in DS to support enforceExecutionType in resource requests.

2019-01-31 Thread Abhishek Modi (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9191?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757897#comment-16757897 ] Abhishek Modi commented on YARN-9191: - Thanks [~elgoiri] and [~giovanni.fumarola]. > Add cli option

[jira] [Commented] (YARN-9188) Port YARN-7136 to branch-2

2019-01-31 Thread Jonathan Hung (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9188?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757879#comment-16757879 ] Jonathan Hung commented on YARN-9188: - Thanks [~asuresh]. Deprecation warnings are gone. The unit test 

[jira] [Resolved] (YARN-9261) Backport YARN-7270 addendum to YARN-8200

2019-01-31 Thread Jonathan Hung (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9261?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Hung resolved YARN-9261. - Resolution: Fixed Clean backport. Pushed to YARN-8200 > Backport YARN-7270 addendum to YARN-8200

[jira] [Created] (YARN-9261) Backport YARN-7270 addendum to YARN-8200

2019-01-31 Thread Jonathan Hung (JIRA)
Jonathan Hung created YARN-9261: --- Summary: Backport YARN-7270 addendum to YARN-8200 Key: YARN-9261 URL: https://issues.apache.org/jira/browse/YARN-9261 Project: Hadoop YARN Issue Type:

[jira] [Commented] (YARN-9246) NPE when executing a command yarn node -status or -states without additional arguments

2019-01-31 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757859#comment-16757859 ] Hadoop QA commented on YARN-9246: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Updated] (YARN-7129) Application Catalog for YARN applications

2019-01-31 Thread Eric Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Yang updated YARN-7129: Attachment: YARN-7129.019.patch > Application Catalog for YARN applications >

[jira] [Updated] (YARN-7129) Application Catalog for YARN applications

2019-01-31 Thread Eric Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Yang updated YARN-7129: Attachment: (was: YARN-7129.019.patch) > Application Catalog for YARN applications >

[jira] [Updated] (YARN-7129) Application Catalog for YARN applications

2019-01-31 Thread Eric Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Yang updated YARN-7129: Attachment: YARN-7129.019.patch > Application Catalog for YARN applications >

[jira] [Commented] (YARN-9246) NPE when executing a command yarn node -status or -states without additional arguments

2019-01-31 Thread Masahiro Tanaka (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757817#comment-16757817 ] Masahiro Tanaka commented on YARN-9246: --- Thanks [~suma.shivaprasad] for reviewing this. I uploaded a

[jira] [Updated] (YARN-9246) NPE when executing a command yarn node -status or -states without additional arguments

2019-01-31 Thread Masahiro Tanaka (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Masahiro Tanaka updated YARN-9246: -- Attachment: YARN-9246.002.patch > NPE when executing a command yarn node -status or -states

[jira] [Commented] (YARN-8967) Change FairScheduler to use PlacementRule interface

2019-01-31 Thread Wilfred Spiegelenburg (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8967?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757815#comment-16757815 ] Wilfred Spiegelenburg commented on YARN-8967: - [~sunilg] can you please have a review of this

[jira] [Commented] (YARN-9246) NPE when executing a command yarn node -status or -states without additional arguments

2019-01-31 Thread Suma Shivaprasad (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757794#comment-16757794 ] Suma Shivaprasad commented on YARN-9246: +1 for the patch. Can you fix checkstyle errors? > NPE

[jira] [Updated] (YARN-9260) Re-Launch ApplicationMasters That Fail With OOM Using Larger Container

2019-01-31 Thread BELUGA BEHR (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9260?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] BELUGA BEHR updated YARN-9260: -- Environment: (was: If an ApplicationMaster fails with an OOM, or is killed by YARN for using more

[jira] [Commented] (YARN-9260) Re-Launch ApplicationMasters That Fail With OOM Using Larger Container

2019-01-31 Thread BELUGA BEHR (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757787#comment-16757787 ] BELUGA BEHR commented on YARN-9260: --- Thanks for the input [~eepayne]. I could see a new configuration

[jira] [Updated] (YARN-9260) Re-Launch ApplicationMasters That Fail With OOM Using Larger Container

2019-01-31 Thread BELUGA BEHR (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9260?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] BELUGA BEHR updated YARN-9260: -- Description: If an ApplicationMaster fails with an OOM, or is killed by YARN for using more memory than

[jira] [Commented] (YARN-9246) NPE when executing a command yarn node -status or -states without additional arguments

2019-01-31 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757756#comment-16757756 ] Hadoop QA commented on YARN-9246: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-9206) RMServerUtils does not count SHUTDOWN as an accepted state

2019-01-31 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757754#comment-16757754 ] Hadoop QA commented on YARN-9206: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-9200) Enable resource configuration of queue capacity for different resources independently

2019-01-31 Thread Aihua Xu (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757740#comment-16757740 ] Aihua Xu commented on YARN-9200: [~sunilg] Since you worked on absolute resource configuration, wants to

[jira] [Commented] (YARN-9206) RMServerUtils does not count SHUTDOWN as an accepted state

2019-01-31 Thread Jim Brennan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757718#comment-16757718 ] Jim Brennan commented on YARN-9206: --- [~kshukla] thanks for the new patch. This looks good to me. +1

[jira] [Commented] (YARN-9060) [YARN-8851] Phase 1 - Support device isolation and use the Nvidia GPU plugin as an example

2019-01-31 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9060?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757694#comment-16757694 ] Hadoop QA commented on YARN-9060: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-9260) Re-Launch ApplicationMasters That Fail With OOM Using Larger Container

2019-01-31 Thread Eric Payne (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757690#comment-16757690 ] Eric Payne commented on YARN-9260: -- This may already be clear, but I just want to campaign for this

[jira] [Commented] (YARN-9246) NPE when executing a command yarn node -status or -states without additional arguments

2019-01-31 Thread Masahiro Tanaka (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757676#comment-16757676 ] Masahiro Tanaka commented on YARN-9246: --- I also found similar issues with these commands: {code}

[jira] [Commented] (YARN-9191) Add cli option in DS to support enforceExecutionType in resource requests.

2019-01-31 Thread Giovanni Matteo Fumarola (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9191?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757656#comment-16757656 ] Giovanni Matteo Fumarola commented on YARN-9191: Thanks [~abmodi] for the patch and

[jira] [Updated] (YARN-9191) Add cli option in DS to support enforceExecutionType in resource requests.

2019-01-31 Thread Giovanni Matteo Fumarola (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Giovanni Matteo Fumarola updated YARN-9191: --- Fix Version/s: 3.3.0 > Add cli option in DS to support enforceExecutionType

[jira] [Commented] (YARN-9191) Add cli option in DS to support enforceExecutionType in resource requests.

2019-01-31 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9191?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757655#comment-16757655 ] Hudson commented on YARN-9191: -- FAILURE: Integrated in Jenkins build Hadoop-trunk-Commit #15862 (See

[jira] [Updated] (YARN-9206) RMServerUtils does not count SHUTDOWN as an accepted state

2019-01-31 Thread Kuhu Shukla (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kuhu Shukla updated YARN-9206: -- Attachment: YARN-9206.004.patch > RMServerUtils does not count SHUTDOWN as an accepted state >

[jira] [Commented] (YARN-9186) [CSI] Upgrade CSI proto to v1.0

2019-01-31 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757627#comment-16757627 ] Hadoop QA commented on YARN-9186: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-9191) Add cli option in DS to support enforceExecutionType in resource requests.

2019-01-31 Thread Íñigo Goiri (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9191?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757576#comment-16757576 ] Íñigo Goiri commented on YARN-9191: --- +1 on [^YARN-9191.006.patch]. > Add cli option in DS to support

[jira] [Commented] (YARN-9135) NM State store ResourceMappings serialization are tested with Strings instead of real Device objects

2019-01-31 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757560#comment-16757560 ] Hadoop QA commented on YARN-9135: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-9135) NM State store ResourceMappings serialization are tested with Strings instead of real Device objects

2019-01-31 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757548#comment-16757548 ] Hadoop QA commented on YARN-9135: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-9123) Clean up and split testcases in TestNMWebServices for GPU support

2019-01-31 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757532#comment-16757532 ] Hadoop QA commented on YARN-9123: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Created] (YARN-9260) Re-Launch ApplicationMasters That Fail With OOM Using Larger Container

2019-01-31 Thread BELUGA BEHR (JIRA)
BELUGA BEHR created YARN-9260: - Summary: Re-Launch ApplicationMasters That Fail With OOM Using Larger Container Key: YARN-9260 URL: https://issues.apache.org/jira/browse/YARN-9260 Project: Hadoop YARN

[jira] [Commented] (YARN-9139) Simplify initializer code of GpuDiscoverer

2019-01-31 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757524#comment-16757524 ] Hadoop QA commented on YARN-9139: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Comment Edited] (YARN-9259) Assign ApplicationMaster (AM) Heap Memory Based on Container Size

2019-01-31 Thread BELUGA BEHR (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9259?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757518#comment-16757518 ] BELUGA BEHR edited comment on YARN-9259 at 1/31/19 5:14 PM: This goes

[jira] [Commented] (YARN-9260) Re-Launch ApplicationMasters That Fail With OOM Using Larger Container

2019-01-31 Thread BELUGA BEHR (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757528#comment-16757528 ] BELUGA BEHR commented on YARN-9260: --- When re-launching the container, the AM JVM Heap memory needs to be

[jira] [Commented] (YARN-9133) Make tests more easy to comprehend in TestGpuResourceHandler

2019-01-31 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757525#comment-16757525 ] Hadoop QA commented on YARN-9133: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-9259) Assign ApplicationMaster (AM) Heap Memory Based on Container Size

2019-01-31 Thread BELUGA BEHR (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9259?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757518#comment-16757518 ] BELUGA BEHR commented on YARN-9259: --- This goes hand-in-hand with another proposed Idea:

[jira] [Updated] (YARN-9259) Assign ApplicationMaster (AM) Heap Memory Based on Container Size

2019-01-31 Thread BELUGA BEHR (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9259?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] BELUGA BEHR updated YARN-9259: -- Summary: Assign ApplicationMaster (AM) Heap Memory Based on Container Size (was: Assign

[jira] [Created] (YARN-9259) Assign ApplicationMaster (AM) Memory Based on Container Size

2019-01-31 Thread BELUGA BEHR (JIRA)
BELUGA BEHR created YARN-9259: - Summary: Assign ApplicationMaster (AM) Memory Based on Container Size Key: YARN-9259 URL: https://issues.apache.org/jira/browse/YARN-9259 Project: Hadoop YARN

[jira] [Updated] (YARN-9135) NM State store ResourceMappings serialization are tested with Strings instead of real Device objects

2019-01-31 Thread Szilard Nemeth (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Szilard Nemeth updated YARN-9135: - Attachment: YARN-9135.003.patch > NM State store ResourceMappings serialization are tested with

[jira] [Commented] (YARN-9138) Test error handling of nvidia-smi binary execution of GpuDiscoverer

2019-01-31 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757453#comment-16757453 ] Hadoop QA commented on YARN-9138: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Updated] (YARN-9135) NM State store ResourceMappings serialization are tested with Strings instead of real Device objects

2019-01-31 Thread Szilard Nemeth (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Szilard Nemeth updated YARN-9135: - Attachment: (was: YARN-9135.002.patch) > NM State store ResourceMappings serialization are

[jira] [Commented] (YARN-9135) NM State store ResourceMappings serialization are tested with Strings instead of real Device objects

2019-01-31 Thread Szilard Nemeth (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757436#comment-16757436 ] Szilard Nemeth commented on YARN-9135: -- Hi [~pbacsko]! Thanks for the review! Very good point, we

[jira] [Updated] (YARN-9135) NM State store ResourceMappings serialization are tested with Strings instead of real Device objects

2019-01-31 Thread Szilard Nemeth (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Szilard Nemeth updated YARN-9135: - Attachment: YARN-9135.002.patch > NM State store ResourceMappings serialization are tested with

[jira] [Updated] (YARN-9060) [YARN-8851] Phase 1 - Support device isolation and use the Nvidia GPU plugin as an example

2019-01-31 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9060: --- Attachment: YARN-9060-trunk.015.patch > [YARN-8851] Phase 1 - Support device isolation and use the

[jira] [Updated] (YARN-9060) [YARN-8851] Phase 1 - Support device isolation and use the Nvidia GPU plugin as an example

2019-01-31 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9060: --- Attachment: YARN-9060-trunk.015.patch > [YARN-8851] Phase 1 - Support device isolation and use the

[jira] [Commented] (YARN-9258) DistributedShell PlacementSpec fails to parse

2019-01-31 Thread Prabhu Joseph (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757392#comment-16757392 ] Prabhu Joseph commented on YARN-9258: - Hi [~cheersyang], As per the doc

[jira] [Updated] (YARN-9123) Clean up and split testcases in TestNMWebServices for GPU support

2019-01-31 Thread Szilard Nemeth (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Szilard Nemeth updated YARN-9123: - Attachment: YARN-9123.004.patch > Clean up and split testcases in TestNMWebServices for GPU

[jira] [Commented] (YARN-9258) DistributedShell PlacementSpec fails to parse

2019-01-31 Thread Weiwei Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757408#comment-16757408 ] Weiwei Yang commented on YARN-9258: --- Ah thanks for pointing this out, I overlooked this one. It would be

[jira] [Updated] (YARN-9060) [YARN-8851] Phase 1 - Support device isolation and use the Nvidia GPU plugin as an example

2019-01-31 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9060: --- Attachment: (was: YARN-9060-trunk.015.patch) > [YARN-8851] Phase 1 - Support device isolation and

[jira] [Commented] (YARN-9123) Clean up and split testcases in TestNMWebServices for GPU support

2019-01-31 Thread Szilard Nemeth (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757393#comment-16757393 ] Szilard Nemeth commented on YARN-9123: -- Hi [~pbacsko]! Thanks for the review! I fixed all of the

[jira] [Updated] (YARN-9123) Clean up and split testcases in TestNMWebServices for GPU support

2019-01-31 Thread Szilard Nemeth (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Szilard Nemeth updated YARN-9123: - Attachment: YARN-9123.003.patch > Clean up and split testcases in TestNMWebServices for GPU

[jira] [Updated] (YARN-9186) [CSI] Upgrade CSI proto to v1.0

2019-01-31 Thread Weiwei Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weiwei Yang updated YARN-9186: -- Attachment: YARN-9186.01.patch > [CSI] Upgrade CSI proto to v1.0 > --- > >

[jira] [Commented] (YARN-9186) [CSI] Upgrade CSI proto to v1.0

2019-01-31 Thread Sunil Govindan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757379#comment-16757379 ] Sunil Govindan commented on YARN-9186: -- Thanks [~cheersyang] for confirming. Since they marked as

[jira] [Comment Edited] (YARN-9186) [CSI] Upgrade CSI proto to v1.0

2019-01-31 Thread Weiwei Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757377#comment-16757377 ] Weiwei Yang edited comment on YARN-9186 at 1/31/19 3:43 PM: Hi [~sunilg] I

[jira] [Updated] (YARN-9133) Make tests more easy to comprehend in TestGpuResourceHandler

2019-01-31 Thread Szilard Nemeth (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Szilard Nemeth updated YARN-9133: - Attachment: YARN-9133.003.patch > Make tests more easy to comprehend in TestGpuResourceHandler >

[jira] [Commented] (YARN-9133) Make tests more easy to comprehend in TestGpuResourceHandler

2019-01-31 Thread Szilard Nemeth (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757381#comment-16757381 ] Szilard Nemeth commented on YARN-9133: -- Thanks [~pbacsko] for your review comments! I fixed all the

[jira] [Updated] (YARN-9186) [CSI] Upgrade CSI proto to v1.0

2019-01-31 Thread Weiwei Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weiwei Yang updated YARN-9186: -- Attachment: (was: YARN-9186.01.patch) > [CSI] Upgrade CSI proto to v1.0 >

[jira] [Commented] (YARN-9186) [CSI] Upgrade CSI proto to v1.0

2019-01-31 Thread Weiwei Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757377#comment-16757377 ] Weiwei Yang commented on YARN-9186: --- Hi [~sunilg] I don't think it is possible to support both v0.3 and

[jira] [Commented] (YARN-9099) GpuResourceAllocator#getReleasingGpus calculates number of GPUs in a wrong way

2019-01-31 Thread Szilard Nemeth (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9099?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757354#comment-16757354 ] Szilard Nemeth commented on YARN-9099: -- Thanks [~sunilg] > GpuResourceAllocator#getReleasingGpus

[jira] [Commented] (YARN-9258) DistributedShell PlacementSpec fails to parse

2019-01-31 Thread Weiwei Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757348#comment-16757348 ] Weiwei Yang commented on YARN-9258: --- Hi [~Prabhu Joseph] It doesn't seem to be a valid expression to

[jira] [Updated] (YARN-9138) Test error handling of nvidia-smi binary execution of GpuDiscoverer

2019-01-31 Thread Szilard Nemeth (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Szilard Nemeth updated YARN-9138: - Attachment: YARN-9138.002.patch > Test error handling of nvidia-smi binary execution of

[jira] [Comment Edited] (YARN-9258) DistributedShell PlacementSpec fails to parse

2019-01-31 Thread Weiwei Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757348#comment-16757348 ] Weiwei Yang edited comment on YARN-9258 at 1/31/19 3:27 PM: Hi [~Prabhu

[jira] [Commented] (YARN-9138) Test error handling of nvidia-smi binary execution of GpuDiscoverer

2019-01-31 Thread Szilard Nemeth (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757351#comment-16757351 ] Szilard Nemeth commented on YARN-9138: -- Hi [~pbacsko]! Thanks for the review! # Description added

[jira] [Updated] (YARN-9138) Test error handling of nvidia-smi binary execution of GpuDiscoverer

2019-01-31 Thread Szilard Nemeth (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Szilard Nemeth updated YARN-9138: - Description: The code that executes nvidia-smi (doing GPU device auto-discovery) don't have much

[jira] [Commented] (YARN-9139) Simplify initializer code of GpuDiscoverer

2019-01-31 Thread Szilard Nemeth (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757335#comment-16757335 ] Szilard Nemeth commented on YARN-9139: -- Hi [~pbacsko]! Thanks for your review and for the point you

[jira] [Updated] (YARN-9139) Simplify initializer code of GpuDiscoverer

2019-01-31 Thread Szilard Nemeth (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9139?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Szilard Nemeth updated YARN-9139: - Attachment: YARN-9139.002.patch > Simplify initializer code of GpuDiscoverer >

[jira] [Created] (YARN-9258) DistributedShell PlacementSpec fails to parse

2019-01-31 Thread Prabhu Joseph (JIRA)
Prabhu Joseph created YARN-9258: --- Summary: DistributedShell PlacementSpec fails to parse Key: YARN-9258 URL: https://issues.apache.org/jira/browse/YARN-9258 Project: Hadoop YARN Issue Type:

[jira] [Commented] (YARN-9118) Handle issues with parsing user defined GPU devices in GpuDiscoverer

2019-01-31 Thread Szilard Nemeth (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9118?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757240#comment-16757240 ] Szilard Nemeth commented on YARN-9118: -- UT failure seems unrelated. > Handle issues with parsing

[jira] [Commented] (YARN-9118) Handle issues with parsing user defined GPU devices in GpuDiscoverer

2019-01-31 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9118?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757221#comment-16757221 ] Hadoop QA commented on YARN-9118: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-9186) [CSI] Upgrade CSI proto to v1.0

2019-01-31 Thread Sunil Govindan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757169#comment-16757169 ] Sunil Govindan commented on YARN-9186: -- Hi [~cheersyang] Do we want to support 0.3 version as well OR

[jira] [Commented] (YARN-9208) Distributed shell allow LocalResourceVisibility to be specified

2019-01-31 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16757168#comment-16757168 ] Hadoop QA commented on YARN-9208: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Updated] (YARN-9118) Handle issues with parsing user defined GPU devices in GpuDiscoverer

2019-01-31 Thread Szilard Nemeth (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Szilard Nemeth updated YARN-9118: - Attachment: YARN-9118.005.patch > Handle issues with parsing user defined GPU devices in

[jira] [Updated] (YARN-9118) Handle issues with parsing user defined GPU devices in GpuDiscoverer

2019-01-31 Thread Szilard Nemeth (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Szilard Nemeth updated YARN-9118: - Attachment: (was: YARN-9118.005.patch) > Handle issues with parsing user defined GPU devices

[jira] [Updated] (YARN-9118) Handle issues with parsing user defined GPU devices in GpuDiscoverer

2019-01-31 Thread Szilard Nemeth (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Szilard Nemeth updated YARN-9118: - Attachment: YARN-9118.006.patch > Handle issues with parsing user defined GPU devices in
