[jira] [Commented] (YARN-9140) Code cleanup in ResourcePluginManager.initialize and in TestResourcePluginManager

2019-08-14 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16907224#comment-16907224 ] Peter Bacsko commented on YARN-9140: ASF license warning can be ignored. > Code cleanup in

[jira] [Updated] (YARN-9217) Nodemanager will fail to start if GPU is misconfigured on the node or GPU drivers missing

2019-08-16 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9217: --- Attachment: YARN-9217.012.patch > Nodemanager will fail to start if GPU is misconfigured on the node

[jira] [Updated] (YARN-9217) Nodemanager will fail to start if GPU is misconfigured on the node or GPU drivers missing

2019-08-16 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9217: --- Attachment: YARN-9217.branch-3.2.001.patch > Nodemanager will fail to start if GPU is misconfigured

[jira] [Updated] (YARN-9100) Add tests for GpuResourceAllocator and do minor code cleanup

2019-08-16 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9100: --- Attachment: YARN-9100.branch-3.2.002.patch > Add tests for GpuResourceAllocator and do minor code

[jira] [Updated] (YARN-9217) Nodemanager will fail to start if GPU is misconfigured on the node or GPU drivers missing

2019-08-15 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9217: --- Attachment: YARN-9217.011.patch > Nodemanager will fail to start if GPU is misconfigured on the node

[jira] [Updated] (YARN-9217) Nodemanager will fail to start if GPU is misconfigured on the node or GPU drivers missing

2019-08-15 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9217: --- Attachment: (was: YARN-9217.011.patch) > Nodemanager will fail to start if GPU is misconfigured

[jira] [Updated] (YARN-9217) Nodemanager will fail to start if GPU is misconfigured on the node or GPU drivers missing

2019-08-15 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9217: --- Attachment: YARN-9217.011.patch > Nodemanager will fail to start if GPU is misconfigured on the node

[jira] [Created] (YARN-9749) TestAppLogAggregatorImpl#testDFSQuotaExceeded fails on trunk

2019-08-15 Thread Peter Bacsko (JIRA)
Peter Bacsko created YARN-9749: -- Summary: TestAppLogAggregatorImpl#testDFSQuotaExceeded fails on trunk Key: YARN-9749 URL: https://issues.apache.org/jira/browse/YARN-9749 Project: Hadoop YARN

[jira] [Updated] (YARN-9749) TestAppLogAggregatorImpl#testDFSQuotaExceeded fails on trunk

2019-08-15 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9749: --- Component/s: (was: yarn) test log-aggregation >

[jira] [Updated] (YARN-8586) Extract log aggregation related fields and methods from RMAppImpl

2019-08-15 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-8586: --- Attachment: YARN-8586-branch-3.1.001.patch > Extract log aggregation related fields and methods from

[jira] [Updated] (YARN-9217) Nodemanager will fail to start if GPU is misconfigured on the node or GPU drivers missing

2019-08-15 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9217: --- Attachment: YARN-9217.010.patch > Nodemanager will fail to start if GPU is misconfigured on the node

[jira] [Commented] (YARN-9217) Nodemanager will fail to start if GPU is misconfigured on the node or GPU drivers missing

2019-08-15 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16907985#comment-16907985 ] Peter Bacsko commented on YARN-9217: Rebased patch (again) + introduced new fail-fast property. >

[jira] [Updated] (YARN-9100) Add tests for GpuResourceAllocator and do minor code cleanup

2019-08-15 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9100: --- Attachment: YARN-9100-009.patch > Add tests for GpuResourceAllocator and do minor code cleanup >

[jira] [Commented] (YARN-9749) TestAppLogAggregatorImpl#testDFSQuotaExceeded fails on trunk

2019-08-15 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9749?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16908384#comment-16908384 ] Peter Bacsko commented on YARN-9749: +1 non-binding > TestAppLogAggregatorImpl#testDFSQuotaExceeded

[jira] [Commented] (YARN-9133) Make tests more easy to comprehend in TestGpuResourceHandler

2019-08-14 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16907074#comment-16907074 ] Peter Bacsko commented on YARN-9133: ASF license issue can be ignored. > Make tests more easy to

[jira] [Updated] (YARN-9100) Add tests for GpuResourceAllocator and do minor code cleanup

2019-08-10 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9100: --- Attachment: YARN-9100-006.patch > Add tests for GpuResourceAllocator and do minor code cleanup >

[jira] [Commented] (YARN-9100) Add tests for GpuResourceAllocator and do minor code cleanup

2019-08-10 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16904453#comment-16904453 ] Peter Bacsko commented on YARN-9100: Patch v5 no longer applies. Created v6. > Add tests for

[jira] [Updated] (YARN-9135) NM State store ResourceMappings serialization are tested with Strings instead of real Device objects

2019-08-10 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9135: --- Attachment: YARN-9105.branch-3.2.001.patch > NM State store ResourceMappings serialization are tested

[jira] [Updated] (YARN-9135) NM State store ResourceMappings serialization are tested with Strings instead of real Device objects

2019-08-10 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9135: --- Attachment: YARN-9105.branch-3.1.001.patch > NM State store ResourceMappings serialization are tested

[jira] [Commented] (YARN-9676) Add DEBUG and TRACE level messages to AppLogAggregatorImpl and connected classes

2019-08-13 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16906018#comment-16906018 ] Peter Bacsko commented on YARN-9676: +1 LGTM (non-binding) > Add DEBUG and TRACE level messages to

[jira] [Updated] (YARN-9133) Make tests more easy to comprehend in TestGpuResourceHandler

2019-08-13 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9133: --- Attachment: YARN-9133.007.patch > Make tests more easy to comprehend in TestGpuResourceHandler >

[jira] [Updated] (YARN-9140) Code cleanup in ResourcePluginManager.initialize and in TestResourcePluginManager

2019-08-14 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9140: --- Attachment: YARN-9140.branch-3.1.001.patch > Code cleanup in ResourcePluginManager.initialize and in

[jira] [Updated] (YARN-9100) Add tests for GpuResourceAllocator and do minor code cleanup

2019-08-16 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9100: --- Attachment: YARN-9100.branch-3.1.002.patch > Add tests for GpuResourceAllocator and do minor code

[jira] [Commented] (YARN-9100) Add tests for GpuResourceAllocator and do minor code cleanup

2019-08-16 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16908905#comment-16908905 ] Peter Bacsko commented on YARN-9100: Branch-3.1 failures are due to compilation errors. Fixed in

[jira] [Commented] (YARN-9477) Implement VE discovery using libudev

2019-09-03 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9477?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16921218#comment-16921218 ] Peter Bacsko commented on YARN-9477: [~jojochuang] thanks for informing me about this and following-up

[jira] [Updated] (YARN-6715) Fix documentation about NodeHealthScriptRunner

2019-08-28 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-6715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-6715: --- Component/s: documentation > Fix documentation about NodeHealthScriptRunner >

[jira] [Updated] (YARN-6715) Fix documentation about NodeHealthScriptRunner

2019-08-28 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-6715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-6715: --- Attachment: YARN-6715-001.patch > Fix documentation about NodeHealthScriptRunner >

[jira] [Commented] (YARN-9290) Invalid SchedulingRequest not rejected in Scheduler PlacementConstraintsHandler

2019-08-28 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16917691#comment-16917691 ] Peter Bacsko commented on YARN-9290: Some minor comments: {code} public List

[jira] [Comment Edited] (YARN-9290) Invalid SchedulingRequest not rejected in Scheduler PlacementConstraintsHandler

2019-08-28 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16917691#comment-16917691 ] Peter Bacsko edited comment on YARN-9290 at 8/28/19 11:20 AM: -- Some minor

[jira] [Updated] (YARN-6715) Fix documentation about NodeHealthScriptRunner

2019-08-28 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-6715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-6715: --- Attachment: YARN-6715-002.patch > Fix documentation about NodeHealthScriptRunner >

[jira] [Commented] (YARN-9699) Migration tool that help to generate CS configs based on FS

2019-08-29 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16918549#comment-16918549 ] Peter Bacsko commented on YARN-9699: Just add some extra thoughts: the tool would work similarly to

[jira] [Commented] (YARN-6715) Fix documentation about NodeHealthScriptRunner

2019-09-02 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-6715?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16920762#comment-16920762 ] Peter Bacsko commented on YARN-6715: [~miklos.szeg...@cloudera.com] can I ask you to review & commit?

[jira] [Updated] (YARN-6715) Fix documentation about NodeHealthScriptRunner

2019-08-28 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-6715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-6715: --- Attachment: YARN-6715-003.patch > Fix documentation about NodeHealthScriptRunner >

[jira] [Commented] (YARN-9786) testCancelledDelegationToken fails intermittently

2019-08-26 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9786?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16915916#comment-16915916 ] Peter Bacsko commented on YARN-9786: [~adam.antal] this is a duplicate of YARN-9461. It's been solved

[jira] [Commented] (YARN-6715) Fix documentation about NodeHealthScriptRunner

2019-09-04 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-6715?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=1691#comment-1691 ] Peter Bacsko commented on YARN-6715: [~szegedim] yes, that's fine. > Fix documentation about

[jira] [Updated] (YARN-9011) Race condition during decommissioning

2019-09-12 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9011: --- Attachment: YARN-9011-001.patch > Race condition during decommissioning >

[jira] [Updated] (YARN-9011) Race condition during decommissioning

2019-09-12 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9011: --- Component/s: nodemanager > Race condition during decommissioning >

[jira] [Commented] (YARN-9011) Race condition during decommissioning

2019-09-12 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16928826#comment-16928826 ] Peter Bacsko commented on YARN-9011: Ignore patch v1, it's not enough. The proper solution is more

[jira] [Updated] (YARN-9011) Race condition during decommissioning

2019-09-13 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9011: --- Attachment: YARN-9011-002.patch > Race condition during decommissioning >

[jira] [Commented] (YARN-9011) Race condition during decommissioning

2019-09-13 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16929234#comment-16929234 ] Peter Bacsko commented on YARN-9011: Uploaded v2, it's still considered to be a POC. > Race condition

[jira] [Updated] (YARN-9011) Race condition during decommissioning

2019-09-16 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9011: --- Attachment: YARN-9011-004.patch > Race condition during decommissioning >

[jira] [Commented] (YARN-9011) Race condition during decommissioning

2019-09-18 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16932170#comment-16932170 ] Peter Bacsko commented on YARN-9011: [~adam.antal] absolutely, this patch needs some explanation - it

[jira] [Updated] (YARN-9011) Race condition during decommissioning

2019-09-16 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9011: --- Attachment: YARN-9011-003.patch > Race condition during decommissioning >

[jira] [Updated] (YARN-9011) Race condition during decommissioning

2019-09-16 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9011: --- Attachment: (was: YARN-9011-003.patch) > Race condition during decommissioning >

[jira] [Updated] (YARN-9011) Race condition during decommissioning

2019-09-16 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9011: --- Attachment: YARN-9011-003.patch > Race condition during decommissioning >

[jira] [Updated] (YARN-9833) Race condition when DirectoryCollection.checkDirs() runs during container launch

2019-09-16 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9833?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9833: --- Attachment: YARN-9833-001.patch > Race condition when DirectoryCollection.checkDirs() runs during

[jira] [Commented] (YARN-6715) Fix documentation about NodeHealthScriptRunner

2019-09-12 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-6715?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16928981#comment-16928981 ] Peter Bacsko commented on YARN-6715: ping [~szegedim] > Fix documentation about

[jira] [Created] (YARN-9833) Race condition when DirectoryCollection.checkDirs() runs during container launch

2019-09-12 Thread Peter Bacsko (Jira)
Peter Bacsko created YARN-9833: -- Summary: Race condition when DirectoryCollection.checkDirs() runs during container launch Key: YARN-9833 URL: https://issues.apache.org/jira/browse/YARN-9833 Project:

[jira] [Created] (YARN-9841) Capacity scheduler: add support for combined %user + %primary_group mapping

2019-09-19 Thread Peter Bacsko (Jira)
Peter Bacsko created YARN-9841: -- Summary: Capacity scheduler: add support for combined %user + %primary_group mapping Key: YARN-9841 URL: https://issues.apache.org/jira/browse/YARN-9841 Project: Hadoop

[jira] [Updated] (YARN-9840) Capacity scheduler: add support for Secondary Group user mapping

2019-09-19 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9840?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9840: --- Component/s: capacity scheduler > Capacity scheduler: add support for Secondary Group user mapping >

[jira] [Created] (YARN-9840) Capacity scheduler: add support for Secondary Group user mapping

2019-09-19 Thread Peter Bacsko (Jira)
Peter Bacsko created YARN-9840: -- Summary: Capacity scheduler: add support for Secondary Group user mapping Key: YARN-9840 URL: https://issues.apache.org/jira/browse/YARN-9840 Project: Hadoop YARN

[jira] [Updated] (YARN-9840) Capacity scheduler: add support for Secondary Group rule mapping

2019-09-19 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9840?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9840: --- Summary: Capacity scheduler: add support for Secondary Group rule mapping (was: Capacity scheduler:

[jira] [Updated] (YARN-9134) No test coverage for redefining FPGA / GPU resource types in TestResourceUtils

2019-08-07 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9134: --- Attachment: YARN-9134.003.patch > No test coverage for redefining FPGA / GPU resource types in

[jira] [Commented] (YARN-9134) No test coverage for redefining FPGA / GPU resource types in TestResourceUtils

2019-08-07 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16902007#comment-16902007 ] Peter Bacsko commented on YARN-9134: Rebased the patch. > No test coverage for redefining FPGA / GPU

[jira] [Commented] (YARN-9217) Nodemanager will fail to start if GPU is misconfigured on the node or GPU drivers missing

2019-08-07 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16901986#comment-16901986 ] Peter Bacsko commented on YARN-9217: This patch needed yet another rebase due to renames. >

[jira] [Updated] (YARN-9217) Nodemanager will fail to start if GPU is misconfigured on the node or GPU drivers missing

2019-08-07 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9217: --- Attachment: YARN-9217.006.patch > Nodemanager will fail to start if GPU is misconfigured on the node

[jira] [Commented] (YARN-9667) Container-executor.c duplicates messages to stdout

2019-08-05 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16900236#comment-16900236 ] Peter Bacsko commented on YARN-9667: Ah sorry it wasn't clear. I kept those only because they occur

[jira] [Commented] (YARN-9667) Container-executor.c duplicates messages to stdout

2019-08-05 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16900226#comment-16900226 ] Peter Bacsko commented on YARN-9667: Changes in the patch: # Removed unnecessary {{fflush()}} calls

[jira] [Commented] (YARN-9667) Container-executor.c duplicates messages to stdout

2019-08-05 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16900227#comment-16900227 ] Peter Bacsko commented on YARN-9667: [~eyang] [~snemeth] pls check this out when you have some time.

[jira] [Updated] (YARN-9667) Container-executor.c duplicates messages to stdout

2019-08-05 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9667: --- Attachment: YARN-9667-001.patch > Container-executor.c duplicates messages to stdout >

[jira] [Updated] (YARN-9217) Nodemanager will fail to start if GPU is misconfigured on the node or GPU drivers missing

2019-08-09 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9217: --- Attachment: YARN-9217.007.patch > Nodemanager will fail to start if GPU is misconfigured on the node

[jira] [Commented] (YARN-9134) No test coverage for redefining FPGA / GPU resource types in TestResourceUtils

2019-08-09 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16903741#comment-16903741 ] Peter Bacsko commented on YARN-9134: Uploaded patch v4. _"Also, setupResourceTypes has been

[jira] [Updated] (YARN-9134) No test coverage for redefining FPGA / GPU resource types in TestResourceUtils

2019-08-09 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9134: --- Attachment: YARN-9134.004.patch > No test coverage for redefining FPGA / GPU resource types in

[jira] [Updated] (YARN-9217) Nodemanager will fail to start if GPU is misconfigured on the node or GPU drivers missing

2019-08-09 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9217: --- Attachment: YARN-9217.008.patch > Nodemanager will fail to start if GPU is misconfigured on the node

[jira] [Updated] (YARN-8586) Extract log aggregation related fields and methods from RMAppImpl

2019-08-09 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-8586: --- Attachment: YARN-8586.003.patch > Extract log aggregation related fields and methods from RMAppImpl >

[jira] [Commented] (YARN-9133) Make tests more easy to comprehend in TestGpuResourceHandler

2019-08-09 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16903805#comment-16903805 ] Peter Bacsko commented on YARN-9133: [~snemeth] similarly to YARN-9140, conflict resolution might not

[jira] [Commented] (YARN-9140) Code cleanup in ResourcePluginManager.initialize and in TestResourcePluginManager

2019-08-09 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16903802#comment-16903802 ] Peter Bacsko commented on YARN-9140: [~snemeth] there are already 4 commits difference between trunk

[jira] [Commented] (YARN-9217) Nodemanager will fail to start if GPU is misconfigured on the node or GPU drivers missing

2019-07-20 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16889508#comment-16889508 ] Peter Bacsko commented on YARN-9217: [~snemeth] I rebased the patch but the amount of conflicts forced

[jira] [Updated] (YARN-9217) Nodemanager will fail to start if GPU is misconfigured on the node or GPU drivers missing

2019-07-20 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9217: --- Attachment: YARN-9217.005.patch > Nodemanager will fail to start if GPU is misconfigured on the node

[jira] [Commented] (YARN-9667) Container-executor.c duplicates messages to stdout

2019-07-19 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16889002#comment-16889002 ] Peter Bacsko commented on YARN-9667: [~eyang] sure, I'm on vacation right now, but I'll upload a patch

[jira] [Assigned] (YARN-9667) Container-executor.c duplicates messages to stdout

2019-07-19 Thread Peter Bacsko (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko reassigned YARN-9667: -- Assignee: Peter Bacsko > Container-executor.c duplicates messages to stdout >

[jira] [Commented] (YARN-9840) Capacity scheduler: add support for Secondary Group rule mapping

2019-09-20 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9840?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16934601#comment-16934601 ] Peter Bacsko commented on YARN-9840: [~maniraj...@gmail.com] if you already have a patch, feel free to

[jira] [Commented] (YARN-9699) Migration tool that help to generate CS configs based on FS

2019-09-21 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16934948#comment-16934948 ] Peter Bacsko commented on YARN-9699: [~jiwq] [~Prabhu Joseph] [~shuzirra] [~sunilg] I uploaded a POC

[jira] [Updated] (YARN-9699) Migration tool that help to generate CS configs based on FS

2019-09-21 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9699: --- Attachment: FS_to_CS_migration_POC.patch > Migration tool that help to generate CS configs based on

[jira] [Updated] (YARN-9552) FairScheduler: NODE_UPDATE can cause NoSuchElementException

2019-09-21 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9552: --- Attachment: YARN-9552-branch-3.2.003.patch > FairScheduler: NODE_UPDATE can cause

[jira] [Commented] (YARN-9552) FairScheduler: NODE_UPDATE can cause NoSuchElementException

2019-09-19 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9552?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16934080#comment-16934080 ] Peter Bacsko commented on YARN-9552: [~Steven Rand] it shouldn't be a big deal to create patches that

[jira] [Comment Edited] (YARN-9699) Migration tool that help to generate CS configs based on FS

2019-09-26 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16938306#comment-16938306 ] Peter Bacsko edited comment on YARN-9699 at 9/26/19 6:08 AM: - Had a discussion

[jira] [Comment Edited] (YARN-9699) Migration tool that help to generate CS configs based on FS

2019-09-26 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16938306#comment-16938306 ] Peter Bacsko edited comment on YARN-9699 at 9/26/19 6:24 AM: - Had a discussion

[jira] [Comment Edited] (YARN-9699) Migration tool that help to generate CS configs based on FS

2019-09-26 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16938306#comment-16938306 ] Peter Bacsko edited comment on YARN-9699 at 9/26/19 6:25 AM: - Had a discussion

[jira] [Commented] (YARN-9552) FairScheduler: NODE_UPDATE can cause NoSuchElementException

2019-09-25 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9552?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16938279#comment-16938279 ] Peter Bacsko commented on YARN-9552: [~snemeth] you can now backport this patch to branch-3.1 and

[jira] [Commented] (YARN-9841) Capacity scheduler: add support for combined %user + %primary_group mapping

2019-09-25 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16938277#comment-16938277 ] Peter Bacsko commented on YARN-9841: Jenkins picked up the junit patch, re-uploading patch 001 again.

[jira] [Updated] (YARN-9841) Capacity scheduler: add support for combined %user + %primary_group mapping

2019-09-25 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9841: --- Attachment: YARN-9841.001.patch > Capacity scheduler: add support for combined %user + %primary_group

[jira] [Commented] (YARN-6715) Fix documentation about NodeHealthScriptRunner

2019-09-25 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-6715?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16938280#comment-16938280 ] Peter Bacsko commented on YARN-6715: [~snemeth] patches are ready to be committed to branch-3.1 and

[jira] [Comment Edited] (YARN-9699) Migration tool that help to generate CS configs based on FS

2019-09-26 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16938306#comment-16938306 ] Peter Bacsko edited comment on YARN-9699 at 9/26/19 6:14 AM: - Had a discussion

[jira] [Commented] (YARN-9841) Capacity scheduler: add support for combined %user + %primary_group mapping

2019-09-25 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16938281#comment-16938281 ] Peter Bacsko commented on YARN-9841: [~maniraj...@gmail.com] just a thought. If we have this for

[jira] [Commented] (YARN-9841) Capacity scheduler: add support for combined %user + %primary_group mapping

2019-09-25 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16938291#comment-16938291 ] Peter Bacsko commented on YARN-9841: Just some really minor comments: {noformat} 70 if

[jira] [Commented] (YARN-9699) Migration tool that help to generate CS configs based on FS

2019-09-26 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16938306#comment-16938306 ] Peter Bacsko commented on YARN-9699: Had a discussion with [~sunilg], [~Prabhu Joseph], [~snemeth]. A

[jira] [Commented] (YARN-9841) Capacity scheduler: add support for combined %user + %primary_group mapping

2019-09-26 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16938988#comment-16938988 ] Peter Bacsko commented on YARN-9841: [~maniraj...@gmail.com] no problem, I'm fine with a separate

[jira] [Commented] (YARN-9011) Race condition during decommissioning

2019-09-24 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16936576#comment-16936576 ] Peter Bacsko commented on YARN-9011: [~tangzhankun] yes, that's correct. The problem is that

[jira] [Comment Edited] (YARN-9011) Race condition during decommissioning

2019-09-24 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16936576#comment-16936576 ] Peter Bacsko edited comment on YARN-9011 at 9/24/19 9:06 AM: - [~tangzhankun]

[jira] [Commented] (YARN-9011) Race condition during decommissioning

2019-09-24 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16936651#comment-16936651 ] Peter Bacsko commented on YARN-9011: Thanks for the comments [~adam.antal] Regarding the property,

[jira] [Comment Edited] (YARN-9011) Race condition during decommissioning

2019-09-23 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16935731#comment-16935731 ] Peter Bacsko edited comment on YARN-9011 at 9/23/19 10:38 AM: -- [~adam.antal]

[jira] [Comment Edited] (YARN-9011) Race condition during decommissioning

2019-09-23 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16935731#comment-16935731 ] Peter Bacsko edited comment on YARN-9011 at 9/23/19 10:38 AM: -- [~adam.antal]

[jira] [Commented] (YARN-9011) Race condition during decommissioning

2019-09-23 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16935731#comment-16935731 ] Peter Bacsko commented on YARN-9011: [~adam.antal] so the problem is that {{ResourceTrackerService}}

[jira] [Commented] (YARN-9840) Capacity scheduler: add support for Secondary Group rule mapping

2019-09-23 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9840?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16935757#comment-16935757 ] Peter Bacsko commented on YARN-9840: Thanks for the patch [~maniraj...@gmail.com]. I triggered a

[jira] [Commented] (YARN-9011) Race condition during decommissioning

2019-09-24 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16936867#comment-16936867 ] Peter Bacsko commented on YARN-9011: " In large clusters we cant expect that to be mills." - if that's

[jira] [Updated] (YARN-9011) Race condition during decommissioning

2019-09-24 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko updated YARN-9011: --- Attachment: YARN-9011-005.patch > Race condition during decommissioning >

[jira] [Commented] (YARN-9011) Race condition during decommissioning

2019-09-24 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16936729#comment-16936729 ] Peter Bacsko commented on YARN-9011: [~bibinchundatt] thanks for the insights. There's no impact on

[jira] [Commented] (YARN-9699) Migration tool that help to generate CS configs based on FS

2019-09-24 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16936691#comment-16936691 ] Peter Bacsko commented on YARN-9699: Thanks [~Prabhu Joseph] for the comments. 1. This is completely

[jira] [Reopened] (YARN-9552) FairScheduler: NODE_UPDATE can cause NoSuchElementException

2019-09-20 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko reopened YARN-9552: > FairScheduler: NODE_UPDATE can cause NoSuchElementException >
