[jira] [Comment Edited] (YARN-10694) Fix spotbugs warning in CapacitySchedulerConfiguration.java

2021-03-15 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17302174#comment-17302174 ] Qi Zhu edited comment on YARN-10694 at 3/16/21, 3:19 AM: - Thanks [~aajisaka] for

[jira] [Comment Edited] (YARN-10503) Support queue capacity in terms of absolute resources with gpu resourceType.

2021-03-15 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17300121#comment-17300121 ] Qi Zhu edited comment on YARN-10503 at 3/16/21, 3:06 AM: - Updated a patch for

[jira] [Updated] (YARN-10690) GPU related improvement for better usage.

2021-03-15 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10690: -- Description: This Jira will improve GPU for better usage.   > GPU related improvement for better usage. >

[jira] [Updated] (YARN-10695) Event related improvement of YARN for better usage.

2021-03-16 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10695?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10695: -- Description: This jira, marked the event related improvement in yarn for better usage.   > Event related

[jira] [Updated] (YARN-9927) RM multi-thread event processing mechanism

2021-03-16 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-9927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-9927: - Parent: YARN-10695 Issue Type: Sub-task (was: Improvement) > RM multi-thread event processing mechanism

[jira] [Updated] (YARN-10695) Event related improvement of YARN for better usage.

2021-03-16 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10695?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10695: -- Description: This jira, marked the event related improvement in yarn for better usage.  cc  was: This

[jira] [Updated] (YARN-10695) Event related improvement of YARN for better usage.

2021-03-16 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10695?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10695: -- Description: This jira, marked the event related improvement in yarn for better usage.  cc [~bibinchundatt] 

[jira] [Updated] (YARN-10695) Event related improvement of YARN for better usage.

2021-03-16 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10695?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10695: -- Description: This jira, marked the event related improvement in yarn for better usage.  cc [~bibinchundatt] 

[jira] [Updated] (YARN-9615) Add dispatcher metrics to RM

2021-03-16 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-9615?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-9615: - Parent: YARN-10695 Issue Type: Sub-task (was: Task) > Add dispatcher metrics to RM >

[jira] [Updated] (YARN-8995) Log events info in AsyncDispatcher when event queue size cumulatively reaches a certain number every time.

2021-03-16 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-8995?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-8995: - Parent: YARN-10695 Issue Type: Sub-task (was: Improvement) > Log events info in AsyncDispatcher when

[jira] [Created] (YARN-10696) Add RMNodeEvent to single async dispatcher before YARN-9927.

2021-03-16 Thread Qi Zhu (Jira)
Qi Zhu created YARN-10696: - Summary: Add RMNodeEvent to single async dispatcher before YARN-9927. Key: YARN-10696 URL: https://issues.apache.org/jira/browse/YARN-10696 Project: Hadoop YARN Issue

[jira] [Updated] (YARN-9618) NodeListManager event improvement

2021-03-16 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-9618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-9618: - Parent Issue: YARN-10695 (was: YARN-9871) > NodeListManager event improvement >

[jira] [Updated] (YARN-10690) GPU related improvement for better usage.

2021-03-16 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10690: -- Description: This Jira will improve GPU for better usage.  cc [~bibinchundatt] [~pbacsko] [~ebadger] [~ztang] 

[jira] [Created] (YARN-10695) Event related improvement of YARN for better usage.

2021-03-16 Thread Qi Zhu (Jira)
Qi Zhu created YARN-10695: - Summary: Event related improvement of YARN for better usage. Key: YARN-10695 URL: https://issues.apache.org/jira/browse/YARN-10695 Project: Hadoop YARN Issue Type:

[jira] [Updated] (YARN-9618) NodeListManager event improvement

2021-03-16 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-9618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-9618: - Attachment: YARN-9618.002.patch > NodeListManager event improvement > - > >

[jira] [Commented] (YARN-9618) NodeListManager event improvement

2021-03-16 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-9618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17302371#comment-17302371 ] Qi Zhu commented on YARN-9618: -- [~bibinchundatt] [~pbacsko] [~ebadger] [~epayne] [~gandras]  [~bteke] Could

[jira] [Updated] (YARN-10674) fs2cs: should support auto created queue deletion.

2021-03-14 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10674: -- Attachment: YARN-10674.009.patch > fs2cs: should support auto created queue deletion. >

[jira] [Commented] (YARN-10674) fs2cs: should support auto created queue deletion.

2021-03-14 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17301094#comment-17301094 ] Qi Zhu commented on YARN-10674: --- Fixed checkstyle and findbugs.:D > fs2cs: should support auto created

[jira] [Created] (YARN-10692) Add Node GPU Utilization and apply to NodeMetrics.

2021-03-15 Thread Qi Zhu (Jira)
Qi Zhu created YARN-10692: - Summary: Add Node GPU Utilization and apply to NodeMetrics. Key: YARN-10692 URL: https://issues.apache.org/jira/browse/YARN-10692 Project: Hadoop YARN Issue Type:

[jira] [Updated] (YARN-10692) Add Node GPU Utilization and apply to NodeMetrics.

2021-03-15 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10692: -- Attachment: (was: YARN-10692.001.patch) > Add Node GPU Utilization and apply to NodeMetrics. >

[jira] [Updated] (YARN-10692) Add Node GPU Utilization and apply to NodeMetrics.

2021-03-15 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10692: -- Attachment: YARN-10692.001.patch > Add Node GPU Utilization and apply to NodeMetrics. >

[jira] [Commented] (YARN-10692) Add Node GPU Utilization and apply to NodeMetrics.

2021-03-15 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17301452#comment-17301452 ] Qi Zhu commented on YARN-10692: --- [~pbacsko]  [~Jim_Brennan]  [~ebadger]  [~gandras]   Could you help

[jira] [Created] (YARN-10693) Add document for YARN-10623 auto refresh queue conf in cs.

2021-03-15 Thread Qi Zhu (Jira)
Qi Zhu created YARN-10693: - Summary: Add document for YARN-10623 auto refresh queue conf in cs. Key: YARN-10693 URL: https://issues.apache.org/jira/browse/YARN-10693 Project: Hadoop YARN Issue Type:

[jira] [Updated] (YARN-10692) Add Node GPU Utilization and apply to NodeMetrics.

2021-03-15 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10692: -- Description: Now there are no node level GPU Utilization, this issue will add it, and add it to NodeMetrics

[jira] [Commented] (YARN-10674) fs2cs: should support auto created queue deletion.

2021-03-14 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17301340#comment-17301340 ] Qi Zhu commented on YARN-10674: --- [~pbacsko] [~gandras] Finding bugs will be fixed in YARN-10689. :D

[jira] [Commented] (YARN-9618) NodeListManager event improvement

2021-03-16 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-9618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17302517#comment-17302517 ] Qi Zhu commented on YARN-9618: -- Fixed test and checkstyle in latest patch. :D > NodeListManager event

[jira] [Updated] (YARN-9618) NodeListManager event improvement

2021-03-16 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-9618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-9618: - Attachment: YARN-9618.003.patch > NodeListManager event improvement > - > >

[jira] [Updated] (YARN-10674) fs2cs: should support auto created queue deletion.

2021-03-16 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10674: -- Attachment: YARN-10674.011.patch > fs2cs: should support auto created queue deletion. >

[jira] [Commented] (YARN-10674) fs2cs: should support auto created queue deletion.

2021-03-16 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17302595#comment-17302595 ] Qi Zhu commented on YARN-10674: --- Thanks [~pbacsko] for valid suggestion. Updated this in latest patch.:D

[jira] [Commented] (YARN-10674) fs2cs: should support auto created queue deletion.

2021-03-18 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17304122#comment-17304122 ] Qi Zhu commented on YARN-10674: --- Thanks a lot [~pbacsko] for patient review. Very good suggestion, it make

[jira] [Commented] (YARN-10701) The yarn.resource-types should support multi types without trimmed.

2021-03-18 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17304124#comment-17304124 ] Qi Zhu commented on YARN-10701: --- Thanks [~gandras] for your confirm. [~pbacsko] Could you help review

[jira] [Updated] (YARN-10674) fs2cs: should support auto created queue deletion.

2021-03-18 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10674: -- Attachment: YARN-10674.015.patch > fs2cs: should support auto created queue deletion. >

[jira] [Comment Edited] (YARN-10674) fs2cs: should support auto created queue deletion.

2021-03-18 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17304142#comment-17304142 ] Qi Zhu edited comment on YARN-10674 at 3/18/21, 1:24 PM: - Thanks [~gandras] for

[jira] [Commented] (YARN-10674) fs2cs: should support auto created queue deletion.

2021-03-18 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17304142#comment-17304142 ] Qi Zhu commented on YARN-10674: --- Thanks [~gandras] for reply. If we don't have  PreemptionMode.ENABLED, we

[jira] [Created] (YARN-10703) Fix potential null pointer error of gpuNodeResourceUpdateHandler in NodeResourceMonitorImpl.

2021-03-18 Thread Qi Zhu (Jira)
Qi Zhu created YARN-10703: - Summary: Fix potential null pointer error of gpuNodeResourceUpdateHandler in NodeResourceMonitorImpl. Key: YARN-10703 URL: https://issues.apache.org/jira/browse/YARN-10703

[jira] [Comment Edited] (YARN-10674) fs2cs: should support auto created queue deletion.

2021-03-18 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17304142#comment-17304142 ] Qi Zhu edited comment on YARN-10674 at 3/18/21, 1:29 PM: - Thanks [~gandras] for

[jira] [Updated] (YARN-10674) fs2cs: should support auto created queue deletion.

2021-03-18 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10674: -- Attachment: YARN-10674.016.patch > fs2cs: should support auto created queue deletion. >

[jira] [Comment Edited] (YARN-10674) fs2cs: should support auto created queue deletion.

2021-03-18 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17304142#comment-17304142 ] Qi Zhu edited comment on YARN-10674 at 3/18/21, 1:26 PM: - Thanks [~gandras] for

[jira] [Commented] (YARN-10703) Fix potential null pointer error of gpuNodeResourceUpdateHandler in NodeResourceMonitorImpl.

2021-03-18 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17304214#comment-17304214 ] Qi Zhu commented on YARN-10703: --- [~pbacsko] [~gandras] [~ebadger]  Sorry for the potential null pointer

[jira] [Commented] (YARN-10674) fs2cs: should support auto created queue deletion.

2021-03-18 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17304228#comment-17304228 ] Qi Zhu commented on YARN-10674: --- [~gandras] Now i understand you, we can just use the code: {code:java}

[jira] [Commented] (YARN-10674) fs2cs: should support auto created queue deletion.

2021-03-16 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17303024#comment-17303024 ] Qi Zhu commented on YARN-10674: --- [~pbacsko] Fixed the checkstyle in latest patch.:D > fs2cs: should

[jira] [Commented] (YARN-10616) Nodemanagers cannot detect GPU failures

2021-03-16 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10616?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17303044#comment-17303044 ] Qi Zhu commented on YARN-10616: --- [~ebadger] [~ztang] Actually we can use the graceful decommission way to

[jira] [Updated] (YARN-9618) NodeListManager event improvement

2021-03-16 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-9618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-9618: - Attachment: YARN-9618.004.patch > NodeListManager event improvement > - > >

[jira] [Updated] (YARN-10642) Race condition: AsyncDispatcher can get stuck by the changes introduced in YARN-8995

2021-03-17 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10642: -- Parent: YARN-10695 Issue Type: Sub-task (was: Bug) > Race condition: AsyncDispatcher can get stuck by

[jira] [Commented] (YARN-10688) ClusterMetrics should support GPU capacity related metrics.

2021-03-16 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17303039#comment-17303039 ] Qi Zhu commented on YARN-10688: --- Thanks [~ebadger] for confirm. I also think it is more reasonable to

[jira] [Updated] (YARN-10688) ClusterMetrics should support GPU capacity related metrics.

2021-03-16 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10688?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10688: -- Attachment: YARN-10688.004.patch > ClusterMetrics should support GPU capacity related metrics. >

[jira] [Updated] (YARN-10674) fs2cs: should support auto created queue deletion.

2021-03-16 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10674: -- Attachment: YARN-10674.012.patch > fs2cs: should support auto created queue deletion. >

[jira] [Updated] (YARN-9618) NodeListManager event improvement

2021-03-17 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-9618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-9618: - Attachment: YARN-9618.005.patch > NodeListManager event improvement > - > >

[jira] [Commented] (YARN-10685) Fixed some Typo in AbstractCSQueue.

2021-03-17 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17303130#comment-17303130 ] Qi Zhu commented on YARN-10685: --- [~gandras] [~pbacsko] Could you help review this? Thanks. > Fixed some

[jira] [Comment Edited] (YARN-10497) Fix an issue in CapacityScheduler which fails to delete queues

2021-03-17 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10497?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17298059#comment-17298059 ] Qi Zhu edited comment on YARN-10497 at 3/17/21, 6:12 AM: - [~gandras] [~shuzirra] 

[jira] [Commented] (YARN-10700) Yarn can't submit application, resourcemanager get stuck but not dead

2021-03-17 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17303143#comment-17303143 ] Qi Zhu commented on YARN-10700: --- Thanks [~leix2020] for report. Actually it is the jdk bug, and we have

[jira] [Comment Edited] (YARN-10641) Refactor the max app related update, and fix maxApllications update error when add new queues.

2021-03-18 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10641?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17297081#comment-17297081 ] Qi Zhu edited comment on YARN-10641 at 3/18/21, 9:57 AM: - [~pbacsko] [~gandras]

[jira] [Comment Edited] (YARN-10641) Refactor the max app related update, and fix maxApllications update error when add new queues.

2021-03-18 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10641?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17297081#comment-17297081 ] Qi Zhu edited comment on YARN-10641 at 3/18/21, 9:57 AM: - [~pbacsko] [~gandras]

[jira] [Commented] (YARN-9618) NodeListManager event improvement

2021-03-17 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-9618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17303124#comment-17303124 ] Qi Zhu commented on YARN-9618: -- [~gandras] [~ebadger] Added the EventDispatcher in created logic, to make

[jira] [Updated] (YARN-10674) fs2cs: should support auto created queue deletion.

2021-03-17 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10674: -- Attachment: YARN-10674.013.patch > fs2cs: should support auto created queue deletion. >

[jira] [Updated] (YARN-10701) The yarn.resource-types should support multi types without trimmed.

2021-03-17 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10701: -- Attachment: YARN-10701.002.patch > The yarn.resource-types should support multi types without trimmed. >

[jira] [Commented] (YARN-10674) fs2cs: should support auto created queue deletion.

2021-03-17 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17303448#comment-17303448 ] Qi Zhu commented on YARN-10674: --- Thanks a lot [~gandras] for patient review. [~pbacsko] I have updated

[jira] [Updated] (YARN-10692) Add Node GPU Utilization and apply to NodeMetrics.

2021-03-17 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10692: -- Attachment: YARN-10692.002.patch > Add Node GPU Utilization and apply to NodeMetrics. >

[jira] [Commented] (YARN-10701) The yarn.resource-types should support multi types without trimmed.

2021-03-17 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17303476#comment-17303476 ] Qi Zhu commented on YARN-10701: --- [~pbacsko] Fixed checkstyle in latest patch.   > The

[jira] [Commented] (YARN-10692) Add Node GPU Utilization and apply to NodeMetrics.

2021-03-17 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17303511#comment-17303511 ] Qi Zhu commented on YARN-10692: --- [~ebadger] [~gandras] [~pbacsko] Updated this in latest patch. Thanks.

[jira] [Comment Edited] (YARN-9618) NodeListManager event improvement

2021-03-17 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-9618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17303124#comment-17303124 ] Qi Zhu edited comment on YARN-9618 at 3/17/21, 9:20 AM: [~gandras] [~ebadger] 

[jira] [Updated] (YARN-10701) The yarn.resource-types should support multi types without trimmed.

2021-03-17 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10701: -- Description: {code:java} yarn.resource-types yarn.io/gpu, yarn.io/fpga {code} When i configured the

[jira] [Commented] (YARN-10503) Support queue capacity in terms of absolute resources with gpu resourceType.

2021-03-17 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17303206#comment-17303206 ] Qi Zhu commented on YARN-10503: --- Thanks [~gandras] [~ebadger] for review: Actually, i just want to support

[jira] [Commented] (YARN-10497) Fix an issue in CapacityScheduler which fails to delete queues

2021-03-17 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10497?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17303208#comment-17303208 ] Qi Zhu commented on YARN-10497: --- Thanks [~gandras] for review. I removed the mocking logic in

[jira] [Commented] (YARN-10701) The yarn.resource-types should support multi types without trimmed.

2021-03-17 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17303233#comment-17303233 ] Qi Zhu commented on YARN-10701: --- cc [~pbacsko] [~ebadger] [~epayne] [~gandras]  [~bteke] When i tested my

[jira] [Commented] (YARN-9618) NodeListManager event improvement

2021-03-17 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-9618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17303278#comment-17303278 ] Qi Zhu commented on YARN-9618: -- The test is not related. > NodeListManager event improvement >

[jira] [Comment Edited] (YARN-9618) NodeListManager event improvement

2021-03-17 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-9618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17303278#comment-17303278 ] Qi Zhu edited comment on YARN-9618 at 3/17/21, 10:25 AM: - The test error is not

[jira] [Updated] (YARN-10638) Add fair call queue support to event processing queue.

2021-03-17 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10638: -- Parent: YARN-10695 Issue Type: Sub-task (was: New Feature) > Add fair call queue support to event

[jira] [Commented] (YARN-10692) Add Node GPU Utilization and apply to NodeMetrics.

2021-03-17 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17303204#comment-17303204 ] Qi Zhu commented on YARN-10692: --- Thanks [~ebadger] [~gandras] for review * If gpuList size is zero, you

[jira] [Created] (YARN-10701) The yarn.resource-types should support multi types without trimmed.

2021-03-17 Thread Qi Zhu (Jira)
Qi Zhu created YARN-10701: - Summary: The yarn.resource-types should support multi types without trimmed. Key: YARN-10701 URL: https://issues.apache.org/jira/browse/YARN-10701 Project: Hadoop YARN

[jira] [Commented] (YARN-10497) Fix an issue in CapacityScheduler which fails to delete queues

2021-03-17 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10497?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17303247#comment-17303247 ] Qi Zhu commented on YARN-10497: --- Thanks  [~pbacsko]  for confirm.  > Fix an issue in CapacityScheduler

[jira] [Updated] (YARN-10497) Fix an issue in CapacityScheduler which fails to delete queues

2021-03-17 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10497?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10497: -- Attachment: YARN-10497.006.patch > Fix an issue in CapacityScheduler which fails to delete queues >

[jira] [Commented] (YARN-10616) Nodemanagers cannot detect GPU failures

2021-03-18 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10616?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17304596#comment-17304596 ] Qi Zhu commented on YARN-10616: --- Thanks [~ebadger] for clarify. It make sense to me now. If we can realize

[jira] [Created] (YARN-10704) The CS effective capacity for absolute mode in UI should support GPU.

2021-03-18 Thread Qi Zhu (Jira)
Qi Zhu created YARN-10704: - Summary: The CS effective capacity for absolute mode in UI should support GPU. Key: YARN-10704 URL: https://issues.apache.org/jira/browse/YARN-10704 Project: Hadoop YARN

[jira] [Comment Edited] (YARN-10704) The CS effective capacity for absolute mode in UI should support GPU.

2021-03-19 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10704?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17304701#comment-17304701 ] Qi Zhu edited comment on YARN-10704 at 3/19/21, 8:00 AM: - cc [~pbacsko] 

[jira] [Commented] (YARN-10704) The CS effective capacity for absolute mode in UI should support GPU.

2021-03-19 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10704?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17304701#comment-17304701 ] Qi Zhu commented on YARN-10704: --- cc [~pbacsko]  [~gandras] [~ebadger]   Could you help review this, i

[jira] [Updated] (YARN-10497) Fix an issue in CapacityScheduler which fails to delete queues

2021-03-09 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10497?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10497: -- Attachment: YARN-10497.005.patch > Fix an issue in CapacityScheduler which fails to delete queues >

[jira] [Commented] (YARN-10497) Fix an issue in CapacityScheduler which fails to delete queues

2021-03-09 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10497?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17298059#comment-17298059 ] Qi Zhu commented on YARN-10497: --- [~shuzirra] [~pbacsko] Updated a patch to use getTrimmedStringCollection

[jira] [Resolved] (YARN-10650) Create dispatcher metrics interface, and apply to RM async dispatcher.

2021-03-10 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu resolved YARN-10650. --- Resolution: Duplicate > Create dispatcher metrics interface, and apply to RM async dispatcher. >

[jira] [Commented] (YARN-10674) fs2cs: should support auto created queue deletion.

2021-03-10 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17298802#comment-17298802 ] Qi Zhu commented on YARN-10674: --- Thanks [~pbacsko] reply. The failed is not related to this.  I am glad

[jira] [Commented] (YARN-10685) Fixed some Typo in AbstractCSQueue.

2021-03-10 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17298872#comment-17298872 ] Qi Zhu commented on YARN-10685: --- cc [~pbacsko] Just find some Typo in AbstractCSQueue. Could you help

[jira] [Updated] (YARN-10685) Fixed some Typo in AbstractCSQueue.

2021-03-10 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10685: -- Attachment: YARN-10685.001.patch > Fixed some Typo in AbstractCSQueue. >

[jira] [Commented] (YARN-10571) Refactor dynamic queue handling logic

2021-03-10 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10571?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17298867#comment-17298867 ] Qi Zhu commented on YARN-10571: --- Thanks [~gandras] for the patch. LGTM,  just fix the Javadoc and the

[jira] [Commented] (YARN-10674) fs2cs: should support auto created queue deletion.

2021-03-10 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17298910#comment-17298910 ] Qi Zhu commented on YARN-10674: --- [~pbacsko] Fixed in latest patch, waiting for Jenkins. > fs2cs: should

[jira] [Updated] (YARN-10674) fs2cs: should support auto created queue deletion.

2021-03-10 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10674: -- Attachment: YARN-10674.004.patch > fs2cs: should support auto created queue deletion. >

[jira] [Created] (YARN-10685) Fixed some Typo in AbstractCSQueue.

2021-03-10 Thread Qi Zhu (Jira)
Qi Zhu created YARN-10685: - Summary: Fixed some Typo in AbstractCSQueue. Key: YARN-10685 URL: https://issues.apache.org/jira/browse/YARN-10685 Project: Hadoop YARN Issue Type: Bug

[jira] [Commented] (YARN-10674) fs2cs: should support auto created queue deletion.

2021-03-10 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17298878#comment-17298878 ] Qi Zhu commented on YARN-10674: --- Thanks [~pbacsko] for very valid suggestion. It make sense to me, and

[jira] [Commented] (YARN-9615) Add dispatcher metrics to RM

2021-03-09 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-9615?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17298072#comment-17298072 ] Qi Zhu commented on YARN-9615: -- Thanks [~pbacsko] for commit and review. > Add dispatcher metrics to RM >

[jira] [Updated] (YARN-10682) The scheduler monitor policies conf should support trim between ",".

2021-03-09 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10682?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10682: -- Description: When i configured scheduler monitor policies with space, the RM will start with error. The conf

[jira] [Updated] (YARN-10682) The scheduler monitor policies conf should support trim between ",".

2021-03-09 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10682?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10682: -- Description: When i configured scheduler monitor policies with space, the RM will start with error. The conf

[jira] [Comment Edited] (YARN-10674) fs2cs: should support auto created queue deletion.

2021-03-09 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17298076#comment-17298076 ] Qi Zhu edited comment on YARN-10674 at 3/9/21, 2:10 PM: Thanks [~pbacsko] for

[jira] [Commented] (YARN-10674) fs2cs: should support auto created queue deletion.

2021-03-09 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17298076#comment-17298076 ] Qi Zhu commented on YARN-10674: --- Thanks [~pbacsko] for review. It not depends on YARN-10682 , "," already

[jira] [Commented] (YARN-8823) Monitor the healthy state of GPU

2021-03-09 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-8823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17298127#comment-17298127 ] Qi Zhu commented on YARN-8823: -- [~adam.antal] [~tangzhankun] Is this going on? "I was wondering if this

[jira] [Commented] (YARN-10674) fs2cs: should support auto created queue deletion.

2021-03-09 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17298120#comment-17298120 ] Qi Zhu commented on YARN-10674: --- Thanks [~pbacsko] for patient review, your suggestion is valid. I have

[jira] [Comment Edited] (YARN-10674) fs2cs: should support auto created queue deletion.

2021-03-09 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17298076#comment-17298076 ] Qi Zhu edited comment on YARN-10674 at 3/9/21, 1:54 PM: Thanks [~pbacsko] for

[jira] [Commented] (YARN-10685) Fixed some Typo in AbstractCSQueue.

2021-03-10 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17298923#comment-17298923 ] Qi Zhu commented on YARN-10685: --- Submitted to trigger jenkins. > Fixed some Typo in AbstractCSQueue. >

[jira] [Updated] (YARN-10685) Fixed some Typo in AbstractCSQueue.

2021-03-10 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10685: -- Attachment: (was: YARN-10685.001.patch) > Fixed some Typo in AbstractCSQueue. >

[jira] [Commented] (YARN-10685) Fixed some Typo in AbstractCSQueue.

2021-03-10 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17299242#comment-17299242 ] Qi Zhu commented on YARN-10685: --- [~pbacsko] fixed the checkstyle in latest patch. > Fixed some Typo in

[jira] [Updated] (YARN-10685) Fixed some Typo in AbstractCSQueue.

2021-03-10 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10685: -- Attachment: YARN-10685.002.patch > Fixed some Typo in AbstractCSQueue. >

[jira] [Commented] (YARN-10674) fs2cs: should support auto created queue deletion.

2021-03-10 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17299245#comment-17299245 ] Qi Zhu commented on YARN-10674: --- [~pbacsko] [~gandras] Fixed the checkstyle in latest patch.:D > fs2cs:

<    1   2   3   4   5   6   7   8   9   10   >