[jira] [Commented] (YARN-10616) Nodemanagers cannot detect GPU failures

2021-03-15 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10616?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17302164#comment-17302164 ] Zhankun Tang commented on YARN-10616: - [~ebadger], Thanks for picking this up. The YARN-8823 had this

[jira] [Resolved] (YARN-9650) Set thread names for CapacityScheduler AsyncScheduleThread

2021-02-08 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang resolved YARN-9650. Fix Version/s: 3.4.0 Resolution: Fixed > Set thread names for CapacityScheduler

[jira] [Commented] (YARN-9650) Set thread names for CapacityScheduler AsyncScheduleThread

2021-02-08 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17281495#comment-17281495 ] Zhankun Tang commented on YARN-9650: [~zhuqi], Thanks for the review. [~amoghdesai], thanks for the

[jira] [Commented] (YARN-10610) Add queuePath to restful api for CapacityScheduler consistent with FairScheduler queuePath.

2021-02-04 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17279391#comment-17279391 ] Zhankun Tang commented on YARN-10610: - Thanks for the contribution [~Qi Zhu]. please fix the new

[jira] [Commented] (YARN-9650) Set thread names for CapacityScheduler AsyncScheduleThread

2021-02-04 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17279304#comment-17279304 ] Zhankun Tang commented on YARN-9650: [~amoghdesai] Thanks for the contribution. It looks good to me.

[jira] [Comment Edited] (YARN-10589) Improve logic of multi-node allocation

2021-02-02 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17276973#comment-17276973 ] Zhankun Tang edited comment on YARN-10589 at 2/2/21, 10:02 AM: --- [~zhuqi],

[jira] [Commented] (YARN-10589) Improve logic of multi-node allocation

2021-02-02 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17276973#comment-17276973 ] Zhankun Tang commented on YARN-10589: - [~zhuqi], Thanks a lot for the review! [~tanu.ajmera], I'm not

[jira] [Commented] (YARN-10589) Improve logic of multi-node allocation

2021-02-01 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17276170#comment-17276170 ] Zhankun Tang commented on YARN-10589: - [~zhuqi], could you please review Tanu's patch too? > Improve

[jira] [Commented] (YARN-10352) Skip schedule on not heartbeated nodes in Multi Node Placement

2021-01-28 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17274196#comment-17274196 ] Zhankun Tang commented on YARN-10352: - Sorry for the late reply. Thanks for the contribution

[jira] [Commented] (YARN-10463) For Federation, we should support getApplicationAttemptReport.

2020-12-20 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17252569#comment-17252569 ] Zhankun Tang commented on YARN-10463: - [~BilwaST] Thanks for the review. [~zhuqi] Thanks for the

[jira] [Updated] (YARN-10463) For Federation, we should support getApplicationAttemptReport.

2020-12-20 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-10463: Fix Version/s: 3.4.0 > For Federation, we should support getApplicationAttemptReport. >

[jira] [Commented] (YARN-10463) For Federation, we should support getApplicationAttemptReport.

2020-12-17 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17251497#comment-17251497 ] Zhankun Tang commented on YARN-10463: - [~zhuqi], I triggered a new CI and it failed. I guess it needs

[jira] [Commented] (YARN-10463) For Federation, we should support getApplicationAttemptReport.

2020-12-10 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17247657#comment-17247657 ] Zhankun Tang commented on YARN-10463: - [~zhuqi], Thanks for the contribution. [~BilwaST], I can help

[jira] [Commented] (YARN-10380) Import logic of multi-node allocation in CapacityScheduler

2020-12-09 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17246473#comment-17246473 ] Zhankun Tang commented on YARN-10380: - [~jiwq] Thanks for the review! [~zhuqi] Thanks for the

[jira] [Commented] (YARN-10380) Import logic of multi-node allocation in CapacityScheduler

2020-12-01 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17241360#comment-17241360 ] Zhankun Tang commented on YARN-10380: - [~zhuqi], It should be no problem to merge it if you've tested

[jira] [Commented] (YARN-10380) Import logic of multi-node allocation in CapacityScheduler

2020-11-30 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17241243#comment-17241243 ] Zhankun Tang commented on YARN-10380: - [~zhuqi], Thanks a lot for the contributions! It looks good to

[jira] [Commented] (YARN-10333) YarnClient obtain Delegation Token for Log Aggregation Path

2020-07-08 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17153405#comment-17153405 ] Zhankun Tang commented on YARN-10333: - It LGTM. +1. Thanks for your contribution! [~prabhujoseph],

[jira] [Commented] (YARN-10307) /leveldb-timeline-store.ldb/LOCK not exist

2020-06-04 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17126357#comment-17126357 ] Zhankun Tang commented on YARN-10307: - [~appleyuchi], IIRC, I don't think the "Hive on Tez" depends

[jira] [Commented] (YARN-10302) Support custom packing algorithm for FairScheduler

2020-06-01 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17121484#comment-17121484 ] Zhankun Tang commented on YARN-10302: - [~billgraham], thanks for the contribution. Could you please

[jira] [Commented] (YARN-10248) when config allowed-gpu-devices , excluded GPUs still be visible to containers

2020-05-12 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17105933#comment-17105933 ] Zhankun Tang commented on YARN-10248: - [~jasstionzyf], do you mean the existing test case

[jira] [Commented] (YARN-10248) when config allowed-gpu-devices , excluded GPUs still be visible to containers

2020-04-28 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17095035#comment-17095035 ] Zhankun Tang commented on YARN-10248: - [~jasstionzyf], Thanks for the contribution! Hadoop GitHub

[jira] [Assigned] (YARN-10248) when config allowed-gpu-devices , excluded GPUs still be visible to containers

2020-04-28 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10248?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang reassigned YARN-10248: --- Assignee: zhao yufei > when config allowed-gpu-devices , excluded GPUs still be visible to

[jira] [Commented] (YARN-10225) Support of AMD ROCm GPUs in Yarn

2020-04-08 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17078340#comment-17078340 ] Zhankun Tang commented on YARN-10225: - Not sure if YARN-8851 can help here. You can try to write a

[jira] [Commented] (YARN-10200) Add number of containers to RMAppManager summary

2020-03-24 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17066316#comment-17066316 ] Zhankun Tang commented on YARN-10200: - [~jhung], Thanks for the update. Looks better now. +1. > Add

[jira] [Commented] (YARN-10200) Add number of containers to RMAppManager summary

2020-03-24 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17065399#comment-17065399 ] Zhankun Tang commented on YARN-10200: - [~jhung], Thanks for the patch. +1 from me. Just one minor

[jira] [Commented] (YARN-9605) Add ZkConfiguredFailoverProxyProvider for RM HA

2020-01-21 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17020046#comment-17020046 ] Zhankun Tang commented on YARN-9605: [~cane], let me trigger again. Yeah. It seems the cc WARNING is

[jira] [Resolved] (YARN-8851) [Umbrella] A pluggable device plugin framework to ease vendor plugin development

2020-01-08 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-8851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang resolved YARN-8851. Fix Version/s: 3.3.0 Resolution: Fixed > [Umbrella] A pluggable device plugin framework to

[jira] [Commented] (YARN-8851) [Umbrella] A pluggable device plugin framework to ease vendor plugin development

2020-01-08 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-8851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17010470#comment-17010470 ] Zhankun Tang commented on YARN-8851: [~brahmareddy], thanks for planning the 3.3.0 release. Yeah. Let

[jira] [Commented] (YARN-10048) NodeManager fails to start after mounting CGroup

2019-12-19 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10048?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17000594#comment-17000594 ] Zhankun Tang commented on YARN-10048: - [~Sen Zhao], thanks for catching this. Let me understand this,

[jira] [Commented] (YARN-10042) Uupgrade grpc-xxx depdencies to 1.26.0

2019-12-19 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10042?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17000578#comment-17000578 ] Zhankun Tang commented on YARN-10042: - [~cheersyang], thanks for the review. Committed to trunk.

[jira] [Updated] (YARN-10042) Uupgrade grpc-xxx depdencies to 1.26.0

2019-12-19 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10042?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-10042: Fix Version/s: 3.3.0 > Uupgrade grpc-xxx depdencies to 1.26.0 >

[jira] [Commented] (YARN-10041) Should not use AbstractPath to create unix domain socket

2019-12-19 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17000569#comment-17000569 ] Zhankun Tang commented on YARN-10041: - [~bzhaoopenstack], [~liusheng], could you please upload patch

[jira] [Commented] (YARN-10042) Uupgrade grpc-xxx depdencies to 1.26.0

2019-12-19 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10042?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=1658#comment-1658 ] Zhankun Tang commented on YARN-10042: - [~seanlau], Thanks for catching this. The patch looks good to

[jira] [Commented] (YARN-10041) Should not use AbstractPath to create unix domain socket

2019-12-18 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16998975#comment-16998975 ] Zhankun Tang commented on YARN-10041: - [~bzhaoopenstack], thanks for catching this. Would you like to

[jira] [Commented] (YARN-9605) Add ZkConfiguredFailoverProxyProvider for RM HA

2019-11-05 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16968014#comment-16968014 ] Zhankun Tang commented on YARN-9605: [~cane], I triggered a new build and let's see. > Add

[jira] [Comment Edited] (YARN-9011) Race condition during decommissioning

2019-10-29 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16961935#comment-16961935 ] Zhankun Tang edited comment on YARN-9011 at 10/29/19 12:15 PM: --- [~pbacsko],

[jira] [Comment Edited] (YARN-9011) Race condition during decommissioning

2019-10-29 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16961935#comment-16961935 ] Zhankun Tang edited comment on YARN-9011 at 10/29/19 12:14 PM: --- [~pbacsko],

[jira] [Commented] (YARN-9011) Race condition during decommissioning

2019-10-29 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16961935#comment-16961935 ] Zhankun Tang commented on YARN-9011: [~pbacsko], Thanks for the explanation. After the offline sync

[jira] [Comment Edited] (YARN-9011) Race condition during decommissioning

2019-10-29 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16961621#comment-16961621 ] Zhankun Tang edited comment on YARN-9011 at 10/29/19 11:54 AM: --- [~pbacsko],

[jira] [Comment Edited] (YARN-9011) Race condition during decommissioning

2019-10-28 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16961621#comment-16961621 ] Zhankun Tang edited comment on YARN-9011 at 10/29/19 2:49 AM: -- [~pbacsko],

[jira] [Commented] (YARN-9011) Race condition during decommissioning

2019-10-28 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16961621#comment-16961621 ] Zhankun Tang commented on YARN-9011: [~pbacsko], Thanks for the new patch. The idea looks good to me.

[jira] [Comment Edited] (YARN-9931) Support run script before kill container

2019-10-28 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16960938#comment-16960938 ] Zhankun Tang edited comment on YARN-9931 at 10/28/19 11:08 AM: --- [~cane],

[jira] [Commented] (YARN-9931) Support run script before kill container

2019-10-28 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16960938#comment-16960938 ] Zhankun Tang commented on YARN-9931: [~cane], do you have a sample patch? > Support run script before

[jira] [Commented] (YARN-9748) Allow capacity-scheduler configuration on HDFS and support reload from HDFS

2019-10-28 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9748?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16960934#comment-16960934 ] Zhankun Tang commented on YARN-9748: [~cane], could you please clarify your requirement? > Allow

[jira] [Updated] (YARN-9921) Issue in PlacementConstraint when YARN Service AM retries allocation on component failure.

2019-10-23 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9921: --- Fix Version/s: 3.1.4 3.3.0 > Issue in PlacementConstraint when YARN Service AM

[jira] [Commented] (YARN-9921) Issue in PlacementConstraint when YARN Service AM retries allocation on component failure.

2019-10-23 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9921?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16958465#comment-16958465 ] Zhankun Tang commented on YARN-9921: [~prabhujoseph], Thanks for the review. [~tarunparimi], Thanks

[jira] [Commented] (YARN-9921) Issue in PlacementConstraint when YARN Service AM retries allocation on component failure.

2019-10-23 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9921?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957693#comment-16957693 ] Zhankun Tang commented on YARN-9921: [~Prabhu Joseph], [~sunilg], if no more comment. I'll commit it

[jira] [Commented] (YARN-9921) Issue in PlacementConstraint when YARN Service AM retries allocation on component failure.

2019-10-21 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9921?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16955769#comment-16955769 ] Zhankun Tang commented on YARN-9921: [~tarunparimi], Thanks for reproducing it and find the root

[jira] [Commented] (YARN-9861) The ResourceManager log reports an error "Too many open files", the analysis is related to the service

2019-09-27 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16939320#comment-16939320 ] Zhankun Tang commented on YARN-9861: [~billie.rinaldi], if any chance, could you please take a look at

[jira] [Updated] (YARN-9861) The ResourceManager log reports an error "Too many open files", the analysis is related to the service

2019-09-27 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9861?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9861: --- Attachment: submarine_kerasgesv2date20190807.json > The ResourceManager log reports an error "Too

[jira] [Commented] (YARN-9011) Race condition during decommissioning

2019-09-24 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16936635#comment-16936635 ] Zhankun Tang commented on YARN-9011: [~pbacsko], I see. I may be missing something important. What

[jira] [Commented] (YARN-9847) ZKRMStateStore will cause zk connection loss when writing huge data into znode

2019-09-24 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16936518#comment-16936518 ] Zhankun Tang commented on YARN-9847: [~suxingfate], Thanks for the clarification. It looks good to me.

[jira] [Commented] (YARN-9847) ZKRMStateStore will cause zk connection loss when writing huge data into znode

2019-09-24 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16936476#comment-16936476 ] Zhankun Tang commented on YARN-9847: [~suxingfate], I see. Thanks! One question on the patch. In the

[jira] [Commented] (YARN-9847) ZKRMStateStore will cause zk connection loss when writing huge data into znode

2019-09-23 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16936333#comment-16936333 ] Zhankun Tang commented on YARN-9847: [~suxingfate], thanks for the clarification! Is this duplicated

[jira] [Commented] (YARN-9011) Race condition during decommissioning

2019-09-23 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16936329#comment-16936329 ] Zhankun Tang commented on YARN-9011: [~pbacsko], Thanks for the elaboration. Not sure if I understand

[jira] [Commented] (YARN-9847) ZKRMStateStore will cause zk connection loss when writing huge data into znode

2019-09-20 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16934125#comment-16934125 ] Zhankun Tang commented on YARN-9847: [~suxingfate], thanks for reporting this. This is interesting.

[jira] [Commented] (YARN-9612) Support using ip to register NodeID

2019-09-08 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9612?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16925321#comment-16925321 ] Zhankun Tang commented on YARN-9612: [~cane], the background and the motivation still not clear to me.

[jira] [Commented] (YARN-9605) Add ZkConfiguredFailoverProxyProvider for RM HA

2019-09-08 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16925317#comment-16925317 ] Zhankun Tang commented on YARN-9605: [~cane], Thanks for contributing this. I saw there're failures in

[jira] [Commented] (YARN-9739) appsTableData in AppsBlock may cause OOM

2019-09-08 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16925308#comment-16925308 ] Zhankun Tang commented on YARN-9739: [~cane], Thanks for catching this point. Do you mean we should

[jira] [Updated] (YARN-9785) Fix DominantResourceCalculator when one resource is zero

2019-09-04 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9785?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9785: --- Fix Version/s: 3.1.3 > Fix DominantResourceCalculator when one resource is zero >

[jira] [Updated] (YARN-9785) Fix DominantResourceCalculator when one resource is zero

2019-09-03 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9785?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9785: --- Fix Version/s: 3.2.1 3.3.0 > Fix DominantResourceCalculator when one resource is

[jira] [Commented] (YARN-9785) Fix DominantResourceCalculator when one resource is zero

2019-09-03 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16921207#comment-16921207 ] Zhankun Tang commented on YARN-9785: [~bibinchundatt], this has been committed to trunk and

[jira] [Commented] (YARN-9797) LeafQueue#activateApplications should use resourceCalculator#fitsIn

2019-09-02 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9797?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16921128#comment-16921128 ] Zhankun Tang commented on YARN-9797: Thanks, [~bibinchundatt], [~BilwaST]. +1 from me. cc [~sunilg]

[jira] [Commented] (YARN-9785) Fix DominantResourceCalculator when one resource is zero

2019-09-02 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16921125#comment-16921125 ] Zhankun Tang commented on YARN-9785: +1 as well. Will commit this soon. > Fix

[jira] [Commented] (YARN-9797) LeafQueue#activateApplications should use resourceCalculator#fitsIn

2019-08-29 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9797?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16918478#comment-16918478 ] Zhankun Tang commented on YARN-9797: [~BilwaST], Thanks for the patch and [~bibinchundatt] for the

[jira] [Commented] (YARN-9785) Application gets activated even when AM memory has reached

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16916269#comment-16916269 ] Zhankun Tang commented on YARN-9785: [~BilwaST], Thanks for reporting this. We're going to have branch

[jira] [Commented] (YARN-9607) Auto-configuring rollover-size of IFile format for non-appendable filesystems

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16915831#comment-16915831 ] Zhankun Tang commented on YARN-9607: Bulk update: Preparing for 3.1.3 release. moved all 3.1.3

[jira] [Commented] (YARN-9718) Yarn REST API, services endpoint remote command ejection

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9718?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16915826#comment-16915826 ] Zhankun Tang commented on YARN-9718: Bulk update: Preparing for 3.1.3 release. moved all 3.1.3

[jira] [Updated] (YARN-9718) Yarn REST API, services endpoint remote command ejection

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9718: --- Target Version/s: 3.3.0, 3.2.1, 3.1.4 (was: 3.3.0, 3.2.1, 3.1.3) > Yarn REST API, services endpoint

[jira] [Updated] (YARN-9607) Auto-configuring rollover-size of IFile format for non-appendable filesystems

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9607: --- Target Version/s: 3.3.0, 3.2.1, 3.1.4 (was: 3.3.0, 3.2.1, 3.1.3) > Auto-configuring rollover-size of

[jira] [Updated] (YARN-8453) Additional Unit tests to verify queue limit and max-limit with multiple resource types

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-8453?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-8453: --- Target Version/s: 3.0.4, 3.1.4 (was: 3.0.4, 3.1.3) > Additional Unit tests to verify queue limit

[jira] [Commented] (YARN-8453) Additional Unit tests to verify queue limit and max-limit with multiple resource types

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-8453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16915818#comment-16915818 ] Zhankun Tang commented on YARN-8453: Bulk update: Preparing for 3.1.3 release. moved all 3.1.3

[jira] [Commented] (YARN-9642) AbstractYarnScheduler#clearPendingContainerCache could run even after transitiontostandby

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16915799#comment-16915799 ] Zhankun Tang commented on YARN-9642: Triggered a rebuild just now. Let's see the result if it finishes

[jira] [Updated] (YARN-8257) Native service should automatically adding escapes for environment/launch cmd before sending to YARN

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-8257?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-8257: --- Target Version/s: 3.1.4 (was: 3.1.3) Bulk update: Preparing for 3.1.3 release. Moved all 3.1.3

[jira] [Updated] (YARN-8417) Should skip passing HDFS_HOME, HADOOP_CONF_DIR, JAVA_HOME, etc. to Docker container.

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-8417?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-8417: --- Target Version/s: 3.1.4 (was: 3.1.3) Bulk update: Preparing for 3.1.3 release. Moved all 3.1.3

[jira] [Updated] (YARN-8052) Move overwriting of service definition during flex to service master

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-8052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-8052: --- Target Version/s: 3.1.4 (was: 3.1.3) Bulk update: Preparing for 3.1.3 release. Moved all 3.1.3

[jira] [Updated] (YARN-8552) [DS] Container report fails for distributed containers

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-8552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-8552: --- Target Version/s: 3.1.4 (was: 3.1.3) Bulk update: Preparing for 3.1.3 release. Moved all 3.1.3

[jira] [Updated] (YARN-8234) Improve RM system metrics publisher's performance by pushing events to timeline server in batch

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-8234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-8234: --- Target Version/s: 3.1.4 (was: 3.1.3) Bulk update: Preparing for 3.1.3 release. Moved all 3.1.3

[jira] [Updated] (YARN-9376) too many ContainerIdComparator instances are not necessary

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9376: --- Bulk update: Preparing for 3.1.3 release. Moved the incorrect "3.1.2" non-blocker issues to 3.1.4,

[jira] [Updated] (YARN-9330) Add support to query scheduler endpoint filtered via queue (/scheduler/queue=abc)

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9330: --- Bulk update: Preparing for 3.1.3 release. Moved the incorrect "3.1.2" non-blocker issues to 3.1.4,

[jira] [Updated] (YARN-9674) Max AM Resource calculation is wrong

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9674: --- Bulk update: Preparing for 3.1.3 release. Moved the incorrect "3.1.2" non-blocker issues to 3.1.4,

[jira] [Updated] (YARN-8657) User limit calculation should be read-lock-protected within LeafQueue

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-8657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-8657: --- Bulk update: Preparing for 3.1.3 release. Moved the incorrect "3.1.2" non-blocker issues to 3.1.4,

[jira] [Updated] (YARN-9720) MR job submitted to a queue with default partition accessing the non-exclusive label resources

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9720: --- Bulk update: Preparing for 3.1.3 release. Moved the incorrect "3.1.2" non-blocker issues to 3.1.4,

[jira] [Updated] (YARN-9681) AM resource limit is incorrect for queue

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9681: --- Bulk update: Preparing for 3.1.3 release. Moved the incorrect "3.1.2" non-blocker issues to 3.1.4,

[jira] [Updated] (YARN-9674) Max AM Resource calculation is wrong

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9674: --- Target Version/s: 3.1.4 (was: 3.1.2) > Max AM Resource calculation is wrong >

[jira] [Updated] (YARN-9376) too many ContainerIdComparator instances are not necessary

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9376: --- Target Version/s: 3.1.4 (was: 3.1.2) > too many ContainerIdComparator instances are not necessary >

[jira] [Updated] (YARN-9720) MR job submitted to a queue with default partition accessing the non-exclusive label resources

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9720: --- Target Version/s: 3.1.4 (was: 3.1.2) > MR job submitted to a queue with default partition accessing

[jira] [Updated] (YARN-9681) AM resource limit is incorrect for queue

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9681: --- Target Version/s: 3.1.4 (was: 3.1.2) > AM resource limit is incorrect for queue >

[jira] [Updated] (YARN-9330) Add support to query scheduler endpoint filtered via queue (/scheduler/queue=abc)

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9330: --- Target Version/s: 3.1.4 (was: 3.1.2) > Add support to query scheduler endpoint filtered via queue >

[jira] [Updated] (YARN-9106) Add option to graceful decommission to not wait for applications

2019-08-13 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9106: --- Issue Type: Sub-task (was: Improvement) Parent: YARN-914 > Add option to graceful

[jira] [Comment Edited] (YARN-9721) An easy method to exclude a nodemanager from the yarn cluster cleanly

2019-08-06 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16901656#comment-16901656 ] Zhankun Tang edited comment on YARN-9721 at 8/7/19 3:20 AM: [~yuan_zac],

[jira] [Commented] (YARN-9721) An easy method to exclude a nodemanager from the yarn cluster cleanly

2019-08-06 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16901656#comment-16901656 ] Zhankun Tang commented on YARN-9721: [~yuan_zac], Thanks for raising this issue! This is very helpful

[jira] [Updated] (YARN-9584) Should put initializeProcessTrees method call before get pid

2019-07-05 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9584?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9584: --- Fix Version/s: (was: 3.1.2) (was: 3.0.3) (was:

[jira] [Updated] (YARN-9584) Should put initializeProcessTrees method call before get pid

2019-07-05 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9584?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9584: --- Fix Version/s: (was: 3.2.0) > Should put initializeProcessTrees method call before get pid >

[jira] [Updated] (YARN-9584) Should put initializeProcessTrees method call before get pid

2019-07-05 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9584?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9584: --- Fix Version/s: (was: 3.1.0) > Should put initializeProcessTrees method call before get pid >

[jira] [Commented] (YARN-9480) createAppDir() in LogAggregationService shouldn't block dispatcher thread of ContainerManagerImpl

2019-07-01 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16876210#comment-16876210 ] Zhankun Tang commented on YARN-9480: [~yoelee], added [~Yunyao Zhang]. Thanks [~Weiwei Yang] ! >

[jira] [Commented] (YARN-9640) Slow event processing could cause too many attempt unregister events

2019-06-28 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16874723#comment-16874723 ] Zhankun Tang commented on YARN-9640: [~bibinchundatt], yeah. agree. > Slow event processing could

[jira] [Commented] (YARN-9477) Implement VE discovery using libudev

2019-06-26 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9477?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16873460#comment-16873460 ] Zhankun Tang commented on YARN-9477: [~snemeth], thanks for the review. [~pbacsko], Thanks for the

[jira] [Commented] (YARN-9640) Slow event processing could cause too many attempt unregister events

2019-06-23 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16870582#comment-16870582 ] Zhankun Tang commented on YARN-9640: [~bibinchundatt] , Thanks for the patch! One question is that how
