[jira] [Commented] (YARN-10333) YarnClient obtain Delegation Token for Log Aggregation Path

2020-07-08 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17153405#comment-17153405 ] Zhankun Tang commented on YARN-10333: - It LGTM. +1. Thanks for your contribution! [~p

[jira] [Commented] (YARN-10380) Import logic of multi-node allocation in CapacityScheduler

2020-11-30 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17241243#comment-17241243 ] Zhankun Tang commented on YARN-10380: - [~zhuqi], Thanks a lot for the contributions!

[jira] [Commented] (YARN-10380) Import logic of multi-node allocation in CapacityScheduler

2020-12-01 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17241360#comment-17241360 ] Zhankun Tang commented on YARN-10380: - [~zhuqi], It should be no problem to merge it

[jira] [Commented] (YARN-10380) Import logic of multi-node allocation in CapacityScheduler

2020-12-09 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17246473#comment-17246473 ] Zhankun Tang commented on YARN-10380: - [~jiwq] Thanks for the review! [~zhuqi] Thanks

[jira] [Commented] (YARN-10463) For Federation, we should support getApplicationAttemptReport.

2020-12-10 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17247657#comment-17247657 ] Zhankun Tang commented on YARN-10463: - [~zhuqi], Thanks for the contribution. [~Bilwa

[jira] [Commented] (YARN-10463) For Federation, we should support getApplicationAttemptReport.

2020-12-17 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17251497#comment-17251497 ] Zhankun Tang commented on YARN-10463: - [~zhuqi], I triggered a new CI and it failed.

[jira] [Updated] (YARN-10463) For Federation, we should support getApplicationAttemptReport.

2020-12-20 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-10463: Fix Version/s: 3.4.0 > For Federation, we should support getApplicationAttemptReport. > ---

[jira] [Commented] (YARN-10463) For Federation, we should support getApplicationAttemptReport.

2020-12-20 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17252569#comment-17252569 ] Zhankun Tang commented on YARN-10463: - [~BilwaST] Thanks for the review. [~zhuqi] Tha

[jira] [Commented] (YARN-10352) Skip schedule on not heartbeated nodes in Multi Node Placement

2021-01-28 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17274196#comment-17274196 ] Zhankun Tang commented on YARN-10352: - Sorry for the late reply. Thanks for the contr

[jira] [Commented] (YARN-10589) Improve logic of multi-node allocation

2021-02-01 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17276170#comment-17276170 ] Zhankun Tang commented on YARN-10589: - [~zhuqi], could you please review Tanu's patch

[jira] [Commented] (YARN-10589) Improve logic of multi-node allocation

2021-02-02 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17276973#comment-17276973 ] Zhankun Tang commented on YARN-10589: - [~zhuqi], Thanks a lot for the review! [~tanu.

[jira] [Comment Edited] (YARN-10589) Improve logic of multi-node allocation

2021-02-02 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17276973#comment-17276973 ] Zhankun Tang edited comment on YARN-10589 at 2/2/21, 10:02 AM:

[jira] [Commented] (YARN-9650) Set thread names for CapacityScheduler AsyncScheduleThread

2021-02-04 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17279304#comment-17279304 ] Zhankun Tang commented on YARN-9650: [~amoghdesai] Thanks for the contribution. It loo

[jira] [Commented] (YARN-10610) Add queuePath to restful api for CapacityScheduler consistent with FairScheduler queuePath.

2021-02-04 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17279391#comment-17279391 ] Zhankun Tang commented on YARN-10610: - Thanks for the contribution [~Qi Zhu]. please

[jira] [Commented] (YARN-9650) Set thread names for CapacityScheduler AsyncScheduleThread

2021-02-08 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17281495#comment-17281495 ] Zhankun Tang commented on YARN-9650: [~zhuqi], Thanks for the review. [~amoghdesai], t

[jira] [Resolved] (YARN-9650) Set thread names for CapacityScheduler AsyncScheduleThread

2021-02-08 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang resolved YARN-9650. Fix Version/s: 3.4.0 Resolution: Fixed > Set thread names for CapacityScheduler AsyncSchedule

[jira] [Commented] (YARN-10616) Nodemanagers cannot detect GPU failures

2021-03-15 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10616?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17302164#comment-17302164 ] Zhankun Tang commented on YARN-10616: - [~ebadger], Thanks for picking this up. The YA

[jira] [Comment Edited] (YARN-9721) An easy method to exclude a nodemanager from the yarn cluster cleanly

2019-08-06 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16901656#comment-16901656 ] Zhankun Tang edited comment on YARN-9721 at 8/7/19 3:20 AM: [~

[jira] [Commented] (YARN-9721) An easy method to exclude a nodemanager from the yarn cluster cleanly

2019-08-06 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16901656#comment-16901656 ] Zhankun Tang commented on YARN-9721: [~yuan_zac], Thanks for raising this issue! This

[jira] [Updated] (YARN-9106) Add option to graceful decommission to not wait for applications

2019-08-13 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9106: --- Issue Type: Sub-task (was: Improvement) Parent: YARN-914 > Add option to graceful decommissio

[jira] [Updated] (YARN-9330) Add support to query scheduler endpoint filtered via queue (/scheduler/queue=abc)

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9330: --- Target Version/s: 3.1.4 (was: 3.1.2) > Add support to query scheduler endpoint filtered via queue >

[jira] [Updated] (YARN-9376) too many ContainerIdComparator instances are not necessary

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9376: --- Target Version/s: 3.1.4 (was: 3.1.2) > too many ContainerIdComparator instances are not necessary > -

[jira] [Updated] (YARN-9720) MR job submitted to a queue with default partition accessing the non-exclusive label resources

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9720: --- Target Version/s: 3.1.4 (was: 3.1.2) > MR job submitted to a queue with default partition accessing t

[jira] [Updated] (YARN-9681) AM resource limit is incorrect for queue

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9681: --- Target Version/s: 3.1.4 (was: 3.1.2) > AM resource limit is incorrect for queue > ---

[jira] [Updated] (YARN-9674) Max AM Resource calculation is wrong

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9674: --- Target Version/s: 3.1.4 (was: 3.1.2) > Max AM Resource calculation is wrong > ---

[jira] [Updated] (YARN-8657) User limit calculation should be read-lock-protected within LeafQueue

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-8657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-8657: --- Bulk update: Preparing for 3.1.3 release. Moved the incorrect "3.1.2" non-blocker issues to 3.1.4, please

[jira] [Updated] (YARN-9720) MR job submitted to a queue with default partition accessing the non-exclusive label resources

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9720: --- Bulk update: Preparing for 3.1.3 release. Moved the incorrect "3.1.2" non-blocker issues to 3.1.4, please

[jira] [Updated] (YARN-9681) AM resource limit is incorrect for queue

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9681: --- Bulk update: Preparing for 3.1.3 release. Moved the incorrect "3.1.2" non-blocker issues to 3.1.4, please

[jira] [Updated] (YARN-9674) Max AM Resource calculation is wrong

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9674: --- Bulk update: Preparing for 3.1.3 release. Moved the incorrect "3.1.2" non-blocker issues to 3.1.4, please

[jira] [Updated] (YARN-9330) Add support to query scheduler endpoint filtered via queue (/scheduler/queue=abc)

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9330: --- Bulk update: Preparing for 3.1.3 release. Moved the incorrect "3.1.2" non-blocker issues to 3.1.4, please

[jira] [Updated] (YARN-9376) too many ContainerIdComparator instances are not necessary

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9376: --- Bulk update: Preparing for 3.1.3 release. Moved the incorrect "3.1.2" non-blocker issues to 3.1.4, please

[jira] [Updated] (YARN-8234) Improve RM system metrics publisher's performance by pushing events to timeline server in batch

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-8234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-8234: --- Target Version/s: 3.1.4 (was: 3.1.3) Bulk update: Preparing for 3.1.3 release. Moved all 3.1.3 non-bl

[jira] [Updated] (YARN-8552) [DS] Container report fails for distributed containers

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-8552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-8552: --- Target Version/s: 3.1.4 (was: 3.1.3) Bulk update: Preparing for 3.1.3 release. Moved all 3.1.3 non-bl

[jira] [Updated] (YARN-8052) Move overwriting of service definition during flex to service master

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-8052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-8052: --- Target Version/s: 3.1.4 (was: 3.1.3) Bulk update: Preparing for 3.1.3 release. Moved all 3.1.3 non-bl

[jira] [Updated] (YARN-8257) Native service should automatically adding escapes for environment/launch cmd before sending to YARN

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-8257?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-8257: --- Target Version/s: 3.1.4 (was: 3.1.3) Bulk update: Preparing for 3.1.3 release. Moved all 3.1.3 non-bl

[jira] [Updated] (YARN-8417) Should skip passing HDFS_HOME, HADOOP_CONF_DIR, JAVA_HOME, etc. to Docker container.

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-8417?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-8417: --- Target Version/s: 3.1.4 (was: 3.1.3) Bulk update: Preparing for 3.1.3 release. Moved all 3.1.3 non-bl

[jira] [Commented] (YARN-9642) AbstractYarnScheduler#clearPendingContainerCache could run even after transitiontostandby

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16915799#comment-16915799 ] Zhankun Tang commented on YARN-9642: Triggered a rebuild just now. Let's see the resul

[jira] [Updated] (YARN-8453) Additional Unit tests to verify queue limit and max-limit with multiple resource types

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-8453?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-8453: --- Target Version/s: 3.0.4, 3.1.4 (was: 3.0.4, 3.1.3) > Additional Unit tests to verify queue limit and

[jira] [Commented] (YARN-8453) Additional Unit tests to verify queue limit and max-limit with multiple resource types

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-8453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16915818#comment-16915818 ] Zhankun Tang commented on YARN-8453: Bulk update: Preparing for 3.1.3 release. moved a

[jira] [Updated] (YARN-9607) Auto-configuring rollover-size of IFile format for non-appendable filesystems

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9607: --- Target Version/s: 3.3.0, 3.2.1, 3.1.4 (was: 3.3.0, 3.2.1, 3.1.3) > Auto-configuring rollover-size of

[jira] [Updated] (YARN-9718) Yarn REST API, services endpoint remote command ejection

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9718: --- Target Version/s: 3.3.0, 3.2.1, 3.1.4 (was: 3.3.0, 3.2.1, 3.1.3) > Yarn REST API, services endpoint r

[jira] [Commented] (YARN-9718) Yarn REST API, services endpoint remote command ejection

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9718?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16915826#comment-16915826 ] Zhankun Tang commented on YARN-9718: Bulk update: Preparing for 3.1.3 release. moved a

[jira] [Commented] (YARN-9607) Auto-configuring rollover-size of IFile format for non-appendable filesystems

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16915831#comment-16915831 ] Zhankun Tang commented on YARN-9607: Bulk update: Preparing for 3.1.3 release. moved a

[jira] [Commented] (YARN-9785) Application gets activated even when AM memory has reached

2019-08-26 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16916269#comment-16916269 ] Zhankun Tang commented on YARN-9785: [~BilwaST], Thanks for reporting this. We're goin

[jira] [Commented] (YARN-9797) LeafQueue#activateApplications should use resourceCalculator#fitsIn

2019-08-29 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9797?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16918478#comment-16918478 ] Zhankun Tang commented on YARN-9797: [~BilwaST], Thanks for the patch and [~bibinchund

[jira] [Commented] (YARN-9785) Fix DominantResourceCalculator when one resource is zero

2019-09-02 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16921125#comment-16921125 ] Zhankun Tang commented on YARN-9785: +1 as well. Will commit this soon. > Fix Dominan

[jira] [Commented] (YARN-9797) LeafQueue#activateApplications should use resourceCalculator#fitsIn

2019-09-02 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9797?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16921128#comment-16921128 ] Zhankun Tang commented on YARN-9797: Thanks, [~bibinchundatt], [~BilwaST].  +1 from me

[jira] [Commented] (YARN-9785) Fix DominantResourceCalculator when one resource is zero

2019-09-03 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16921207#comment-16921207 ] Zhankun Tang commented on YARN-9785: [~bibinchundatt], this has been committed to trun

[jira] [Updated] (YARN-9785) Fix DominantResourceCalculator when one resource is zero

2019-09-03 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9785?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9785: --- Fix Version/s: 3.2.1 3.3.0 > Fix DominantResourceCalculator when one resource is ze

[jira] [Updated] (YARN-9785) Fix DominantResourceCalculator when one resource is zero

2019-09-04 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9785?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9785: --- Fix Version/s: 3.1.3 > Fix DominantResourceCalculator when one resource is zero >

[jira] [Commented] (YARN-9739) appsTableData in AppsBlock may cause OOM

2019-09-08 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16925308#comment-16925308 ] Zhankun Tang commented on YARN-9739: [~cane], Thanks for catching this point. Do you m

[jira] [Commented] (YARN-9605) Add ZkConfiguredFailoverProxyProvider for RM HA

2019-09-08 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16925317#comment-16925317 ] Zhankun Tang commented on YARN-9605: [~cane], Thanks for contributing this. I saw ther

[jira] [Commented] (YARN-9612) Support using ip to register NodeID

2019-09-08 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9612?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16925321#comment-16925321 ] Zhankun Tang commented on YARN-9612: [~cane], the background and the motivation still

[jira] [Commented] (YARN-9847) ZKRMStateStore will cause zk connection loss when writing huge data into znode

2019-09-19 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16934125#comment-16934125 ] Zhankun Tang commented on YARN-9847: [~suxingfate], thanks for reporting this. This is

[jira] [Commented] (YARN-9011) Race condition during decommissioning

2019-09-23 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16936329#comment-16936329 ] Zhankun Tang commented on YARN-9011: [~pbacsko], Thanks for the elaboration. Not sure

[jira] [Commented] (YARN-9847) ZKRMStateStore will cause zk connection loss when writing huge data into znode

2019-09-23 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16936333#comment-16936333 ] Zhankun Tang commented on YARN-9847: [~suxingfate], thanks for the clarification! Is t

[jira] [Commented] (YARN-9847) ZKRMStateStore will cause zk connection loss when writing huge data into znode

2019-09-24 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16936476#comment-16936476 ] Zhankun Tang commented on YARN-9847: [~suxingfate], I see. Thanks! One question on the

[jira] [Commented] (YARN-9847) ZKRMStateStore will cause zk connection loss when writing huge data into znode

2019-09-24 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16936518#comment-16936518 ] Zhankun Tang commented on YARN-9847: [~suxingfate], Thanks for the clarification. It l

[jira] [Commented] (YARN-9011) Race condition during decommissioning

2019-09-24 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16936635#comment-16936635 ] Zhankun Tang commented on YARN-9011: [~pbacsko], I see. I may be missing something imp

[jira] [Updated] (YARN-9861) The ResourceManager log reports an error "Too many open files", the analysis is related to the service

2019-09-27 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9861?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9861: --- Attachment: submarine_kerasgesv2date20190807.json > The ResourceManager log reports an error "Too many

[jira] [Commented] (YARN-9861) The ResourceManager log reports an error "Too many open files", the analysis is related to the service

2019-09-27 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16939320#comment-16939320 ] Zhankun Tang commented on YARN-9861: [~billie.rinaldi], if any chance, could you pleas

[jira] [Commented] (YARN-9921) Issue in PlacementConstraint when YARN Service AM retries allocation on component failure.

2019-10-20 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9921?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16955769#comment-16955769 ] Zhankun Tang commented on YARN-9921: [~tarunparimi], Thanks for reproducing it and fin

[jira] [Commented] (YARN-9921) Issue in PlacementConstraint when YARN Service AM retries allocation on component failure.

2019-10-23 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9921?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16957693#comment-16957693 ] Zhankun Tang commented on YARN-9921: [~Prabhu Joseph], [~sunilg], if no more comment.

[jira] [Commented] (YARN-9921) Issue in PlacementConstraint when YARN Service AM retries allocation on component failure.

2019-10-23 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9921?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16958465#comment-16958465 ] Zhankun Tang commented on YARN-9921: [~prabhujoseph], Thanks for the review. [~tarunp

[jira] [Updated] (YARN-9921) Issue in PlacementConstraint when YARN Service AM retries allocation on component failure.

2019-10-23 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9921: --- Fix Version/s: 3.1.4 3.3.0 > Issue in PlacementConstraint when YARN Service AM retr

[jira] [Commented] (YARN-9748) Allow capacity-scheduler configuration on HDFS and support reload from HDFS

2019-10-28 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9748?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16960934#comment-16960934 ] Zhankun Tang commented on YARN-9748: [~cane], could you please clarify your requiremen

[jira] [Comment Edited] (YARN-9931) Support run script before kill container

2019-10-28 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16960938#comment-16960938 ] Zhankun Tang edited comment on YARN-9931 at 10/28/19 11:08 AM: -

[jira] [Commented] (YARN-9931) Support run script before kill container

2019-10-28 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16960938#comment-16960938 ] Zhankun Tang commented on YARN-9931: [~cane], do you have a sample patch? > Support r

[jira] [Commented] (YARN-9011) Race condition during decommissioning

2019-10-28 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16961621#comment-16961621 ] Zhankun Tang commented on YARN-9011: [~pbacsko], Thanks for the new patch. The idea lo

[jira] [Comment Edited] (YARN-9011) Race condition during decommissioning

2019-10-28 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16961621#comment-16961621 ] Zhankun Tang edited comment on YARN-9011 at 10/29/19 2:49 AM: --

[jira] [Comment Edited] (YARN-9011) Race condition during decommissioning

2019-10-28 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16961621#comment-16961621 ] Zhankun Tang edited comment on YARN-9011 at 10/29/19 2:49 AM: --

[jira] [Comment Edited] (YARN-9011) Race condition during decommissioning

2019-10-29 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16961621#comment-16961621 ] Zhankun Tang edited comment on YARN-9011 at 10/29/19 11:54 AM: -

[jira] [Commented] (YARN-9011) Race condition during decommissioning

2019-10-29 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16961935#comment-16961935 ] Zhankun Tang commented on YARN-9011: [~pbacsko], Thanks for the explanation. After the

[jira] [Comment Edited] (YARN-9011) Race condition during decommissioning

2019-10-29 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16961935#comment-16961935 ] Zhankun Tang edited comment on YARN-9011 at 10/29/19 12:14 PM: -

[jira] [Comment Edited] (YARN-9011) Race condition during decommissioning

2019-10-29 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16961935#comment-16961935 ] Zhankun Tang edited comment on YARN-9011 at 10/29/19 12:15 PM: -

[jira] [Commented] (YARN-9605) Add ZkConfiguredFailoverProxyProvider for RM HA

2019-11-05 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968014#comment-16968014 ] Zhankun Tang commented on YARN-9605: [~cane], I triggered a new build and let's see.

[jira] [Commented] (YARN-10041) Should not use AbstractPath to create unix domain socket

2019-12-18 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16998975#comment-16998975 ] Zhankun Tang commented on YARN-10041: - [~bzhaoopenstack], thanks for catching this. W

[jira] [Commented] (YARN-10042) Uupgrade grpc-xxx depdencies to 1.26.0

2019-12-19 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10042?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1658#comment-1658 ] Zhankun Tang commented on YARN-10042: - [~seanlau], Thanks for catching this. The patc

[jira] [Commented] (YARN-10041) Should not use AbstractPath to create unix domain socket

2019-12-19 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17000569#comment-17000569 ] Zhankun Tang commented on YARN-10041: - [~bzhaoopenstack], [~liusheng], could you plea

[jira] [Updated] (YARN-10042) Uupgrade grpc-xxx depdencies to 1.26.0

2019-12-19 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10042?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-10042: Fix Version/s: 3.3.0 > Uupgrade grpc-xxx depdencies to 1.26.0 > ---

[jira] [Commented] (YARN-10042) Uupgrade grpc-xxx depdencies to 1.26.0

2019-12-19 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10042?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17000578#comment-17000578 ] Zhankun Tang commented on YARN-10042: - [~cheersyang], thanks for the review. Committe

[jira] [Commented] (YARN-10048) NodeManager fails to start after mounting CGroup

2019-12-19 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10048?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17000594#comment-17000594 ] Zhankun Tang commented on YARN-10048: - [~Sen Zhao], thanks for catching this. Let me

[jira] [Commented] (YARN-8851) [Umbrella] A pluggable device plugin framework to ease vendor plugin development

2020-01-08 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-8851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17010470#comment-17010470 ] Zhankun Tang commented on YARN-8851: [~brahmareddy], thanks for planning the 3.3.0 rel

[jira] [Resolved] (YARN-8851) [Umbrella] A pluggable device plugin framework to ease vendor plugin development

2020-01-08 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-8851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang resolved YARN-8851. Fix Version/s: 3.3.0 Resolution: Fixed > [Umbrella] A pluggable device plugin framework to ea

[jira] [Commented] (YARN-9605) Add ZkConfiguredFailoverProxyProvider for RM HA

2020-01-21 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17020046#comment-17020046 ] Zhankun Tang commented on YARN-9605: [~cane], let me trigger again. Yeah. It seems the

[jira] [Commented] (YARN-10200) Add number of containers to RMAppManager summary

2020-03-24 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17065399#comment-17065399 ] Zhankun Tang commented on YARN-10200: - [~jhung], Thanks for the patch. +1 from me. Ju

[jira] [Commented] (YARN-10200) Add number of containers to RMAppManager summary

2020-03-24 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17066316#comment-17066316 ] Zhankun Tang commented on YARN-10200: - [~jhung], Thanks for the update. Looks better

[jira] [Commented] (YARN-10225) Support of AMD ROCm GPUs in Yarn

2020-04-08 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17078340#comment-17078340 ] Zhankun Tang commented on YARN-10225: - Not sure if YARN-8851 can help here. You can t

[jira] [Assigned] (YARN-10248) when config allowed-gpu-devices , excluded GPUs still be visible to containers

2020-04-28 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10248?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang reassigned YARN-10248: --- Assignee: zhao yufei > when config allowed-gpu-devices , excluded GPUs still be visible to c

[jira] [Commented] (YARN-10248) when config allowed-gpu-devices , excluded GPUs still be visible to containers

2020-04-28 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17095035#comment-17095035 ] Zhankun Tang commented on YARN-10248: - [~jasstionzyf], Thanks for the contribution! H

[jira] [Commented] (YARN-10248) when config allowed-gpu-devices , excluded GPUs still be visible to containers

2020-05-12 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17105933#comment-17105933 ] Zhankun Tang commented on YARN-10248: - [~jasstionzyf], do you mean the existing test

[jira] [Commented] (YARN-10302) Support custom packing algorithm for FairScheduler

2020-06-01 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17121484#comment-17121484 ] Zhankun Tang commented on YARN-10302: - [~billgraham], thanks for the contribution. Co

[jira] [Commented] (YARN-10307) /leveldb-timeline-store.ldb/LOCK not exist

2020-06-04 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17126357#comment-17126357 ] Zhankun Tang commented on YARN-10307: - [~appleyuchi], IIRC, I don't think the "Hive o

[jira] [Commented] (YARN-7277) Container Launch expand environment needs to consider bracket matching

2018-11-28 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16702621#comment-16702621 ] Zhankun Tang commented on YARN-7277: [~leftnoteasy], so if we change any A project cha

[jira] [Updated] (YARN-8885) Phase 1 - Support NM APIs to query device resource allocation

2018-11-28 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-8885: --- Attachment: YARN-8885-trunk.002.patch > Phase 1 - Support NM APIs to query device resource allocation

[jira] [Updated] (YARN-9015) Phase 1 - Add an interface for device plugin to provide customized scheduler

2018-11-28 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9015: --- Attachment: YARN-9015-trunk.001.patch > Phase 1 - Add an interface for device plugin to provide custom

[jira] [Updated] (YARN-9015) Phase 1 - Add an interface for device plugin to provide customized scheduler

2018-11-28 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9015: --- Attachment: YARN-9015-trunk.002.patch > Phase 1 - Add an interface for device plugin to provide custom

[jira] [Updated] (YARN-8885) Phase 1 - Support NM APIs to query device resource allocation

2018-11-28 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-8885: --- Attachment: YARN-8885-trunk.003.patch > Phase 1 - Support NM APIs to query device resource allocation

[jira] [Commented] (YARN-9060) [YARN-8851] Phase 1 - Support device isolation in native container-executor

2018-11-29 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9060?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16703023#comment-16703023 ] Zhankun Tang commented on YARN-9060: Thanks for the review, [~leftnoteasy]! {quote}1)

[jira] [Commented] (YARN-9069) Handle SchedulerInfo#getSchedulerType for customized schedulers

2018-11-29 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16703208#comment-16703208 ] Zhankun Tang commented on YARN-9069: +1 LGTM > Handle SchedulerInfo#getSchedulerType

  1   2   3   4   5   6   7   8   9   10   >