[
https://issues.apache.org/jira/browse/YARN-9875?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16956718#comment-16956718
]
Prabhu Joseph commented on YARN-9875:
-
Thanks [~eyang].
> FSSchedulerConfigurationStore fails to
[
https://issues.apache.org/jira/browse/YARN-9927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
hcarrot updated YARN-9927:
--
Description: Recently, we have observed serious event blocking in RM event
dispatcher queue. After analysis of
[
https://issues.apache.org/jira/browse/YARN-9928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tarun Parimi updated YARN-9928:
---
Component/s: ATSv2
> ATSv2 can make NM go down with a FATAL error while it is resyncing with RM
>
[
https://issues.apache.org/jira/browse/YARN-9928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tarun Parimi updated YARN-9928:
---
Affects Version/s: 3.1.0
> ATSv2 can make NM go down with a FATAL error while it is resyncing with RM
[
https://issues.apache.org/jira/browse/YARN-9781?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16956934#comment-16956934
]
Peter Bacsko commented on YARN-9781:
LGTM +1 (non-binding)
> SchedConfCli to get current stored
[
https://issues.apache.org/jira/browse/YARN-9780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16956941#comment-16956941
]
Peter Bacsko commented on YARN-9780:
[~Prabhu Joseph] I have some minor comments:
#1 Nit: pay
[
https://issues.apache.org/jira/browse/YARN-9927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
hcarrot updated YARN-9927:
--
Description:
Recently, we have observed serious event blocking in RM event dispatcher queue.
After analysis of
[
https://issues.apache.org/jira/browse/YARN-9927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
hcarrot updated YARN-9927:
--
Priority: Major (was: Minor)
> RM multi-thread event processing mechanism
>
[
https://issues.apache.org/jira/browse/YARN-9927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16956764#comment-16956764
]
Adam Antal commented on YARN-9927:
--
Thanks for filing this [~hcarrot], interesting approach.
One
[
https://issues.apache.org/jira/browse/YARN-9886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kinga Marton updated YARN-9886:
---
Attachment: YARN-9886.001.patch
> Queue mapping based on userid passed through application tag
>
[
https://issues.apache.org/jira/browse/YARN-9886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16956730#comment-16956730
]
Kinga Marton commented on YARN-9886:
[~wangda] yes. I will add whitelist, where it can be defined who
[
https://issues.apache.org/jira/browse/YARN-9886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16956730#comment-16956730
]
Kinga Marton edited comment on YARN-9886 at 10/22/19 7:41 AM:
--
[~wangda] yes.
[
https://issues.apache.org/jira/browse/YARN-9927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
hcarrot updated YARN-9927:
--
Affects Version/s: 3.0.0
2.9.2
> RM multi-thread event processing mechanism
>
[
https://issues.apache.org/jira/browse/YARN-9886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16956938#comment-16956938
]
Kinga Marton commented on YARN-9886:
In the attached patch 001 I have addressed the following issues:
hcarrot created YARN-9926:
-
Summary: RM multi-thread event processing mechanism
Key: YARN-9926
URL: https://issues.apache.org/jira/browse/YARN-9926
Project: Hadoop YARN
Issue Type: Improvement
hcarrot created YARN-9927:
-
Summary: RM multi-thread event processing mechanism
Key: YARN-9927
URL: https://issues.apache.org/jira/browse/YARN-9927
Project: Hadoop YARN
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/YARN-9511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16956747#comment-16956747
]
Adam Antal commented on YARN-9511:
--
Hi [~seanlau],
I can repro the steps you described above with one
Tarun Parimi created YARN-9928:
--
Summary: ATSv2 can make NM go down with a FATAL error while it is
resyncing with RM
Key: YARN-9928
URL: https://issues.apache.org/jira/browse/YARN-9928
Project: Hadoop
[
https://issues.apache.org/jira/browse/YARN-9927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16956949#comment-16956949
]
hcarrot commented on YARN-9927:
---
The performance bottleneck is the single-thread RMEventDispatcher mode.
[
https://issues.apache.org/jira/browse/YARN-9537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16956961#comment-16956961
]
zhoukang commented on YARN-9537:
Ok i will fix now!thanks [~snemeth]
> Add configuration to disable AM
[
https://issues.apache.org/jira/browse/YARN-9605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16956959#comment-16956959
]
zhoukang commented on YARN-9605:
[~weichiu][~tangzhankun] Any suggestion?thanks
> Add
[
https://issues.apache.org/jira/browse/YARN-9789?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957015#comment-16957015
]
Peter Bacsko commented on YARN-9789:
Patch looks straightforward, +1 non-binding.
> Disable Option
[
https://issues.apache.org/jira/browse/YARN-9537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhoukang updated YARN-9537:
---
Attachment: YARN-9537-002.patch
> Add configuration to disable AM preemption
>
[
https://issues.apache.org/jira/browse/YARN-9748?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957041#comment-16957041
]
zhoukang commented on YARN-9748:
I want add a service like
{code:java}
AllocationFileLoaderService
{code}
[
https://issues.apache.org/jira/browse/YARN-9788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Prabhu Joseph updated YARN-9788:
Attachment: YARN-9788-008.patch
> Queue Management API - does not support parallel updates
>
[
https://issues.apache.org/jira/browse/YARN-9689?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16956994#comment-16956994
]
zhoukang commented on YARN-9689:
Could you help review this? [~botong][~giovanni.fumarola][~tangzhankun]
zhoukang created YARN-9930:
--
Summary: Support max running app logic for CapacityScheduler
Key: YARN-9930
URL: https://issues.apache.org/jira/browse/YARN-9930
Project: Hadoop YARN
Issue Type:
[
https://issues.apache.org/jira/browse/YARN-9928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957101#comment-16957101
]
Tarun Parimi commented on YARN-9928:
The issue is occurring since container returned in below code
[
https://issues.apache.org/jira/browse/YARN-9605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16956958#comment-16956958
]
zhoukang commented on YARN-9605:
The failed test is below which i think is not related with this patch:
[
https://issues.apache.org/jira/browse/YARN-9788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16956957#comment-16956957
]
Peter Bacsko commented on YARN-9788:
Thanks for the patch [~Prabhu Joseph]. I think the patch looks
[
https://issues.apache.org/jira/browse/YARN-9537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhoukang updated YARN-9537:
---
Attachment: (was: YARN-9537-002.patch)
> Add configuration to disable AM preemption
>
[
https://issues.apache.org/jira/browse/YARN-9929?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957039#comment-16957039
]
kyungwan nam commented on YARN-9929:
attaches a patch, which set the timeout for
[
https://issues.apache.org/jira/browse/YARN-9863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957077#comment-16957077
]
David Mollitor commented on YARN-9863:
--
[~szegedim] Any chance you've been able to review my remarks?
[
https://issues.apache.org/jira/browse/YARN-9916?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16956952#comment-16956952
]
Adam Antal commented on YARN-9916:
--
I think this is related (if not the dupe) to YARN-9927.
> Improving
[
https://issues.apache.org/jira/browse/YARN-9689?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhoukang reassigned YARN-9689:
--
Assignee: zhoukang
> Router does not support kerberos proxy when in secure mode
>
[
https://issues.apache.org/jira/browse/YARN-9927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957037#comment-16957037
]
zhoukang commented on YARN-9927:
nice idea, we also want to do similar job. looking forward for the poc
[
https://issues.apache.org/jira/browse/YARN-7621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957069#comment-16957069
]
zhoukang commented on YARN-7621:
[~jiwq] Agree with you, Sorry for late reply.And any progress for this
[
https://issues.apache.org/jira/browse/YARN-9931?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhoukang updated YARN-9931:
---
Description:
Like node health check script. We can add a pre-kill script which run before
kill container.
[
https://issues.apache.org/jira/browse/YARN-9931?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhoukang updated YARN-9931:
---
Component/s: nodemanager
> Support run script before kill container
>
[
https://issues.apache.org/jira/browse/YARN-9931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957095#comment-16957095
]
zhoukang commented on YARN-9931:
[~weiweiyagn666] [~tangzhankun]
> Support run script before kill
[
https://issues.apache.org/jira/browse/YARN-9537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhoukang updated YARN-9537:
---
Attachment: YARN-9537-002.patch
> Add configuration to disable AM preemption
>
[
https://issues.apache.org/jira/browse/YARN-9851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhoukang resolved YARN-9851.
Resolution: Duplicate
> Make execution type check compatiable
> -
>
>
kyungwan nam created YARN-9929:
--
Summary: NodeManager OOM because of stuck DeletionService
Key: YARN-9929
URL: https://issues.apache.org/jira/browse/YARN-9929
Project: Hadoop YARN
Issue Type:
[
https://issues.apache.org/jira/browse/YARN-9781?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957073#comment-16957073
]
Prabhu Joseph commented on YARN-9781:
-
Thanks [~pbacsko] for the review.
[~snemeth] Can you review
[
https://issues.apache.org/jira/browse/YARN-9789?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957074#comment-16957074
]
Prabhu Joseph commented on YARN-9789:
-
Thanks [~pbacsko] for the review.
[~snemeth] Can you review
[
https://issues.apache.org/jira/browse/YARN-9886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957114#comment-16957114
]
Hadoop QA commented on YARN-9886:
-
| (/) *{color:green}+1 overall{color}* |
\\
\\
|| Vote || Subsystem ||
[
https://issues.apache.org/jira/browse/YARN-9925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Prabhu Joseph updated YARN-9925:
Attachment: YARN-9925-001.patch
> CapacitySchedulerQueueManager allows unsupported Queue hierarchy
[
https://issues.apache.org/jira/browse/YARN-9697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16956962#comment-16956962
]
Bibin Chundatt commented on YARN-9697:
--
Thank you [~abmodi] for updating patch
Few comments and
[
https://issues.apache.org/jira/browse/YARN-9923?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16956995#comment-16956995
]
Peter Bacsko commented on YARN-9923:
_"NONE (default): preserving the current behaviour [...]"_
Even
[
https://issues.apache.org/jira/browse/YARN-9923?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16956995#comment-16956995
]
Peter Bacsko edited comment on YARN-9923 at 10/22/19 12:16 PM:
---
_"NONE
[
https://issues.apache.org/jira/browse/YARN-9537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957020#comment-16957020
]
zhoukang commented on YARN-9537:
A new patch has been attached [~snemeth]
> Add configuration to disable
[
https://issues.apache.org/jira/browse/YARN-9929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
kyungwan nam updated YARN-9929:
---
Attachment: nm_heapdump.png
> NodeManager OOM because of stuck DeletionService
>
[
https://issues.apache.org/jira/browse/YARN-9929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
kyungwan nam updated YARN-9929:
---
Attachment: YARN-9929.001.patch
> NodeManager OOM because of stuck DeletionService
>
[
https://issues.apache.org/jira/browse/YARN-9930?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhoukang updated YARN-9930:
---
Parent: YARN-9698
Issue Type: Sub-task (was: Improvement)
> Support max running app logic for
[
https://issues.apache.org/jira/browse/YARN-7621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957071#comment-16957071
]
zhoukang edited comment on YARN-7621 at 10/22/19 1:40 PM:
--
I think we can solve
[
https://issues.apache.org/jira/browse/YARN-7621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957071#comment-16957071
]
zhoukang commented on YARN-7621:
I think with full path we can solve this problem [~wilfreds]
> Support
[
https://issues.apache.org/jira/browse/YARN-9925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Prabhu Joseph updated YARN-9925:
Description:
CapacitySchedulerQueueManager allows unsupported Queue hierarchy. When creating
a
zhoukang created YARN-9931:
--
Summary: Support run script before kill container
Key: YARN-9931
URL: https://issues.apache.org/jira/browse/YARN-9931
Project: Hadoop YARN
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/YARN-9929?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957125#comment-16957125
]
Hadoop QA commented on YARN-9929:
-
| (x) *{color:red}-1 overall{color}* |
\\
\\
|| Vote || Subsystem ||
[
https://issues.apache.org/jira/browse/YARN-9915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16956715#comment-16956715
]
Prabhu Joseph commented on YARN-9915:
-
Thanks [~epayne].
> Fix FindBug issue in QueueMetrics
>
[
https://issues.apache.org/jira/browse/YARN-9897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16956777#comment-16956777
]
Zhenyu Zheng commented on YARN-9897:
Some updates, our team has succesfully donated ARM resources and
[
https://issues.apache.org/jira/browse/YARN-9788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957315#comment-16957315
]
Hadoop QA commented on YARN-9788:
-
| (/) *{color:green}+1 overall{color}* |
\\
\\
|| Vote || Subsystem ||
[
https://issues.apache.org/jira/browse/YARN-9697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957341#comment-16957341
]
Hadoop QA commented on YARN-9697:
-
| (/) *{color:green}+1 overall{color}* |
\\
\\
|| Vote || Subsystem ||
[
https://issues.apache.org/jira/browse/YARN-9918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957232#comment-16957232
]
Manikandan R commented on YARN-9918:
Can you add more details to reproduce this issue? FYI, this
[
https://issues.apache.org/jira/browse/YARN-9780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Prabhu Joseph updated YARN-9780:
Attachment: YARN-9780-004.patch
> SchedulerConf Mutation Api does not Allow Stop and Remove Queue
[
https://issues.apache.org/jira/browse/YARN-9930?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957231#comment-16957231
]
Manikandan R commented on YARN-9930:
Is this different from YARN-9887?
> Support max running app
[
https://issues.apache.org/jira/browse/YARN-9788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957179#comment-16957179
]
Hadoop QA commented on YARN-9788:
-
| (x) *{color:red}-1 overall{color}* |
\\
\\
|| Vote || Subsystem ||
[
https://issues.apache.org/jira/browse/YARN-9925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Prabhu Joseph updated YARN-9925:
Attachment: YARN-9925-002.patch
> CapacitySchedulerQueueManager allows unsupported Queue hierarchy
[
https://issues.apache.org/jira/browse/YARN-9925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957161#comment-16957161
]
Hadoop QA commented on YARN-9925:
-
| (x) *{color:red}-1 overall{color}* |
\\
\\
|| Vote || Subsystem ||
[
https://issues.apache.org/jira/browse/YARN-9537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957168#comment-16957168
]
Hadoop QA commented on YARN-9537:
-
| (/) *{color:green}+1 overall{color}* |
\\
\\
|| Vote || Subsystem ||
[
https://issues.apache.org/jira/browse/YARN-9788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Prabhu Joseph updated YARN-9788:
Attachment: YARN-9788-009.patch
> Queue Management API - does not support parallel updates
>
[
https://issues.apache.org/jira/browse/YARN-9926?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bibin Chundatt resolved YARN-9926.
--
Resolution: Duplicate
> RM multi-thread event processing mechanism
>
[
https://issues.apache.org/jira/browse/YARN-9925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957230#comment-16957230
]
Manikandan R commented on YARN-9925:
YARN-9772 has been created to address the concerns raised here.
[
https://issues.apache.org/jira/browse/YARN-9780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957248#comment-16957248
]
Hadoop QA commented on YARN-9780:
-
| (/) *{color:green}+1 overall{color}* |
\\
\\
|| Vote || Subsystem ||
[
https://issues.apache.org/jira/browse/YARN-9697?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Abhishek Modi updated YARN-9697:
Attachment: YARN-9697.008.patch
> Efficient allocation of Opportunistic containers.
>
[
https://issues.apache.org/jira/browse/YARN-9925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957262#comment-16957262
]
Hadoop QA commented on YARN-9925:
-
| (x) *{color:red}-1 overall{color}* |
\\
\\
|| Vote || Subsystem ||
[
https://issues.apache.org/jira/browse/YARN-9697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957250#comment-16957250
]
Abhishek Modi commented on YARN-9697:
-
Thanks [~bibinchundatt] for the review. I have addressed most
[
https://issues.apache.org/jira/browse/YARN-9689?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957445#comment-16957445
]
Botong Huang commented on YARN-9689:
+1 lgtm
> Router does not support kerberos proxy when in secure
[
https://issues.apache.org/jira/browse/YARN-9923?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957379#comment-16957379
]
Eric Badger commented on YARN-9923:
---
Isn't it more appropriate for this to be in the nm health check
[
https://issues.apache.org/jira/browse/YARN-9897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957475#comment-16957475
]
Eric Yang commented on YARN-9897:
-
[~Kevin_Zheng] The patch looks good to me. I am surprised how little
[
https://issues.apache.org/jira/browse/YARN-9897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957484#comment-16957484
]
liusheng commented on YARN-9897:
Hi [~eyang],
Looks like both these two tests are OK, see:
{code:java}
[
https://issues.apache.org/jira/browse/YARN-9897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957484#comment-16957484
]
liusheng edited comment on YARN-9897 at 10/23/19 1:34 AM:
--
Hi [~eyang],
I have
[
https://issues.apache.org/jira/browse/YARN-9897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957487#comment-16957487
]
Zhenyu Zheng commented on YARN-9897:
[~eyang]BTW, we have actually started to run tests and debug for
[
https://issues.apache.org/jira/browse/YARN-9927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957378#comment-16957378
]
Wangda Tan commented on YARN-9927:
--
Thanks [~hcarrot] for working on this.
Tagging: [~prabhujoseph] ,
84 matches
Mail list logo