[jira] [Assigned] (YARN-10306) Create simple copy log aggregation file controller

2020-10-20 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Antal reassigned YARN-10306: - Assignee: Andras Gyori (was: Adam Antal) > Create simple copy log aggregation file controller

[jira] [Commented] (YARN-10448) SLS should set default user to handle SYNTH format

2020-10-12 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10448?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17212423#comment-17212423 ] Adam Antal commented on YARN-10448: --- Thanks for the patch [~zhuqi], looks good to me. Could you please

[jira] [Commented] (YARN-10420) Update CS MappingRule documentation with the new format and features

2020-10-12 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17212411#comment-17212411 ] Adam Antal commented on YARN-10420: --- Thanks for the patch [~pbacsko]. I'll attach my reply inline. 1.

[jira] [Commented] (YARN-10393) MR job live lock caused by completed state container leak in heartbeat between node manager and RM

2020-10-07 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17209595#comment-17209595 ] Adam Antal commented on YARN-10393: --- Committed to branch-2.10. Thanks [~Jim_Brennan]. > MR job live

[jira] [Updated] (YARN-10393) MR job live lock caused by completed state container leak in heartbeat between node manager and RM

2020-10-07 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10393?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Antal updated YARN-10393: -- Fix Version/s: 2.10.2 > MR job live lock caused by completed state container leak in heartbeat >

[jira] [Updated] (YARN-10393) MR job live lock caused by completed state container leak in heartbeat between node manager and RM

2020-10-07 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10393?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Antal updated YARN-10393: -- Attachment: YARN-10393-branch-2.10.001.patch > MR job live lock caused by completed state container

[jira] [Commented] (YARN-10393) MR job live lock caused by completed state container leak in heartbeat between node manager and RM

2020-10-07 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17209483#comment-17209483 ] Adam Antal commented on YARN-10393: --- Reuploaded patch for branch-2.10, pending on jenkins. > MR job

[jira] [Reopened] (YARN-10393) MR job live lock caused by completed state container leak in heartbeat between node manager and RM

2020-10-07 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10393?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Antal reopened YARN-10393: --- Reopening the issue to trigger jenkins. > MR job live lock caused by completed state container leak in

[jira] [Commented] (YARN-10420) Update CS MappingRule documentation with the new format and features

2020-10-06 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17208690#comment-17208690 ] Adam Antal commented on YARN-10420: --- Thanks for the patch [~pbacsko]. Awesome patch, thanks for the

[jira] [Commented] (YARN-10031) Create a general purpose log request with additional query parameters

2020-10-06 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17208611#comment-17208611 ] Adam Antal commented on YARN-10031: --- Thanks for the patch [~gandras]. Sorry for the late with the

[jira] [Commented] (YARN-10393) MR job live lock caused by completed state container leak in heartbeat between node manager and RM

2020-10-05 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17207941#comment-17207941 ] Adam Antal commented on YARN-10393: --- Cherry-picked to branch-3.3, branch-3.2, branch-3.1, branch-3.0. I

[jira] [Updated] (YARN-10393) MR job live lock caused by completed state container leak in heartbeat between node manager and RM

2020-10-05 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10393?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Antal updated YARN-10393: -- Fix Version/s: 3.1.5, 3.3.1, 3.2.2, 3.0.4 > MR

[jira] [Updated] (YARN-10393) MR job live lock caused by completed state container leak in heartbeat between node manager and RM

2020-10-05 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10393?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Antal updated YARN-10393: -- Fix Version/s: 3.4.0 > MR job live lock caused by completed state container leak in heartbeat >

[jira] [Commented] (YARN-10393) MR job live lock caused by completed state container leak in heartbeat between node manager and RM

2020-10-05 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17207912#comment-17207912 ] Adam Antal commented on YARN-10393: --- Committed to trunk, I will cherry-pick this to other branches now.

[jira] [Commented] (YARN-10448) SLS should set default user to handle SYNTH format

2020-10-01 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10448?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17205449#comment-17205449 ] Adam Antal commented on YARN-10448: --- Thanks for the patch [~zhuqi]. Would you please include a sample

[jira] [Commented] (YARN-10447) TestLeafQueue: ActivitiesManager thread might interfere with ongoing stubbing

2020-10-01 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17205397#comment-17205397 ] Adam Antal commented on YARN-10447: --- Committed to trunk, thanks for the contribution [~pbacsko]. >

[jira] [Commented] (YARN-10393) MR job live lock caused by completed state container leak in heartbeat between node manager and RM

2020-10-01 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17205385#comment-17205385 ] Adam Antal commented on YARN-10393: --- Agreed, +1 to v2 patch. Any comments [~yuanbo], [~wzzdreamer]? >

[jira] [Commented] (YARN-10393) MR job live lock caused by completed state container leak in heartbeat between node manager and RM

2020-09-30 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17204736#comment-17204736 ] Adam Antal commented on YARN-10393: --- Hi [~Jim_Brennan], Thanks for the patch, overall looks good. As I

[jira] [Commented] (YARN-8737) Race condition in ParentQueue when reinitializing and sorting child queues in the meanwhile

2020-09-30 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-8737?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17204724#comment-17204724 ] Adam Antal commented on YARN-8737: -- Thanks for the patch [~Tao Yang]. The patch looks straightforward,

[jira] [Commented] (YARN-10447) TestLeafQueue: ActivitiesManager thread might interfere with ongoing stubbing

2020-09-30 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17204701#comment-17204701 ] Adam Antal commented on YARN-10447: --- Thanks [~pbacsko]. The patch looks good to me, but unit tests have

[jira] [Comment Edited] (YARN-10447) TestLeafQueue: ActivitiesManager thread might interfere with ongoing stubbing

2020-09-29 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17203975#comment-17203975 ] Adam Antal edited comment on YARN-10447 at 9/29/20, 2:22 PM: - Thanks

[jira] [Comment Edited] (YARN-10447) TestLeafQueue: ActivitiesManager thread might interfere with ongoing stubbing

2020-09-29 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17203975#comment-17203975 ] Adam Antal edited comment on YARN-10447 at 9/29/20, 2:21 PM: - Thanks

[jira] [Commented] (YARN-10447) TestLeafQueue: ActivitiesManager thread might interfere with ongoing stubbing

2020-09-29 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17203975#comment-17203975 ] Adam Antal commented on YARN-10447: --- Thanks [~pbacsko] for the patch. I think this patch does not

[jira] [Updated] (YARN-10114) Tail -f style CLI tool for extracting logs of running containers

2020-09-28 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Antal updated YARN-10114: -- Summary: Tail -f style CLI tool for extracting logs of running containers (was: Tail -f styled CLI

[jira] [Assigned] (YARN-10443) Document options of logs CLI

2020-09-21 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Antal reassigned YARN-10443: - Assignee: Ankit Kumar > Document options of logs CLI

[jira] [Commented] (YARN-10443) Document options of logs CLI

2020-09-21 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10443?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17199289#comment-17199289 ] Adam Antal commented on YARN-10443: --- Hey [~akumar], No I'm not, I assigned it to you. Will be happy to

[jira] [Created] (YARN-10443) Document options of logs CLI

2020-09-18 Thread Adam Antal (Jira)
Adam Antal created YARN-10443: - Summary: Document options of logs CLI; Key: YARN-10443; URL: https://issues.apache.org/jira/browse/YARN-10443; Project: Hadoop YARN; Issue Type: Bug

[jira] [Commented] (YARN-950) Ability to limit or avoid aggregating logs beyond a certain size

2020-09-17 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17197732#comment-17197732 ] Adam Antal commented on YARN-950: - Hi [~epayne], Sorry for the late answer. I unassigned this from myself,

[jira] [Assigned] (YARN-950) Ability to limit or avoid aggregating logs beyond a certain size

2020-09-17 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-950?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Antal reassigned YARN-950: --- Assignee: (was: Adam Antal) > Ability to limit or avoid aggregating logs beyond a certain size >

[jira] [Commented] (YARN-10031) Create a general purpose log request with additional query parameters

2020-09-17 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17197729#comment-17197729 ] Adam Antal commented on YARN-10031: --- Thanks for the response [~gandras]. I accidentally made an error

[jira] [Commented] (YARN-9333) TestFairSchedulerPreemption.testRelaxLocalityPreemptionWithNoLessAMInRemainingNodes fails intermittent

2020-09-17 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-9333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17197718#comment-17197718 ] Adam Antal commented on YARN-9333: -- I think the potential benefits outweigh the cons, so I'm +1 on

[jira] [Commented] (YARN-10393) MR job live lock caused by completed state container leak in heartbeat between node manager and RM

2020-09-17 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17197715#comment-17197715 ] Adam Antal commented on YARN-10393: --- Thanks for the valuable comments [~Jim_Brennan] [~yuanbo]

[jira] [Commented] (YARN-9333) TestFairSchedulerPreemption.testRelaxLocalityPreemptionWithNoLessAMInRemainingNodes fails intermittent

2020-09-07 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-9333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17191759#comment-17191759 ] Adam Antal commented on YARN-9333: -- I've just seen this failure occurring in a much higher frequency than

[jira] [Resolved] (YARN-10329) Flaky test cases in Fair Scheduler

2020-09-07 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10329?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Antal resolved YARN-10329. --- Resolution: Duplicate > Flaky test cases in Fair Scheduler

[jira] [Assigned] (YARN-10329) Flaky test cases in Fair Scheduler

2020-09-07 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10329?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Antal reassigned YARN-10329: - Assignee: Adam Antal > Flaky test cases in Fair Scheduler

[jira] [Commented] (YARN-10329) Flaky test cases in Fair Scheduler

2020-09-07 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17191756#comment-17191756 ] Adam Antal commented on YARN-10329: --- I think the first failure is handled in YARN-10297 and the second

[jira] [Commented] (YARN-10031) Create a general purpose log request with additional query parameters

2020-09-07 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17191754#comment-17191754 ] Adam Antal commented on YARN-10031: --- Thanks for the patch [~gandras]. Looks good overall: I agree

[jira] [Commented] (YARN-10332) RESOURCE_UPDATE event was repeatedly registered in DECOMMISSIONING state

2020-09-07 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17191621#comment-17191621 ] Adam Antal commented on YARN-10332: --- Thanks for the patch [~yehuanhuan]. Committed to trunk, branch-3.3

[jira] [Updated] (YARN-10332) RESOURCE_UPDATE event was repeatedly registered in DECOMMISSIONING state

2020-09-07 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Antal updated YARN-10332: -- Fix Version/s: 3.3.1, 3.4.0, 3.2.2 > RESOURCE_UPDATE event was

[jira] [Commented] (YARN-9136) getNMResourceInfo NodeManager REST API method is not documented

2020-09-07 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-9136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17191572#comment-17191572 ] Adam Antal commented on YARN-9136: -- Thanks for the patch [~mhudaky], committed to trunk. Thanks for the

[jira] [Updated] (YARN-9136) getNMResourceInfo NodeManager REST API method is not documented

2020-09-07 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-9136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Antal updated YARN-9136: - Fix Version/s: 3.4.0 > getNMResourceInfo NodeManager REST API method is not documented >

[jira] [Commented] (YARN-9136) getNMResourceInfo NodeManager REST API method is not documented

2020-09-03 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-9136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17190110#comment-17190110 ] Adam Antal commented on YARN-9136: -- +1. Any additional comments [~snemeth], [~pbacsko]? >

[jira] [Commented] (YARN-10419) Javadoc error in hadoop-yarn-server-common module

2020-09-03 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10419?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17190042#comment-17190042 ] Adam Antal commented on YARN-10419: --- Thanks for filing this issue [~aajisaka]. I don't know either why

[jira] [Commented] (YARN-10386) Create new JSON schema for Placement Rules

2020-09-01 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10386?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17188421#comment-17188421 ] Adam Antal commented on YARN-10386: --- Thanks for the addendum patch [~pbacsko]. +1 from me, committed to

[jira] [Commented] (YARN-10393) MR job live lock caused by completed state container leak in heartbeat between node manager and RM

2020-09-01 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17188306#comment-17188306 ] Adam Antal commented on YARN-10393: --- Thanks for the discussion above, I think we've seen several

[jira] [Commented] (YARN-10332) RESOURCE_UPDATE event was repeatedly registered in DECOMMISSIONING state

2020-08-31 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17187603#comment-17187603 ] Adam Antal commented on YARN-10332: --- Also we probably need a branch-3.3 and branch-3.2 patch if you

[jira] [Commented] (YARN-10332) RESOURCE_UPDATE event was repeatedly registered in DECOMMISSIONING state

2020-08-31 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17187602#comment-17187602 ] Adam Antal commented on YARN-10332: --- [~yehuanhuan]: sorry for the delay. Could you please reupload your

[jira] [Commented] (YARN-9136) getNMResourceInfo NodeManager REST API method is not documented

2020-08-27 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-9136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17185765#comment-17185765 ] Adam Antal commented on YARN-9136: -- Thanks for working on this [~mhudaky]. AFAIU then let's skip the

[jira] [Comment Edited] (YARN-10386) Create new JSON schema for Placement Rules

2020-08-27 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10386?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17185764#comment-17185764 ] Adam Antal edited comment on YARN-10386 at 8/27/20, 10:25 AM: -- Thanks for

[jira] [Commented] (YARN-10386) Create new JSON schema for Placement Rules

2020-08-27 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10386?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17185764#comment-17185764 ] Adam Antal commented on YARN-10386: --- Thanks for working on this [~pbacsko]. I saw you had some

[jira] [Commented] (YARN-4946) RM should not consider an application as COMPLETED when log aggregation is not in a terminal state

2020-08-26 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-4946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17185230#comment-17185230 ] Adam Antal commented on YARN-4946: -- This issue is reverted in YARN-9848 for 3.3.0. Please revert this

[jira] [Commented] (YARN-10304) Create an endpoint for remote application log directory path query

2020-08-25 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17183947#comment-17183947 ] Adam Antal commented on YARN-10304: --- Thanks for the work [~gandras], committed to trunk. Thanks for the

[jira] [Commented] (YARN-10106) Yarn logs CLI filtering by application attempt

2020-08-25 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17183837#comment-17183837 ] Adam Antal commented on YARN-10106: --- Thanks for the patch [~mhudaky], committed to trunk. Appreciated

[jira] [Commented] (YARN-10106) Yarn logs CLI filtering by application attempt

2020-08-24 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17183351#comment-17183351 ] Adam Antal commented on YARN-10106: --- Thanks for the latest patch [~mhudaky]! If you fix the last 7

[jira] [Updated] (YARN-10406) YARN log processor

2020-08-24 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Antal updated YARN-10406: -- Description: YARN currently does not have any utility that would enable cluster administrators to

[jira] [Created] (YARN-10406) YARN log processor

2020-08-24 Thread Adam Antal (Jira)
Adam Antal created YARN-10406: - Summary: YARN log processor; Key: YARN-10406; URL: https://issues.apache.org/jira/browse/YARN-10406; Project: Hadoop YARN; Issue Type: New Feature

[jira] [Commented] (YARN-10393) MR job live lock caused by completed state container leak in heartbeat between node manager and RM

2020-08-13 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17176984#comment-17176984 ] Adam Antal commented on YARN-10393: --- Nice finding [~wzzdreamer] and thorough explanation. Let me check

[jira] [Commented] (YARN-4783) Log aggregation failure for application when Nodemanager is restarted

2020-08-10 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-4783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17174123#comment-17174123 ] Adam Antal commented on YARN-4783: -- Thanks for the patch [~gandras]. I am not entirely convinced that

[jira] [Commented] (YARN-4783) Log aggregation failure for application when Nodemanager is restarted

2020-08-06 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-4783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17172399#comment-17172399 ] Adam Antal commented on YARN-4783: -- Thanks for the patch [~gandras]. Somehow I could not open the compile

[jira] [Commented] (YARN-10304) Create an endpoint for remote application log directory path query

2020-08-03 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17169945#comment-17169945 ] Adam Antal commented on YARN-10304: --- There's one last checkstyle issue. Could you please handle that

[jira] [Commented] (YARN-10304) Create an endpoint for remote application log directory path query

2020-07-31 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17168815#comment-17168815 ] Adam Antal commented on YARN-10304: --- [~gandras] could you please rebase & reupload the latest patch to

[jira] [Commented] (YARN-10304) Create an endpoint for remote application log directory path query

2020-07-23 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17163600#comment-17163600 ] Adam Antal commented on YARN-10304: --- +1 from me. Thanks for working on this [~gandras]. > Create an

[jira] [Commented] (YARN-10332) RESOURCE_UPDATE event was repeatedly registered in DECOMMISSIONING state

2020-07-23 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17163446#comment-17163446 ] Adam Antal commented on YARN-10332: --- I'm sorry, I missed this too. Nice catch [~yehuanhuan]. +1 for

[jira] [Commented] (YARN-10315) Avoid sending RMNodeResourceupdate event if resource is same

2020-07-23 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10315?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17163423#comment-17163423 ] Adam Antal commented on YARN-10315: --- +1 from me on v2. Thanks for the patch [~Sushil-K-S]. > Avoid

[jira] [Commented] (YARN-10319) Record Last N Scheduler Activities from ActivitiesManager

2020-07-23 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17163418#comment-17163418 ] Adam Antal commented on YARN-10319: --- Indeed, the test failure is not related. +1 from me, thanks for

[jira] [Commented] (YARN-10319) Record Last N Scheduler Activities from ActivitiesManager

2020-07-21 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17162016#comment-17162016 ] Adam Antal commented on YARN-10319: --- Check the markdown and I could not understand this sentence: "This

[jira] [Commented] (YARN-10106) Yarn logs CLI filtering by application attempt

2020-07-20 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17161306#comment-17161306 ] Adam Antal commented on YARN-10106: --- +1 from me. > Yarn logs CLI filtering by application attempt >

[jira] [Resolved] (YARN-10266) Setting debug delay to a too high number will cause NM fail to start

2020-07-01 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Antal resolved YARN-10266. --- Assignee: Adam Antal Resolution: Won't Fix > Setting debug delay to a too high number will

[jira] [Commented] (YARN-10266) Setting debug delay to a too high number will cause NM fail to start

2020-07-01 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10266?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17149311#comment-17149311 ] Adam Antal commented on YARN-10266: --- I agree [~BilwaST]. It makes no sense to handle this exception in

[jira] [Commented] (YARN-10106) Yarn logs CLI filtering by application attempt

2020-07-01 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17149277#comment-17149277 ] Adam Antal commented on YARN-10106: --- Thanks for the patch [~mhudaky]. For backward compatibility

[jira] [Commented] (YARN-10334) TestDistributedShell leaks resources on timeout/failure

2020-07-01 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17149259#comment-17149259 ] Adam Antal commented on YARN-10334: --- Nice finding [~ahussein]. It could potentially cause lots of

[jira] [Commented] (YARN-10319) Record Last N Scheduler Activities from ActivitiesManager

2020-07-01 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17149253#comment-17149253 ] Adam Antal commented on YARN-10319: --- Thanks for the patch [~prabhujoseph]. I have some minor nits if

[jira] [Commented] (YARN-10332) RESOURCE_UPDATE event was repeatedly registered in DECOMMISSIONING state

2020-07-01 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17149233#comment-17149233 ] Adam Antal commented on YARN-10332: --- Moved this under YARN-914. I agree with [~bibinchundatt],

[jira] [Commented] (YARN-10315) Avoid sending RMNodeResoureupdate event if resource is same

2020-07-01 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10315?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17149232#comment-17149232 ] Adam Antal commented on YARN-10315: --- Moved this under YARN-914. > Avoid sending RMNodeResoureupdate

[jira] [Updated] (YARN-10315) Avoid sending RMNodeResoureupdate event if resource is same

2020-07-01 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10315?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Antal updated YARN-10315: -- Parent: YARN-914 Issue Type: Sub-task (was: Improvement) > Avoid sending RMNodeResoureupdate

[jira] [Updated] (YARN-10332) RESOURCE_UPDATE event was repeatedly registered in DECOMMISSIONING state

2020-07-01 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Antal updated YARN-10332: -- Parent: YARN-914 Issue Type: Sub-task (was: Improvement) > RESOURCE_UPDATE event was

[jira] [Commented] (YARN-10279) Avoid unnecessary QueueMappingEntity creations

2020-06-23 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10279?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17142690#comment-17142690 ] Adam Antal commented on YARN-10279: --- Thanks for the patch [~mhudaky]. The unit test failures seem

[jira] [Commented] (YARN-9930) Support max running app logic for CapacityScheduler

2020-06-19 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-9930?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17140489#comment-17140489 ] Adam Antal commented on YARN-9930: -- Thanks for the effort on pushing this through [~pbacsko], +1 >

[jira] [Assigned] (YARN-10281) Redundant QueuePath usage in UserGroupMappingPlacementRule and AppNameMappingPlacementRule

2020-06-17 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Antal reassigned YARN-10281: - Assignee: Adam Antal (was: Gergely Pollak) > Redundant QueuePath usage in

[jira] [Assigned] (YARN-10281) Redundant QueuePath usage in UserGroupMappingPlacementRule and AppNameMappingPlacementRule

2020-06-17 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Antal reassigned YARN-10281: - Assignee: Gergely Pollak (was: Adam Antal) > Redundant QueuePath usage in

[jira] [Assigned] (YARN-9136) getNMResourceInfo NodeManager REST API method is not documented

2020-06-16 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-9136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Antal reassigned YARN-9136: Assignee: Hudáky Márton Gyula (was: Gergely Pollak) > getNMResourceInfo NodeManager REST API

[jira] [Commented] (YARN-9136) getNMResourceInfo NodeManager REST API method is not documented

2020-06-16 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-9136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17136622#comment-17136622 ] Adam Antal commented on YARN-9136: -- I hope you don't mind if I take this over [~shuzirra]. >

[jira] [Commented] (YARN-10304) Create an endpoint for remote application log directory path query

2020-06-16 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17136542#comment-17136542 ] Adam Antal commented on YARN-10304: --- Let me start with the bad news. I am very sorry that I come up

[jira] [Commented] (YARN-9930) Support max running app logic for CapacityScheduler

2020-06-16 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-9930?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17136503#comment-17136503 ] Adam Antal commented on YARN-9930: -- I was trying to make a meaningful review, but stuck on a few

[jira] [Commented] (YARN-10304) Create an endpoint for remote application log directory path query

2020-06-10 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17130651#comment-17130651 ] Adam Antal commented on YARN-10304: --- Thanks for the updated patch [~gandras]! Looks much better. Some

[jira] [Commented] (YARN-10166) Add detail log for ApplicationAttemptNotFoundException

2020-06-05 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10166?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17126768#comment-17126768 ] Adam Antal commented on YARN-10166: --- I think the ERROR level is right: it is definitely an error when

[jira] [Commented] (YARN-9883) Reshape SchedulerHealth class

2020-06-05 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-9883?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17126764#comment-17126764 ] Adam Antal commented on YARN-9883: -- Hi [~BilwaST], Sorry for the delay. I totally agree, thanks for

[jira] [Commented] (YARN-9930) Support max running app logic for CapacityScheduler

2020-06-05 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-9930?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17126751#comment-17126751 ] Adam Antal commented on YARN-9930: -- Thanks for the POC [~pbacsko]. Conceptually it looks good. Some

[jira] [Commented] (YARN-10304) Create an endpoint for remote application log directory path query

2020-06-05 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17126732#comment-17126732 ] Adam Antal commented on YARN-10304: --- Thanks for the draft [~gandras]! If we want a general remote log

[jira] [Commented] (YARN-10295) CapacityScheduler NPE can cause apps to get stuck without resources

2020-06-05 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10295?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17126545#comment-17126545 ] Adam Antal commented on YARN-10295: --- Thanks for the explanation [~bteke]. LGTM (non-binding). >

[jira] [Commented] (YARN-10295) CapacityScheduler NPE can cause apps to get stuck without resources

2020-06-04 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10295?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17125968#comment-17125968 ] Adam Antal commented on YARN-10295: --- Thanks for the investigation [~bteke]. I have a question here:

[jira] [Commented] (YARN-10279) Avoid unnecessary QueueMappingEntity creations

2020-06-04 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10279?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17125861#comment-17125861 ] Adam Antal commented on YARN-10279: --- Thanks! > Avoid unnecessary QueueMappingEntity creations >

[jira] [Assigned] (YARN-10279) Avoid unnecessary QueueMappingEntity creations

2020-06-04 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Antal reassigned YARN-10279: - Assignee: Hudáky Márton Gyula (was: Bilwa S T) > Avoid unnecessary QueueMappingEntity

[jira] [Commented] (YARN-10279) Avoid unnecessary QueueMappingEntity creations

2020-06-03 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10279?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17124876#comment-17124876 ] Adam Antal commented on YARN-10279: --- Hi [~BilwaST], Do you plan to work on this in the near future? >

[jira] [Commented] (YARN-10303) One yarn rest api example of yarn document is error

2020-06-03 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17124870#comment-17124870 ] Adam Antal commented on YARN-10303: --- Thanks for raising this [~zgw]! I see that it is currently

[jira] [Updated] (YARN-10303) One yarn rest api example of yarn document is error

2020-06-03 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Antal updated YARN-10303: -- Labels: documentation newbie (was: ) > One yarn rest api example of yarn document is error >

[jira] [Assigned] (YARN-10303) One yarn rest api example of yarn document is error

2020-06-03 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Antal reassigned YARN-10303: - Assignee: Hudáky Márton Gyula > One yarn rest api example of yarn document is error >

[jira] [Updated] (YARN-10306) Create simple copy log aggregation file controller

2020-06-03 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-10306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Antal updated YARN-10306: -- Summary: Create simple copy log aggregation file controller (was: Create copy log aggregation file
