[jira] [Updated] (YARN-9561) Add C changes for the new RuncContainerRuntime

2019-11-08 Thread Eric Badger (Jira)
[ https://issues.apache.org/jira/browse/YARN-9561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Badger updated YARN-9561: -- Attachment: YARN-9561.011.patch > Add C changes for the new RuncContainerRuntime >

[jira] [Commented] (YARN-9561) Add C changes for the new RuncContainerRuntime

2019-11-08 Thread Eric Badger (Jira)
[ https://issues.apache.org/jira/browse/YARN-9561?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16970535#comment-16970535 ] Eric Badger commented on YARN-9561: --- Thanks for the review, [~Jim_Brennan]! bq. stat_file_as_nm should

[jira] [Commented] (YARN-9923) Detect missing Docker binary or not running Docker daemon

2019-11-08 Thread Hadoop QA (Jira)
[ https://issues.apache.org/jira/browse/YARN-9923?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16970541#comment-16970541 ] Hadoop QA commented on YARN-9923: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-9920) YarnAuthorizationProvider AccessRequest gets Null RemoteAddress from FairScheduler

2019-11-08 Thread Wilfred Spiegelenburg (Jira)
[ https://issues.apache.org/jira/browse/YARN-9920?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16970376#comment-16970376 ] Wilfred Spiegelenburg commented on YARN-9920: - I am not sure what you are after with this

[jira] [Commented] (YARN-8373) RM Received RMFatalEvent of type CRITICAL_THREAD_CRASH

2019-11-08 Thread Hadoop QA (Jira)
[ https://issues.apache.org/jira/browse/YARN-8373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16970388#comment-16970388 ] Hadoop QA commented on YARN-8373: - | (/) *{color:green}+1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Updated] (YARN-9952) Continuous scheduling thread crashes

2019-11-08 Thread Szilard Nemeth (Jira)
[ https://issues.apache.org/jira/browse/YARN-9952?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Szilard Nemeth updated YARN-9952: - Summary: Continuous scheduling thread crashes (was: ontinuous scheduling thread crashes) >

[jira] [Commented] (YARN-9930) Support max running app logic for CapacityScheduler

2019-11-08 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YARN-9930?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16970421#comment-16970421 ] Peter Bacsko commented on YARN-9930: [~epayne] thanks for explaining - now it's clear. I do agree that

[jira] [Commented] (YARN-9564) Create docker-to-squash tool for image conversion

2019-11-08 Thread Jim Brennan (Jira)
[ https://issues.apache.org/jira/browse/YARN-9564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16970382#comment-16970382 ] Jim Brennan commented on YARN-9564: --- Thanks for the updates [~ebadger]!  I am +1 (non-binding) on patch

[jira] [Commented] (YARN-8992) Fair scheduler can delete a dynamic queue while an application attempt is being added to the queue

2019-11-08 Thread Wilfred Spiegelenburg (Jira)
[ https://issues.apache.org/jira/browse/YARN-8992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16970394#comment-16970394 ] Wilfred Spiegelenburg commented on YARN-8992: - For YARN-8992 the fix version is set

[jira] [Commented] (YARN-9930) Support max running app logic for CapacityScheduler

2019-11-08 Thread Eric Payne (Jira)
[ https://issues.apache.org/jira/browse/YARN-9930?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16970417#comment-16970417 ] Eric Payne commented on YARN-9930: -- The Max Apps Per User Setting exists in CS, but it's a calculated

[jira] [Commented] (YARN-8373) RM Received RMFatalEvent of type CRITICAL_THREAD_CRASH

2019-11-08 Thread Hadoop QA (Jira)
[ https://issues.apache.org/jira/browse/YARN-8373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16970386#comment-16970386 ] Hadoop QA commented on YARN-8373: - | (/) *{color:green}+1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-9562) Add Java changes for the new RuncContainerRuntime

2019-11-08 Thread Jim Brennan (Jira)
[ https://issues.apache.org/jira/browse/YARN-9562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16970381#comment-16970381 ] Jim Brennan commented on YARN-9562: --- Thanks for the updates [~ebadger]!  I am +1 (non-binding) on patch

[jira] [Commented] (YARN-9561) Add C changes for the new RuncContainerRuntime

2019-11-08 Thread Jim Brennan (Jira)
[ https://issues.apache.org/jira/browse/YARN-9561?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16970380#comment-16970380 ] Jim Brennan commented on YARN-9561: --- Thanks for the updates [~ebadger]! A couple comments on the new

[jira] [Commented] (YARN-8990) Fix fair scheduler race condition in app submit and queue cleanup

2019-11-08 Thread Wilfred Spiegelenburg (Jira)
[ https://issues.apache.org/jira/browse/YARN-8990?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16970395#comment-16970395 ] Wilfred Spiegelenburg commented on YARN-8990: - Thank you [~Steven Rand] for making us aware of

[jira] [Commented] (YARN-9561) Add C changes for the new RuncContainerRuntime

2019-11-08 Thread Hadoop QA (Jira)
[ https://issues.apache.org/jira/browse/YARN-9561?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16970666#comment-16970666 ] Hadoop QA commented on YARN-9561: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-9561) Add C changes for the new RuncContainerRuntime

2019-11-08 Thread Eric Yang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9561?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16970646#comment-16970646 ] Eric Yang commented on YARN-9561: - [~ebadger] What is the right way to run test_runc_util with patch 11?

[jira] [Commented] (YARN-9965) Fix NodeManager failing to start when Hdfs Auxillary Jar is set

2019-11-08 Thread Hadoop QA (Jira)
[ https://issues.apache.org/jira/browse/YARN-9965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16970341#comment-16970341 ] Hadoop QA commented on YARN-9965: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-9940) avoid continuous scheduling thread crashes while sorting nodes get 'Comparison method violates its general contract'

2019-11-08 Thread kailiu_dev (Jira)
[ https://issues.apache.org/jira/browse/YARN-9940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16970342#comment-16970342 ] kailiu_dev commented on YARN-9940: -- [~wilfreds] , thank you. Yes, in our code now,   is protected void

[jira] [Commented] (YARN-9537) Add configuration to disable AM preemption

2019-11-08 Thread zhoukang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16969933#comment-16969933 ] zhoukang commented on YARN-9537: new patch added [~yufeigu] > Add configuration to disable AM preemption

[jira] [Updated] (YARN-9537) Add configuration to disable AM preemption

2019-11-08 Thread zhoukang (Jira)
[ https://issues.apache.org/jira/browse/YARN-9537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhoukang updated YARN-9537: --- Attachment: YARN-9537.004.patch > Add configuration to disable AM preemption >

[jira] [Comment Edited] (YARN-9957) The first container we recover may not be the AM

2019-11-08 Thread Xianghao Lu (Jira)
[ https://issues.apache.org/jira/browse/YARN-9957?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16969940#comment-16969940 ] Xianghao Lu edited comment on YARN-9957 at 11/8/19 8:30 AM: IMO, the root

[jira] [Comment Edited] (YARN-9957) The first container we recover may not be the AM

2019-11-08 Thread Xianghao Lu (Jira)
[ https://issues.apache.org/jira/browse/YARN-9957?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16969940#comment-16969940 ] Xianghao Lu edited comment on YARN-9957 at 11/8/19 8:29 AM: IMO, the root

[jira] [Updated] (YARN-9886) Queue mapping based on userid passed through application tag

2019-11-08 Thread Kinga Marton (Jira)
[ https://issues.apache.org/jira/browse/YARN-9886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kinga Marton updated YARN-9886: --- Attachment: YARN-9886.003.patch > Queue mapping based on userid passed through application tag >

[jira] [Created] (YARN-9965) ClassNotFoundException when auxiliary service is loaded from HDFS on subsequent restart of NM

2019-11-08 Thread Prabhu Joseph (Jira)
Prabhu Joseph created YARN-9965: --- Summary: ClassNotFoundException when auxiliary service is loaded from HDFS on subsequent restart of NM Key: YARN-9965 URL: https://issues.apache.org/jira/browse/YARN-9965

[jira] [Updated] (YARN-9965) ClassNotFoundException when auxiliary service is loaded from HDFS on subsequent restart of NM

2019-11-08 Thread Prabhu Joseph (Jira)
[ https://issues.apache.org/jira/browse/YARN-9965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prabhu Joseph updated YARN-9965: Component/s: nodemanager auxservices > ClassNotFoundException when auxiliary

[jira] [Updated] (YARN-9965) ClassNotFoundException when auxiliary service is loaded from HDFS on subsequent restart of NM

2019-11-08 Thread Prabhu Joseph (Jira)
[ https://issues.apache.org/jira/browse/YARN-9965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prabhu Joseph updated YARN-9965: Affects Version/s: 3.2.0 > ClassNotFoundException when auxiliary service is loaded from HDFS on >

[jira] [Updated] (YARN-9965) ClassNotFoundException when auxiliary service is loaded from HDFS on subsequent restart of NM

2019-11-08 Thread Prabhu Joseph (Jira)
[ https://issues.apache.org/jira/browse/YARN-9965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prabhu Joseph updated YARN-9965: Description: Loading an auxiliary jar from a Hdfs location on a node manager works as expected on

[jira] [Updated] (YARN-9965) ClassNotFoundException when auxiliary service is loaded from HDFS on subsequent restart of NM

2019-11-08 Thread Prabhu Joseph (Jira)
[ https://issues.apache.org/jira/browse/YARN-9965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prabhu Joseph updated YARN-9965: Description: Loading an auxiliary jar from a Hdfs location on a node manager works as expected on

[jira] [Commented] (YARN-9865) Capacity scheduler: add support for combined %user + %secondary_group mapping

2019-11-08 Thread Szilard Nemeth (Jira)
[ https://issues.apache.org/jira/browse/YARN-9865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16969938#comment-16969938 ] Szilard Nemeth commented on YARN-9865: -- Hi [~maniraj...@gmail.com]! Can you ask someone else please? 

[jira] [Updated] (YARN-9957) The first container we recover may not be the AM

2019-11-08 Thread Xianghao Lu (Jira)
[ https://issues.apache.org/jira/browse/YARN-9957?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xianghao Lu updated YARN-9957: -- Attachment: YARN-9957-branch-2.9.1.002.patch > The first container we recover may not be the AM >

[jira] [Commented] (YARN-9537) Add configuration to disable AM preemption

2019-11-08 Thread Hadoop QA (Jira)
[ https://issues.apache.org/jira/browse/YARN-9537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16970053#comment-16970053 ] Hadoop QA commented on YARN-9537: - | (/) *{color:green}+1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-9957) The first container we recover may not be the AM

2019-11-08 Thread Xianghao Lu (Jira)
[ https://issues.apache.org/jira/browse/YARN-9957?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16969940#comment-16969940 ] Xianghao Lu commented on YARN-9957: --- IMO, the root cause of the following case in YARN-7382 is

[jira] [Commented] (YARN-9948) Remove attempts that are beyond max-attempt limit from RMAppImpl

2019-11-08 Thread Hu Ziqian (Jira)
[ https://issues.apache.org/jira/browse/YARN-9948?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16970038#comment-16970038 ] Hu Ziqian commented on YARN-9948: - Hi [~hex108], in our cluster, one attempt will use more than 1k memory

[jira] [Resolved] (YARN-9861) The ResourceManager log reports an error "Too many open files", the analysis is related to the service

2019-11-08 Thread huiyangjian (Jira)
[ https://issues.apache.org/jira/browse/YARN-9861?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] huiyangjian resolved YARN-9861. --- Resolution: Fixed https://issues.apache.org/jira/browse/YARN-9837 The stream is not closed,The load

[jira] [Commented] (YARN-9957) The first container we recover may not be the AM

2019-11-08 Thread Hadoop QA (Jira)
[ https://issues.apache.org/jira/browse/YARN-9957?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16969942#comment-16969942 ] Hadoop QA commented on YARN-9957: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-9886) Queue mapping based on userid passed through application tag

2019-11-08 Thread Hadoop QA (Jira)
[ https://issues.apache.org/jira/browse/YARN-9886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16970269#comment-16970269 ] Hadoop QA commented on YARN-9886: - | (/) *{color:green}+1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Updated] (YARN-9923) Detect missing Docker binary or not running Docker daemon

2019-11-08 Thread Adam Antal (Jira)
[ https://issues.apache.org/jira/browse/YARN-9923?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Antal updated YARN-9923: - Attachment: YARN-9923.002.patch > Detect missing Docker binary or not running Docker daemon >

[jira] [Updated] (YARN-9965) Fix NodeManager failing to start when Hdfs Auxillary Jar is set

2019-11-08 Thread Prabhu Joseph (Jira)
[ https://issues.apache.org/jira/browse/YARN-9965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prabhu Joseph updated YARN-9965: Summary: Fix NodeManager failing to start when Hdfs Auxillary Jar is set (was:

[jira] [Commented] (YARN-8373) RM Received RMFatalEvent of type CRITICAL_THREAD_CRASH

2019-11-08 Thread Wilfred Spiegelenburg (Jira)
[ https://issues.apache.org/jira/browse/YARN-8373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16970224#comment-16970224 ] Wilfred Spiegelenburg commented on YARN-8373: - Your link points to code in master not in

[jira] [Updated] (YARN-8373) RM Received RMFatalEvent of type CRITICAL_THREAD_CRASH

2019-11-08 Thread Wilfred Spiegelenburg (Jira)
[ https://issues.apache.org/jira/browse/YARN-8373?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wilfred Spiegelenburg updated YARN-8373: Attachment: YARN-8373.002.patch > RM Received RMFatalEvent of type

[jira] [Updated] (YARN-9965) Fix NodeManager failing to start when Hdfs Auxillary Jar is set

2019-11-08 Thread Prabhu Joseph (Jira)
[ https://issues.apache.org/jira/browse/YARN-9965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prabhu Joseph updated YARN-9965: Attachment: YARN-9965-001.patch > Fix NodeManager failing to start when Hdfs Auxillary Jar is set >

[jira] [Commented] (YARN-8373) RM Received RMFatalEvent of type CRITICAL_THREAD_CRASH

2019-11-08 Thread Wilfred Spiegelenburg (Jira)
[ https://issues.apache.org/jira/browse/YARN-8373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16970241#comment-16970241 ] Wilfred Spiegelenburg commented on YARN-8373: - patch-003 to fix the imports the IDE had

[jira] [Updated] (YARN-8373) RM Received RMFatalEvent of type CRITICAL_THREAD_CRASH

2019-11-08 Thread Wilfred Spiegelenburg (Jira)
[ https://issues.apache.org/jira/browse/YARN-8373?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wilfred Spiegelenburg updated YARN-8373: Attachment: YARN-8373.003.patch > RM Received RMFatalEvent of type

[jira] [Resolved] (YARN-9952) ontinuous scheduling thread crashes

2019-11-08 Thread Wilfred Spiegelenburg (Jira)
[ https://issues.apache.org/jira/browse/YARN-9952?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wilfred Spiegelenburg resolved YARN-9952. - Resolution: Duplicate > ontinuous scheduling thread crashes >

[jira] [Commented] (YARN-9940) avoid continuous scheduling thread crashes while sorting nodes get 'Comparison method violates its general contract'

2019-11-08 Thread Wilfred Spiegelenburg (Jira)
[ https://issues.apache.org/jira/browse/YARN-9940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16970251#comment-16970251 ] Wilfred Spiegelenburg commented on YARN-9940: - The changes you are making are also not helpful