[jira] [Updated] (YARN-8718) Merge related work for YARN-3409

2018-08-28 Thread Sunil Govindan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sunil Govindan updated YARN-8718: - Attachment: YARN-8718.004.patch > Merge related work for YARN-3409 >

[jira] [Updated] (YARN-8718) Merge related work for YARN-3409

2018-08-28 Thread Sunil Govindan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sunil Govindan updated YARN-8718: - Attachment: (was: YARN-3409.004.patch) > Merge related work for YARN-3409 >

[jira] [Commented] (YARN-8718) Merge related work for YARN-3409

2018-08-28 Thread Sunil Govindan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8718?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595953#comment-16595953 ] Sunil Govindan commented on YARN-8718: -- Thanks [~leftnoteasy]. Yes, i took from a wrong commit id

[jira] [Updated] (YARN-8718) Merge related work for YARN-3409

2018-08-28 Thread Sunil Govindan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sunil Govindan updated YARN-8718: - Attachment: YARN-3409.004.patch > Merge related work for YARN-3409 >

[jira] [Commented] (YARN-8720) CapacityScheduler does not enforce yarn.scheduler.capacity..maximum-allocation-mb/vcores when configured

2018-08-28 Thread Tarun Parimi (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8720?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595951#comment-16595951 ] Tarun Parimi commented on YARN-8720: [~sunilg] Can you help review this? Thanks > CapacityScheduler

[jira] [Commented] (YARN-8569) Create an interface to provide cluster information to application

2018-08-28 Thread Wangda Tan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595949#comment-16595949 ] Wangda Tan commented on YARN-8569: -- [~eyang], As we discussed offline, the use case is not clear to me.

[jira] [Updated] (YARN-8535) Fix DistributedShell unit tests

2018-08-28 Thread Abhishek Modi (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Abhishek Modi updated YARN-8535: Attachment: YARN-8535.002.patch > Fix DistributedShell unit tests > ---

[jira] [Commented] (YARN-8535) Fix DistributedShell unit tests

2018-08-28 Thread Abhishek Modi (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595947#comment-16595947 ] Abhishek Modi commented on YARN-8535: - Added v2 patch with minor changes in TestDistributedShell to

[jira] [Commented] (YARN-8718) Merge related work for YARN-3409

2018-08-28 Thread Wangda Tan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8718?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595945#comment-16595945 ] Wangda Tan commented on YARN-8718: -- [~sunilg], the attached patch doesn't look correct. > Merge related

[jira] [Commented] (YARN-8220) Running Tensorflow on YARN with GPU and Docker - Examples

2018-08-28 Thread Wangda Tan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595944#comment-16595944 ] Wangda Tan commented on YARN-8220: -- Thanks [~sunilg], I think we should close this JIRA. > Running

[jira] [Commented] (YARN-8220) Running Tensorflow on YARN with GPU and Docker - Examples

2018-08-28 Thread Sunil Govindan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595942#comment-16595942 ] Sunil Govindan commented on YARN-8220: -- hi [~leftnoteasy] As Submarine is in, i think this work is

[jira] [Commented] (YARN-8680) YARN NM: Implement Iterable Abstraction for LocalResourceTrackerstate

2018-08-28 Thread Sunil Govindan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8680?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595939#comment-16595939 ] Sunil Govindan commented on YARN-8680: -- Hi [~pradeepambati] As this jira is marked for 3.2 as a

[jira] [Commented] (YARN-8657) User limit calculation should be read-lock-protected within LeafQueue

2018-08-28 Thread Sunil Govindan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595937#comment-16595937 ] Sunil Govindan commented on YARN-8657: -- I think there are no changes in ordering. Only annotation was

[jira] [Commented] (YARN-7505) RM REST endpoints generate malformed JSON

2018-08-28 Thread Sunil Govindan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7505?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595928#comment-16595928 ] Sunil Govindan commented on YARN-7505: -- [~templedf] As this jira is marked for 3.2 as a critical,

[jira] [Commented] (YARN-8340) Capacity Scheduler Intra Queue Preemption Should Work When 3rd or more resources enabled.

2018-08-28 Thread Sunil Govindan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595923#comment-16595923 ] Sunil Govindan commented on YARN-8340: -- [~leftnoteasy] [~Zian Chen] [~eepayne] As this jira is

[jira] [Commented] (YARN-8286) Add NMClient callback on container relaunch

2018-08-28 Thread Sunil Govindan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595920#comment-16595920 ] Sunil Govindan commented on YARN-8286: -- Ping [~billie.rinaldi] As this jira is marked for 3.2 as a

[jira] [Commented] (YARN-8709) intra-queue preemption checker always fail since one under-served queue was deleted

2018-08-28 Thread Tao Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595896#comment-16595896 ] Tao Yang commented on YARN-8709: Thanks [~eepayne], [~sunilg] for your suggestion! It makes sense to me, I

[jira] [Commented] (YARN-8709) intra-queue preemption checker always fail since one under-served queue was deleted

2018-08-28 Thread Sunil Govindan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595890#comment-16595890 ] Sunil Govindan commented on YARN-8709: -- Thanks [~Tao Yang] for raising this issue. Yes, I agree with

[jira] [Commented] (YARN-8723) Fix a typo in CS init error message when resource calculator is not correctly set

2018-08-28 Thread Weiwei Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595878#comment-16595878 ] Weiwei Yang commented on YARN-8723: --- Just pushed this to trunk and branch-3.1, thanks for getting this

[jira] [Updated] (YARN-8718) Merge related work for YARN-3409

2018-08-28 Thread Sunil Govindan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sunil Govindan updated YARN-8718: - Attachment: YARN-8718.003.patch > Merge related work for YARN-3409 >

[jira] [Updated] (YARN-8717) set memory.limit_in_bytes when NodeManager starting

2018-08-28 Thread Weiwei Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8717?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weiwei Yang updated YARN-8717: -- Labels: cgroups (was: ) > set memory.limit_in_bytes when NodeManager starting >

[jira] [Updated] (YARN-8723) Fix a typo in CS init error message when resource calculator is not correctly set

2018-08-28 Thread Weiwei Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weiwei Yang updated YARN-8723: -- Summary: Fix a typo in CS init error message when resource calculator is not correctly set (was: Typo

[jira] [Commented] (YARN-8723) Typo in CS init error message when resource calculator is not correctly set

2018-08-28 Thread Weiwei Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595867#comment-16595867 ] Weiwei Yang commented on YARN-8723: --- +1, committing now. > Typo in CS init error message when resource

[jira] [Commented] (YARN-8722) Failed to get native service application status when security is enabled

2018-08-28 Thread Eric Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595853#comment-16595853 ] Eric Yang commented on YARN-8722: - [~yuan_zac] {quote}In the current implementation, we need to add the

[jira] [Commented] (YARN-8706) DelayedProcessKiller is executed for Docker containers even though docker stop sends a KILL signal after the specified grace period

2018-08-28 Thread Eric Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595851#comment-16595851 ] Eric Yang commented on YARN-8706: - {quote}But then we have redundant configs for no reason. And we would

[jira] [Commented] (YARN-8722) Failed to get native service application status when security is enabled

2018-08-28 Thread Zac Zhou (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595840#comment-16595840 ] Zac Zhou commented on YARN-8722: [~leftnoteasy] Yeah, without the proxy settings,we can't submit the job

[jira] [Updated] (YARN-8664) ApplicationMasterProtocolPBServiceImpl#allocate throw NPE when NM losting

2018-08-28 Thread Jiandan Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jiandan Yang updated YARN-8664: Attachment: YARN-8664-branch-2.8.002.pathch > ApplicationMasterProtocolPBServiceImpl#allocate throw

[jira] [Commented] (YARN-7865) Node attributes documentation

2018-08-28 Thread Naganarasimha G R (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595792#comment-16595792 ] Naganarasimha G R commented on YARN-7865: - [~bibinchundatt], Have attached a patch without all

[jira] [Updated] (YARN-7865) Node attributes documentation

2018-08-28 Thread Naganarasimha G R (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7865?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Naganarasimha G R updated YARN-7865: Attachment: YARN-7865-YARN-3409.003.patch > Node attributes documentation >

[jira] [Commented] (YARN-8569) Create an interface to provide cluster information to application

2018-08-28 Thread Eric Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595777#comment-16595777 ] Eric Yang commented on YARN-8569: - Patch 2 is a fully working implementation. When YARN service become

[jira] [Updated] (YARN-8569) Create an interface to provide cluster information to application

2018-08-28 Thread Eric Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Yang updated YARN-8569: Attachment: YARN-8569.002.patch > Create an interface to provide cluster information to application >

[jira] [Commented] (YARN-8658) Metrics for AMRMClientRelayer inside FederationInterceptor

2018-08-28 Thread genericqa (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8658?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595757#comment-16595757 ] genericqa commented on YARN-8658: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-8648) Container cgroups are leaked when using docker

2018-08-28 Thread genericqa (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595740#comment-16595740 ] genericqa commented on YARN-8648: - | (/) *{color:green}+1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-8697) LocalityMulticastAMRMProxyPolicy should fallback to random sub-cluster when cannot resolve resource

2018-08-28 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595725#comment-16595725 ] Hudson commented on YARN-8697: -- SUCCESS: Integrated in Jenkins build Hadoop-trunk-Commit #14854 (See

[jira] [Commented] (YARN-8468) Limit container sizes per queue in FairScheduler

2018-08-28 Thread Wangda Tan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595712#comment-16595712 ] Wangda Tan commented on YARN-8468: -- 1) Is it sufficient to make changes like YARN-1582, IIUC, it doesn't

[jira] [Commented] (YARN-8697) LocalityMulticastAMRMProxyPolicy should fallback to random sub-cluster when cannot resolve resource

2018-08-28 Thread Giovanni Matteo Fumarola (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595709#comment-16595709 ] Giovanni Matteo Fumarola commented on YARN-8697: Done. > LocalityMulticastAMRMProxyPolicy

[jira] [Commented] (YARN-8697) LocalityMulticastAMRMProxyPolicy should fallback to random sub-cluster when cannot resolve resource

2018-08-28 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595705#comment-16595705 ] Botong Huang commented on YARN-8697: Thanks [~giovanni.fumarola]! Can you please cherry-pick to

[jira] [Updated] (YARN-8697) LocalityMulticastAMRMProxyPolicy should fallback to random sub-cluster when cannot resolve resource

2018-08-28 Thread Giovanni Matteo Fumarola (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8697?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Giovanni Matteo Fumarola updated YARN-8697: --- Fix Version/s: 3.2.0 > LocalityMulticastAMRMProxyPolicy should fallback to

[jira] [Commented] (YARN-8697) LocalityMulticastAMRMProxyPolicy should fallback to random sub-cluster when cannot resolve resource

2018-08-28 Thread Giovanni Matteo Fumarola (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595703#comment-16595703 ] Giovanni Matteo Fumarola commented on YARN-8697: Thanks [~botong] . The change is

[jira] [Comment Edited] (YARN-8706) DelayedProcessKiller is executed for Docker containers even though docker stop sends a KILL signal after the specified grace period

2018-08-28 Thread Chandni Singh (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595603#comment-16595603 ] Chandni Singh edited comment on YARN-8706 at 8/28/18 10:24 PM: --- {quote}

[jira] [Updated] (YARN-8648) Container cgroups are leaked when using docker

2018-08-28 Thread Jim Brennan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8648?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Brennan updated YARN-8648: -- Attachment: YARN-8648.001.patch > Container cgroups are leaked when using docker >

[jira] [Updated] (YARN-8658) Metrics for AMRMClientRelayer inside FederationInterceptor

2018-08-28 Thread Young Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Young Chen updated YARN-8658: - Attachment: YARN-8658.01.patch > Metrics for AMRMClientRelayer inside FederationInterceptor >

[jira] [Commented] (YARN-8706) DelayedProcessKiller is executed for Docker containers even though docker stop sends a KILL signal after the specified grace period

2018-08-28 Thread Eric Badger (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595623#comment-16595623 ] Eric Badger commented on YARN-8706: --- bq. If this is setup properly, code only needs to ensure

[jira] [Commented] (YARN-8706) DelayedProcessKiller is executed for Docker containers even though docker stop sends a KILL signal after the specified grace period

2018-08-28 Thread Eric Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595612#comment-16595612 ] Eric Yang commented on YARN-8706: - [~csingh] We can arrange it as NM_SLEEP_DELAY_BEFORE_SIGKILL_MS to be

[jira] [Comment Edited] (YARN-8706) DelayedProcessKiller is executed for Docker containers even though docker stop sends a KILL signal after the specified grace period

2018-08-28 Thread Chandni Singh (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595603#comment-16595603 ] Chandni Singh edited comment on YARN-8706 at 8/28/18 9:15 PM: -- {quote} Docker

[jira] [Commented] (YARN-8509) Total pending resource calculation in preemption should use user-limit factor instead of minimum-user-limit-percent

2018-08-28 Thread Eric Payne (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595610#comment-16595610 ] Eric Payne commented on YARN-8509: -- bq. I'll comment on the updates once I have some progress. [~Zian

[jira] [Comment Edited] (YARN-8706) DelayedProcessKiller is executed for Docker containers even though docker stop sends a KILL signal after the specified grace period

2018-08-28 Thread Chandni Singh (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595603#comment-16595603 ] Chandni Singh edited comment on YARN-8706 at 8/28/18 9:08 PM: -- {quote} Docker

[jira] [Commented] (YARN-8706) DelayedProcessKiller is executed for Docker containers even though docker stop sends a KILL signal after the specified grace period

2018-08-28 Thread Chandni Singh (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595603#comment-16595603 ] Chandni Singh commented on YARN-8706: - {quote} Docker stop already covers sending the custom signal,

[jira] [Commented] (YARN-8706) DelayedProcessKiller is executed for Docker containers even though docker stop sends a KILL signal after the specified grace period

2018-08-28 Thread Eric Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595585#comment-16595585 ] Eric Yang commented on YARN-8706: - [~ebadger] suggested solution of discover STOPSIGNAL and perform this

[jira] [Commented] (YARN-7018) Interface for adding extra behavior to node heartbeats

2018-08-28 Thread Wangda Tan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595575#comment-16595575 ] Wangda Tan commented on YARN-7018: -- [~jlowe], given the fields need to be updated should all inside

[jira] [Commented] (YARN-8706) DelayedProcessKiller is executed for Docker containers even though docker stop sends a KILL signal after the specified grace period

2018-08-28 Thread Shane Kumpf (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595562#comment-16595562 ] Shane Kumpf commented on YARN-8706: --- Seems like a reasonable solution to me. {{docker stop}} has been a

[jira] [Commented] (YARN-8706) DelayedProcessKiller is executed for Docker containers even though docker stop sends a KILL signal after the specified grace period

2018-08-28 Thread Eric Badger (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595535#comment-16595535 ] Eric Badger commented on YARN-8706: --- bq. I can work on it, if there aren't any concerns? Sounds good to

[jira] [Commented] (YARN-8706) DelayedProcessKiller is executed for Docker containers even though docker stop sends a KILL signal after the specified grace period

2018-08-28 Thread Chandni Singh (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595515#comment-16595515 ] Chandni Singh commented on YARN-8706: - Thanks [~shaneku...@gmail.com] and [~ebadger] for the

[jira] [Commented] (YARN-8706) DelayedProcessKiller is executed for Docker containers even though docker stop sends a KILL signal after the specified grace period

2018-08-28 Thread Shane Kumpf (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595507#comment-16595507 ] Shane Kumpf commented on YARN-8706: --- Thanks for reporting this, [~csingh]. I know several of us

[jira] [Assigned] (YARN-8706) DelayedProcessKiller is executed for Docker containers even though docker stop sends a KILL signal after the specified grace period

2018-08-28 Thread Shane Kumpf (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shane Kumpf reassigned YARN-8706: - Assignee: Shane Kumpf (was: Chandni Singh) > DelayedProcessKiller is executed for Docker

[jira] [Assigned] (YARN-8706) DelayedProcessKiller is executed for Docker containers even though docker stop sends a KILL signal after the specified grace period

2018-08-28 Thread Shane Kumpf (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shane Kumpf reassigned YARN-8706: - Assignee: Chandni Singh (was: Shane Kumpf) > DelayedProcessKiller is executed for Docker

[jira] [Commented] (YARN-8696) FederationInterceptor upgrade: home sub-cluster heartbeat async

2018-08-28 Thread genericqa (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8696?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595492#comment-16595492 ] genericqa commented on YARN-8696: - | (/) *{color:green}+1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-8623) Update Docker examples to use image which exists

2018-08-28 Thread Shane Kumpf (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595488#comment-16595488 ] Shane Kumpf commented on YARN-8623: --- Thanks [~elek]. Craig and I had a quick offline chat. The public

[jira] [Comment Edited] (YARN-8642) Add support for tmpfs mounts with the Docker runtime

2018-08-28 Thread Craig Condit (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595336#comment-16595336 ] Craig Condit edited comment on YARN-8642 at 8/28/18 7:24 PM: - [~ebadger], the

[jira] [Commented] (YARN-8642) Add support for tmpfs mounts with the Docker runtime

2018-08-28 Thread Shane Kumpf (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595446#comment-16595446 ] Shane Kumpf commented on YARN-8642: --- Thanks for the patch, [~ccondit-target]! With this patch (+ prior

[jira] [Commented] (YARN-8722) Failed to get native service application status when security is enabled

2018-08-28 Thread Eric Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595440#comment-16595440 ] Eric Yang commented on YARN-8722: - YARN implementation is built as a thick client and thick server.  The

[jira] [Comment Edited] (YARN-8706) DelayedProcessKiller is executed for Docker containers even though docker stop sends a KILL signal after the specified grace period

2018-08-28 Thread Chandni Singh (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595410#comment-16595410 ] Chandni Singh edited comment on YARN-8706 at 8/28/18 6:48 PM: -- {quote} Really

[jira] [Commented] (YARN-8535) Fix DistributedShell unit tests

2018-08-28 Thread Bibin A Chundatt (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595425#comment-16595425 ] Bibin A Chundatt commented on YARN-8535: Jenkins didn't trigger required testclass . Could you

[jira] [Updated] (YARN-8535) Fix DistributedShell unit tests

2018-08-28 Thread Bibin A Chundatt (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bibin A Chundatt updated YARN-8535: --- Summary: Fix DistributedShell unit tests (was: DistributedShell unit tests are failing) >

[jira] [Commented] (YARN-8706) DelayedProcessKiller is executed for Docker containers even though docker stop sends a KILL signal after the specified grace period

2018-08-28 Thread Chandni Singh (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595410#comment-16595410 ] Chandni Singh commented on YARN-8706: - {quote} Really it would be better if we didn't send the kill

[jira] [Commented] (YARN-8488) YARN service/components/instances should have SUCCEEDED/FAILED states

2018-08-28 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595403#comment-16595403 ] Hudson commented on YARN-8488: -- SUCCESS: Integrated in Jenkins build Hadoop-trunk-Commit #14848 (See

[jira] [Comment Edited] (YARN-8706) DelayedProcessKiller is executed for Docker containers even though docker stop sends a KILL signal after the specified grace period

2018-08-28 Thread Chandni Singh (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16594348#comment-16594348 ] Chandni Singh edited comment on YARN-8706 at 8/28/18 6:15 PM: -- I can see 2

[jira] [Comment Edited] (YARN-8706) DelayedProcessKiller is executed for Docker containers even though docker stop sends a KILL signal after the specified grace period

2018-08-28 Thread Chandni Singh (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16594348#comment-16594348 ] Chandni Singh edited comment on YARN-8706 at 8/28/18 6:13 PM: -- I can see 2

[jira] [Commented] (YARN-8468) Limit container sizes per queue in FairScheduler

2018-08-28 Thread genericqa (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595380#comment-16595380 ] genericqa commented on YARN-8468: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Updated] (YARN-7018) Interface for adding extra behavior to node heartbeats

2018-08-28 Thread Manikandan R (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manikandan R updated YARN-7018: --- Attachment: YARN-7018.POC.002.patch > Interface for adding extra behavior to node heartbeats >

[jira] [Commented] (YARN-7018) Interface for adding extra behavior to node heartbeats

2018-08-28 Thread Manikandan R (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595356#comment-16595356 ] Manikandan R commented on YARN-7018: [~jlowe] Thanks for detailed comments. {quote}That's why it may

[jira] [Commented] (YARN-8642) Add support for tmpfs mounts with the Docker runtime

2018-08-28 Thread Eric Badger (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595349#comment-16595349 ] Eric Badger commented on YARN-8642: --- Ahhh that's right, we invoke using exec now. I had forgotten about

[jira] [Issue Comment Deleted] (YARN-7018) Interface for adding extra behavior to node heartbeats

2018-08-28 Thread Manikandan R (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manikandan R updated YARN-7018: --- Comment: was deleted (was: [~jlowe] Thanks for detailed comments. {quote}That's why it may be

[jira] [Commented] (YARN-7018) Interface for adding extra behavior to node heartbeats

2018-08-28 Thread Manikandan R (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595347#comment-16595347 ] Manikandan R commented on YARN-7018: [~jlowe] Thanks for detailed comments. {quote}That's why it may

[jira] [Commented] (YARN-8722) Failed to get native service application status when security is enabled

2018-08-28 Thread Wangda Tan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595340#comment-16595340 ] Wangda Tan commented on YARN-8722: -- Thanks [~eyang], [~yuan_zac], are you able to *submit* job without

[jira] [Commented] (YARN-8642) Add support for tmpfs mounts with the Docker runtime

2018-08-28 Thread Craig Condit (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595336#comment-16595336 ] Craig Condit commented on YARN-8642: [~ebadger], the regex is based on the one used for normal mounts.

[jira] [Commented] (YARN-8722) Failed to get native service application status when security is enabled

2018-08-28 Thread Eric Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595310#comment-16595310 ] Eric Yang commented on YARN-8722: - This looks like a misconfiguration on the tested cluster.  YARN user is

[jira] [Commented] (YARN-8709) intra-queue preemption checker always fail since one under-served queue was deleted

2018-08-28 Thread Eric Payne (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595277#comment-16595277 ] Eric Payne commented on YARN-8709: -- [~Tao Yang], thanks for raising this issue and suggesting a fix:

[jira] [Commented] (YARN-8488) YARN service/components/instances should have SUCCEEDED/FAILED states

2018-08-28 Thread Eric Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595276#comment-16595276 ] Eric Yang commented on YARN-8488: - +1 looks good to me.  Committing shortly. > YARN

[jira] [Updated] (YARN-8696) FederationInterceptor upgrade: home sub-cluster heartbeat async

2018-08-28 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8696?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Botong Huang updated YARN-8696: --- Attachment: YARN-8696.v4.patch > FederationInterceptor upgrade: home sub-cluster heartbeat async >

[jira] [Commented] (YARN-7865) Node attributes documentation

2018-08-28 Thread genericqa (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595186#comment-16595186 ] genericqa commented on YARN-7865: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-8722) Failed to get native service application status when security is enabled

2018-08-28 Thread Wangda Tan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595185#comment-16595185 ] Wangda Tan commented on YARN-8722: -- [~eyang], [~billie.rinaldi], have we seen this issue when trying to

[jira] [Commented] (YARN-8699) Add Yarnclient#yarnclusterMetrics API implementation in router

2018-08-28 Thread genericqa (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595184#comment-16595184 ] genericqa commented on YARN-8699: - | (/) *{color:green}+1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Updated] (YARN-8722) Failed to get native service application status when security is enabled

2018-08-28 Thread Wangda Tan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8722?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wangda Tan updated YARN-8722: - Environment: (was: The environment context is as follows: 1) Security enabled. kerberos 2) Klist

[jira] [Updated] (YARN-8722) Failed to get native service application status when security is enabled

2018-08-28 Thread Wangda Tan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8722?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wangda Tan updated YARN-8722: - Description: Can't get job status with the following command, after a submarine job is submitted.

[jira] [Commented] (YARN-8706) DelayedProcessKiller is executed for Docker containers even though docker stop sends a KILL signal after the specified grace period

2018-08-28 Thread Eric Badger (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595170#comment-16595170 ] Eric Badger commented on YARN-8706: --- Really it would be better if we didn't send the kill from docker

[jira] [Updated] (YARN-8468) Limit container sizes per queue in FairScheduler

2018-08-28 Thread JIRA
[ https://issues.apache.org/jira/browse/YARN-8468?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antal Bálint Steinbach updated YARN-8468: - Attachment: YARN-8468.007.patch > Limit container sizes per queue in

[jira] [Commented] (YARN-7865) Node attributes documentation

2018-08-28 Thread Naganarasimha G R (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595119#comment-16595119 ] Naganarasimha G R commented on YARN-7865: - Thanks [~bibinchundatt] for the review comments and

[jira] [Updated] (YARN-7865) Node attributes documentation

2018-08-28 Thread Naganarasimha G R (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7865?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Naganarasimha G R updated YARN-7865: Attachment: YARN-7865-YARN-3409.002.patch > Node attributes documentation >

[jira] [Updated] (YARN-7865) Node attributes documentation

2018-08-28 Thread Naganarasimha G R (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7865?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Naganarasimha G R updated YARN-7865: Attachment: NodeAttributes.html > Node attributes documentation >

[jira] [Updated] (YARN-7865) Node attributes documentation

2018-08-28 Thread Naganarasimha G R (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7865?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Naganarasimha G R updated YARN-7865: Attachment: (was: NodeAttributes.html) > Node attributes documentation >

[jira] [Commented] (YARN-8535) DistributedShell unit tests are failing

2018-08-28 Thread genericqa (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595095#comment-16595095 ] genericqa commented on YARN-8535: - | (/) *{color:green}+1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-8535) DistributedShell unit tests are failing

2018-08-28 Thread Bibin A Chundatt (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595012#comment-16595012 ] Bibin A Chundatt commented on YARN-8535: [~abmodi] Thank you for analysis . As per analysis make

[jira] [Commented] (YARN-8699) Add Yarnclient#yarnclusterMetrics API implementation in router

2018-08-28 Thread Bibin A Chundatt (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16595005#comment-16595005 ] Bibin A Chundatt commented on YARN-8699: Thank you [~giovanni.fumarola] for review Attaching

[jira] [Updated] (YARN-8699) Add Yarnclient#yarnclusterMetrics API implementation in router

2018-08-28 Thread Bibin A Chundatt (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bibin A Chundatt updated YARN-8699: --- Attachment: YARN-8699.002.patch > Add Yarnclient#yarnclusterMetrics API implementation in

[jira] [Updated] (YARN-8535) DistributedShell unit tests are failing

2018-08-28 Thread Abhishek Modi (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Abhishek Modi updated YARN-8535: Attachment: YARN-8535.001.patch > DistributedShell unit tests are failing >

[jira] [Commented] (YARN-8725) Submarine job staging directory has a lot of useless PRIMARY_WORKER-launch-script-***.sh scripts when submitting a job multiple times

2018-08-28 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16594966#comment-16594966 ] Zhankun Tang commented on YARN-8725: [~yuan_zac] Thanks for pointing this. We should clean up the

[jira] [Assigned] (YARN-8725) Submarine job staging directory has a lot of useless PRIMARY_WORKER-launch-script-***.sh scripts when submitting a job multiple times

2018-08-28 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang reassigned YARN-8725: -- Assignee: Zhankun Tang (was: Zac Zhou) > Submarine job staging directory has a lot of useless

[jira] [Updated] (YARN-8725) Submarine job staging directory has a lot of useless PRIMARY_WORKER-launch-script-***.sh scripts when submitting a job multiple times

2018-08-28 Thread Zac Zhou (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zac Zhou updated YARN-8725: --- Description: Submarine jobs upload core-site.xml, hdfs-site.xml, job.info and 

[jira] [Updated] (YARN-8725) Submarine job staging directory has a lot of useless PRIMARY_WORKER-launch-script-***.sh scripts when submitting a job multiple times

2018-08-28 Thread Zac Zhou (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zac Zhou updated YARN-8725: --- Description: Submarine jobs upload core-site.xml, hdfs-site.xml, job.info and 

  1   2   >