[jira] [Created] (YARN-8669) Yarn application has already ended! It might have been killed or unable to launch application master.

2018-08-15 Thread Bheemidi Vikram Reddy (JIRA)
Bheemidi Vikram Reddy created YARN-8669: --- Summary: Yarn application has already ended! It might have been killed or unable to launch application master. Key: YARN-8669 URL:

[jira] [Assigned] (YARN-8613) Old RM UI shows wrong vcores total value

2018-08-15 Thread Sen Zhao (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sen Zhao reassigned YARN-8613: -- Assignee: (was: Sen Zhao) > Old RM UI shows wrong vcores total value >

[jira] [Commented] (YARN-8668) Inconsistency between capacity and fair scheduler in the aspect of computing node available resource

2018-08-15 Thread Yeliang Cang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16581866#comment-16581866 ] Yeliang Cang commented on YARN-8668: Thanks [~leftnoteasy] for clarifying this, close this Jira as not

[jira] [Commented] (YARN-8667) Container Relaunch fails with "find: File system loop detected;" for tar ball artifacts

2018-08-15 Thread genericqa (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16581842#comment-16581842 ] genericqa commented on YARN-8667: - | (/) *{color:green}+1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Updated] (YARN-8662) Fair Scheduler stops scheduling when a queue is configured only CPU and memory

2018-08-15 Thread Sen Zhao (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8662?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sen Zhao updated YARN-8662: --- Component/s: fairscheduler > Fair Scheduler stops scheduling when a queue is configured only CPU and memory >

[jira] [Updated] (YARN-8597) Build Worker utility for MaWo Application

2018-08-15 Thread Yesha Vora (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yesha Vora updated YARN-8597: - Attachment: YARN-8597.001.patch > Build Worker utility for MaWo Application >

[jira] [Commented] (YARN-8667) Container Relaunch fails with "find: File system loop detected;" for tar ball artifacts

2018-08-15 Thread Chandni Singh (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16581802#comment-16581802 ] Chandni Singh commented on YARN-8667: - Patch 1 contains a fix and a unit test.  [~billie.rinaldi]

[jira] [Updated] (YARN-8667) Container Relaunch fails with "find: File system loop detected;" for tar ball artifacts

2018-08-15 Thread Chandni Singh (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chandni Singh updated YARN-8667: Attachment: YARN-8667.001.patch > Container Relaunch fails with "find: File system loop detected;"

[jira] [Assigned] (YARN-8569) Create an interface to provide cluster information to application

2018-08-15 Thread Eric Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Yang reassigned YARN-8569: --- Assignee: Eric Yang > Create an interface to provide cluster information to application >

[jira] [Commented] (YARN-8488) YARN service/components/instances should have SUCCEEDED/FAILED states

2018-08-15 Thread Eric Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16581598#comment-16581598 ] Eric Yang commented on YARN-8488: - [~suma.shivaprasad], thank you for the patch.  A few minor nitpicks: #

[jira] [Comment Edited] (YARN-8668) Inconsistency between capacity and fair scheduler in the aspect of computing node available resource

2018-08-15 Thread Wangda Tan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16581574#comment-16581574 ] Wangda Tan edited comment on YARN-8668 at 8/15/18 8:34 PM: --- Thanks [~Cyl] for

[jira] [Commented] (YARN-8509) Total pending resource calculation in preemption should use user-limit factor instead of minimum-user-limit-percent

2018-08-15 Thread Zian Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16581573#comment-16581573 ] Zian Chen commented on YARN-8509: - Offline discussed with Eric and Wangda, will upload a new patch to

[jira] [Commented] (YARN-8668) Inconsistency between capacity and fair scheduler in the aspect of computing node available resource

2018-08-15 Thread Wangda Tan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16581574#comment-16581574 ] Wangda Tan commented on YARN-8668: -- Thanks [~Cyl] for reporting the issue, this is by design in CS.

[jira] [Commented] (YARN-8474) sleeper service fails to launch with "Authentication Required"

2018-08-15 Thread genericqa (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16581558#comment-16581558 ] genericqa commented on YARN-8474: - | (/) *{color:green}+1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-8667) Container Relaunch fails with "find: File system loop detected;" for tar ball artifacts

2018-08-15 Thread Billie Rinaldi (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16581548#comment-16581548 ] Billie Rinaldi commented on YARN-8667: -- That sounds like the issue. Thanks for figuring out the

[jira] [Commented] (YARN-8667) Container Relaunch fails with "find: File system loop detected;" for tar ball artifacts

2018-08-15 Thread Chandni Singh (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16581524#comment-16581524 ] Chandni Singh commented on YARN-8667: - Before relaunch, container script and container tokens file is

[jira] [Updated] (YARN-8474) sleeper service fails to launch with "Authentication Required"

2018-08-15 Thread Billie Rinaldi (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Billie Rinaldi updated YARN-8474: - Attachment: YARN-8474.006.patch > sleeper service fails to launch with "Authentication Required"

[jira] [Commented] (YARN-8474) sleeper service fails to launch with "Authentication Required"

2018-08-15 Thread Billie Rinaldi (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16581488#comment-16581488 ] Billie Rinaldi commented on YARN-8474: -- Patch 6 fixes checkstyle issues. > sleeper service fails to

[jira] [Commented] (YARN-8474) sleeper service fails to launch with "Authentication Required"

2018-08-15 Thread genericqa (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16581470#comment-16581470 ] genericqa commented on YARN-8474: - | (/) *{color:green}+1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Assigned] (YARN-8667) Container Relaunch fails with "find: File system loop detected;" for tar ball artifacts

2018-08-15 Thread Chandni Singh (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chandni Singh reassigned YARN-8667: --- Assignee: Chandni Singh > Container Relaunch fails with "find: File system loop detected;"

[jira] [Commented] (YARN-8242) YARN NM: OOM error while reading back the state store on recovery

2018-08-15 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16581419#comment-16581419 ] Jason Lowe commented on YARN-8242: -- Thanks for updating the patch! bq. The problem/issue that I faced

[jira] [Updated] (YARN-8474) sleeper service fails to launch with "Authentication Required"

2018-08-15 Thread Billie Rinaldi (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Billie Rinaldi updated YARN-8474: - Attachment: YARN-8474.005.patch > sleeper service fails to launch with "Authentication Required"

[jira] [Commented] (YARN-8474) sleeper service fails to launch with "Authentication Required"

2018-08-15 Thread Billie Rinaldi (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16581410#comment-16581410 ] Billie Rinaldi commented on YARN-8474: -- Attached patch 5 based on patch 4 plus dependency cleanup. >

[jira] [Assigned] (YARN-8474) sleeper service fails to launch with "Authentication Required"

2018-08-15 Thread Billie Rinaldi (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Billie Rinaldi reassigned YARN-8474: Assignee: Billie Rinaldi (was: Eric Yang) > sleeper service fails to launch with

[jira] [Commented] (YARN-7708) [GPG] Load based policy generator

2018-08-15 Thread Botong Huang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7708?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16581348#comment-16581348 ] Botong Huang commented on YARN-7708: Committed to YARN-7402. Thanks [~youchen] for the patch!  >

[jira] [Commented] (YARN-7708) [GPG] Load based policy generator

2018-08-15 Thread Young Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7708?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16581323#comment-16581323 ] Young Chen commented on YARN-7708: -- Unit test failure is unrelated. > [GPG] Load based policy generator

[jira] [Commented] (YARN-8129) Improve error message for invalid value in fields attribute

2018-08-15 Thread Suma Shivaprasad (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16581269#comment-16581269 ] Suma Shivaprasad commented on YARN-8129: Thanks for the patch [~abmodi] Patch LGTM . +1 > Improve

[jira] [Commented] (YARN-8474) sleeper service fails to launch with "Authentication Required"

2018-08-15 Thread Billie Rinaldi (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16581217#comment-16581217 ] Billie Rinaldi commented on YARN-8474: -- I have done some testing with patch 4 and it looks pretty

[jira] [Commented] (YARN-8656) container-executor should not write cgroup tasks files for docker containers

2018-08-15 Thread Jim Brennan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8656?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16581110#comment-16581110 ] Jim Brennan commented on YARN-8656: --- I am unable to repro the unit test failure in 

[jira] [Updated] (YARN-8668) Inconsistency between capacity and fair scheduler in the aspect of computing node available resource

2018-08-15 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haibo Chen updated YARN-8668: - Labels: capacityscheduler (was: ) > Inconsistency between capacity and fair scheduler in the aspect of

[jira] [Updated] (YARN-8664) ApplicationMasterProtocolPBServiceImpl#allocate throw NPE when NM losting

2018-08-15 Thread Jiandan Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jiandan Yang updated YARN-8664: Description: ResourceManager logs about exception is: {code:java} 2018-08-09 00:52:30,746 WARN [IPC

[jira] [Commented] (YARN-8668) Inconsistency between capacity and fair scheduler in the aspect of computing node available resource

2018-08-15 Thread genericqa (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16580937#comment-16580937 ] genericqa commented on YARN-8668: - | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Commented] (YARN-8664) ApplicationMasterProtocolPBServiceImpl#allocate throw NPE when NM losting

2018-08-15 Thread Weiwei Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16580902#comment-16580902 ] Weiwei Yang commented on YARN-8664: --- Hi [~yangjiandan] Yeah, seems like the jenkins env is broken on

[jira] [Commented] (YARN-8668) Inconsistency between capacity and fair scheduler in the aspect of computing node available resource

2018-08-15 Thread Yeliang Cang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16580892#comment-16580892 ] Yeliang Cang commented on YARN-8668: Submit a patch to resolve this! > Inconsistency between capacity

[jira] [Updated] (YARN-8668) Inconsistency between capacity and fair scheduler in the aspect of computing node available resource

2018-08-15 Thread Yeliang Cang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yeliang Cang updated YARN-8668: --- Attachment: YARN-8668.001.patch > Inconsistency between capacity and fair scheduler in the aspect of

[jira] [Updated] (YARN-8668) Inconsistency between capacity and fair scheduler in the aspect of computing node available resource

2018-08-15 Thread Yeliang Cang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yeliang Cang updated YARN-8668: --- Description: We have observed that given capacityScheduler and defaultResourceCalculor,   when there

[jira] [Created] (YARN-8668) Inconsistency between capacity and fair scheduler in the aspect of computing node available resource

2018-08-15 Thread Yeliang Cang (JIRA)
Yeliang Cang created YARN-8668: -- Summary: Inconsistency between capacity and fair scheduler in the aspect of computing node available resource Key: YARN-8668 URL: https://issues.apache.org/jira/browse/YARN-8668

[jira] [Commented] (YARN-8664) ApplicationMasterProtocolPBServiceImpl#allocate throw NPE when NM losting

2018-08-15 Thread Jiandan Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16580881#comment-16580881 ] Jiandan Yang commented on YARN-8664: - [~cheersyang] Jenkins is probably not OK. Would you please fix

[jira] [Commented] (YARN-8513) CapacityScheduler infinite loop when queue is near fully utilized

2018-08-15 Thread Chen Yufei (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16580862#comment-16580862 ] Chen Yufei commented on YARN-8513: -- We got infinite loops two times recently with 2.9.1, restarting

[jira] [Commented] (YARN-8667) Container Relaunch fails with "find: File system loop detected;" for tar ball artifacts

2018-08-15 Thread Rohith Sharma K S (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16580751#comment-16580751 ] Rohith Sharma K S commented on YARN-8667: - Container Relaunch shares same working directory. As a

[jira] [Created] (YARN-8667) Container Relaunch fails with "find: File system loop detected;" for tar ball artifacts

2018-08-15 Thread Rohith Sharma K S (JIRA)
Rohith Sharma K S created YARN-8667: --- Summary: Container Relaunch fails with "find: File system loop detected;" for tar ball artifacts Key: YARN-8667 URL: https://issues.apache.org/jira/browse/YARN-8667