[jira] [Updated] (YARN-1365) ApplicationMasterService to allow Register and Unregister of an app that was running before restart

2014-06-19 Thread Anubhav Dhoot (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anubhav Dhoot updated YARN-1365: Attachment: YARN-1365.006.patch Addressed comments. Also includes changes for no duplicate events

[jira] [Commented] (YARN-2142) Add one service to check the nodes' TRUST status

2014-06-19 Thread Vinod Kumar Vavilapalli (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037045#comment-14037045 ] Vinod Kumar Vavilapalli commented on YARN-2142: --- bq. Because of critical

[jira] [Commented] (YARN-2175) Container localization has no timeouts and tasks can be stuck there for a long time

2014-06-19 Thread Vinod Kumar Vavilapalli (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2175?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037055#comment-14037055 ] Vinod Kumar Vavilapalli commented on YARN-2175: --- bq. there is no way to kill

[jira] [Commented] (YARN-2142) Add one service to check the nodes' TRUST status

2014-06-19 Thread anders (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037068#comment-14037068 ] anders commented on YARN-2142: -- In the cluster ,every node has been registered on one machine

[jira] [Commented] (YARN-2144) Add logs when preemption occurs

2014-06-19 Thread Wangda Tan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037115#comment-14037115 ] Wangda Tan commented on YARN-2144: -- Attached a patch only contains preemption log related

[jira] [Updated] (YARN-2142) Add one service to check the nodes' TRUST status

2014-06-19 Thread anders (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] anders updated YARN-2142: - Attachment: trust.patch Fix the WebUI. Add one service to check the nodes' TRUST status

[jira] [Commented] (YARN-2142) Add one service to check the nodes' TRUST status

2014-06-19 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037130#comment-14037130 ] Hadoop QA commented on YARN-2142: - {color:red}-1 overall{color}. Here are the results of

[jira] [Updated] (YARN-2142) Add one service to check the nodes' TRUST status

2014-06-19 Thread anders (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] anders updated YARN-2142: - Description: Because of critical computing environment ,we must test every node's TRUST status in the cluster

[jira] [Commented] (YARN-2175) Container localization has no timeouts and tasks can be stuck there for a long time

2014-06-19 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2175?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037337#comment-14037337 ] Jason Lowe commented on YARN-2175: -- I also wonder if there's been a regression, since at

[jira] [Commented] (YARN-2178) TestApplicationMasterService sometimes fails in trunk

2014-06-19 Thread Mit Desai (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037399#comment-14037399 ] Mit Desai commented on YARN-2178: - Hi Ted, How did you reproduce this? I tried mvn clean

[jira] [Updated] (YARN-2052) ContainerId creation after work preserving restart is broken

2014-06-19 Thread Tsuyoshi OZAWA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tsuyoshi OZAWA updated YARN-2052: - Attachment: YARN-2052.4.patch ContainerId creation after work preserving restart is broken

[jira] [Commented] (YARN-2178) TestApplicationMasterService sometimes fails in trunk

2014-06-19 Thread Ted Yu (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037451#comment-14037451 ] Ted Yu commented on YARN-2178: -- I ran TestApplicationMasterService on Mac and it passed. Let

[jira] [Commented] (YARN-2052) ContainerId creation after work preserving restart is broken

2014-06-19 Thread Tsuyoshi OZAWA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037463#comment-14037463 ] Tsuyoshi OZAWA commented on YARN-2052: -- Updated patch to address the comments by

[jira] [Commented] (YARN-2052) ContainerId creation after work preserving restart is broken

2014-06-19 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037481#comment-14037481 ] Hadoop QA commented on YARN-2052: - {color:green}+1 overall{color}. Here are the results of

[jira] [Created] (YARN-2182) Update ContainerId#toString() to avoid conflicts before and after RM restart

2014-06-19 Thread Tsuyoshi OZAWA (JIRA)
Tsuyoshi OZAWA created YARN-2182: Summary: Update ContainerId#toString() to avoid conflicts before and after RM restart Key: YARN-2182 URL: https://issues.apache.org/jira/browse/YARN-2182 Project:

[jira] [Updated] (YARN-2182) Update ContainerId#toString() to avoid conflicts before and after RM restart

2014-06-19 Thread Tsuyoshi OZAWA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2182?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tsuyoshi OZAWA updated YARN-2182: - Issue Type: Sub-task (was: Improvement) Parent: YARN-556 Update ContainerId#toString()

[jira] [Commented] (YARN-611) Add an AM retry count reset window to YARN RM

2014-06-19 Thread Xuan Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037544#comment-14037544 ] Xuan Gong commented on YARN-611: I am working on this, and will give a proposal soon. Add

[jira] [Assigned] (YARN-611) Add an AM retry count reset window to YARN RM

2014-06-19 Thread Xuan Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xuan Gong reassigned YARN-611: -- Assignee: Xuan Gong Add an AM retry count reset window to YARN RM

[jira] [Commented] (YARN-1964) Create Docker analog of the LinuxContainerExecutor in YARN

2014-06-19 Thread Allen Wittenauer (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1964?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037550#comment-14037550 ] Allen Wittenauer commented on YARN-1964: I took a look at the patch. After some

[jira] [Updated] (YARN-1713) Implement getnewapplication and submitapp as part of RM web service

2014-06-19 Thread Varun Vasudev (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1713?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Varun Vasudev updated YARN-1713: Attachment: apache-yarn-1713.6.patch New patch with code and documentation for submit app.

[jira] [Commented] (YARN-2026) Fair scheduler : Fair share for inactive queues causes unfair allocation in some scenarios

2014-06-19 Thread Ashwin Shankar (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2026?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037647#comment-14037647 ] Ashwin Shankar commented on YARN-2026: -- [~sandyr], Sure. Just to be on the same

[jira] [Commented] (YARN-1341) Recover NMTokens upon nodemanager restart

2014-06-19 Thread Junping Du (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037656#comment-14037656 ] Junping Du commented on YARN-1341: -- bq. I'm not sure I understand what you're requesting.

[jira] [Commented] (YARN-2019) Retrospect on decision of making RM crashed if any exception throw in ZKRMStateStore

2014-06-19 Thread Junping Du (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037664#comment-14037664 ] Junping Du commented on YARN-2019: -- [~kasha], sorry that I ignored your comments as my

[jira] [Commented] (YARN-1713) Implement getnewapplication and submitapp as part of RM web service

2014-06-19 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1713?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037670#comment-14037670 ] Hadoop QA commented on YARN-1713: - {color:green}+1 overall{color}. Here are the results of

[jira] [Updated] (YARN-1039) Add parameter for YARN resource requests to indicate long lived

2014-06-19 Thread Xuan Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xuan Gong updated YARN-1039: Assignee: (was: Xuan Gong) Add parameter for YARN resource requests to indicate long lived

[jira] [Assigned] (YARN-1039) Add parameter for YARN resource requests to indicate long lived

2014-06-19 Thread Craig Welch (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Craig Welch reassigned YARN-1039: - Assignee: Craig Welch Add parameter for YARN resource requests to indicate long lived

[jira] [Commented] (YARN-1365) ApplicationMasterService to allow Register and Unregister of an app that was running before restart

2014-06-19 Thread Jian He (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037791#comment-14037791 ] Jian He commented on YARN-1365: --- Thanks for updating the patch 1.how about

[jira] [Commented] (YARN-941) RM Should have a way to update the tokens it has for a running application

2014-06-19 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/YARN-941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037828#comment-14037828 ] Steve Loughran commented on YARN-941: - [~vanzin], the issue here is that the AMRM token

[jira] [Commented] (YARN-1051) YARN Admission Control/Planner: enhancing the resource allocation model with time.

2014-06-19 Thread Carlo Curino (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1051?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037833#comment-14037833 ] Carlo Curino commented on YARN-1051: We created a branch named YARN-1051 where we are

[jira] [Commented] (YARN-2130) Cleanup: Adding getRMAppManager, getQueueACLsManager, getApplicationACLsManager to RMContext

2014-06-19 Thread Karthik Kambatla (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037838#comment-14037838 ] Karthik Kambatla commented on YARN-2130: Looking even better. Few more comments: #

[jira] [Commented] (YARN-941) RM Should have a way to update the tokens it has for a running application

2014-06-19 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/YARN-941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037850#comment-14037850 ] Marcelo Vanzin commented on YARN-941: - [~ste...@apache.org], thanks for the comments,

[jira] [Commented] (YARN-1341) Recover NMTokens upon nodemanager restart

2014-06-19 Thread Junping Du (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037868#comment-14037868 ] Junping Du commented on YARN-1341: -- bq. Restarts should be rare, and I'd rather not force

[jira] [Commented] (YARN-2052) ContainerId creation after work preserving restart is broken

2014-06-19 Thread Jian He (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037875#comment-14037875 ] Jian He commented on YARN-2052: --- I think the conclusion was to not add any new fields into

[jira] [Commented] (YARN-2026) Fair scheduler : Fair share for inactive queues causes unfair allocation in some scenarios

2014-06-19 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2026?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14038029#comment-14038029 ] Sandy Ryza commented on YARN-2026: -- I think it might be simpler to just: * Create

[jira] [Commented] (YARN-1039) Add parameter for YARN resource requests to indicate long lived

2014-06-19 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1039?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14038041#comment-14038041 ] Steve Loughran commented on YARN-1039: -- marking as depended on by YARN-896. I would

[jira] [Commented] (YARN-2176) CapacityScheduler loops over all running applications rather than actively requesting apps

2014-06-19 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2176?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14038045#comment-14038045 ] Sandy Ryza commented on YARN-2176: -- Can we merge the ActiveUsersManager stuff into an

[jira] [Commented] (YARN-2026) Fair scheduler : Fair share for inactive queues causes unfair allocation in some scenarios

2014-06-19 Thread Ashwin Shankar (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2026?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14038047#comment-14038047 ] Ashwin Shankar commented on YARN-2026: -- [~sandyr], makes sense. I'll post a patch

[jira] [Updated] (YARN-2144) Add logs when preemption occurs

2014-06-19 Thread Wangda Tan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wangda Tan updated YARN-2144: - Attachment: YARN-2144.patch Sorry I forgot attaching patch yesterday, attached now. Add logs when

[jira] [Commented] (YARN-2074) Preemption of AM containers shouldn't count towards AM failures

2014-06-19 Thread Vinod Kumar Vavilapalli (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2074?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14038169#comment-14038169 ] Vinod Kumar Vavilapalli commented on YARN-2074: --- Also

[jira] [Updated] (YARN-2179) Initial cache manager structure and context

2014-06-19 Thread Chris Trezzo (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Trezzo updated YARN-2179: --- Attachment: YARN-2179-trunk-v2.patch Attached is v2 patch to address javac warning for use of a

[jira] [Created] (YARN-2183) Cleaner service for cache manager

2014-06-19 Thread Chris Trezzo (JIRA)
Chris Trezzo created YARN-2183: -- Summary: Cleaner service for cache manager Key: YARN-2183 URL: https://issues.apache.org/jira/browse/YARN-2183 Project: Hadoop YARN Issue Type: Sub-task

[jira] [Commented] (YARN-2144) Add logs when preemption occurs

2014-06-19 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14038185#comment-14038185 ] Hadoop QA commented on YARN-2144: - {color:green}+1 overall{color}. Here are the results of

[jira] [Updated] (YARN-2183) Cleaner service for cache manager

2014-06-19 Thread Chris Trezzo (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2183?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Trezzo updated YARN-2183: --- Attachment: YARN-2183-trunk-v1.patch Attached is a v1 patch based on trunk+YARN-2179+YARN-2180.

[jira] [Commented] (YARN-2179) Initial cache manager structure and context

2014-06-19 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2179?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14038208#comment-14038208 ] Hadoop QA commented on YARN-2179: - {color:red}-1 overall{color}. Here are the results of

[jira] [Commented] (YARN-1039) Add parameter for YARN resource requests to indicate long lived

2014-06-19 Thread Vinod Kumar Vavilapalli (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1039?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14038248#comment-14038248 ] Vinod Kumar Vavilapalli commented on YARN-1039: --- For now, we can start with a

[jira] [Updated] (YARN-2074) Preemption of AM containers shouldn't count towards AM failures

2014-06-19 Thread Jian He (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jian He updated YARN-2074: -- Attachment: YARN-2074.8.patch Thanks Vinod for the review! uploaded a new patch. bq. Not related to the patch,

[jira] [Updated] (YARN-1709) Admission Control: Reservation subsystem

2014-06-19 Thread Subramaniam Venkatraman Krishnan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Subramaniam Venkatraman Krishnan updated YARN-1709: --- Attachment: YARN-1709.patch Admission Control: Reservation

[jira] [Commented] (YARN-2074) Preemption of AM containers shouldn't count towards AM failures

2014-06-19 Thread Jian He (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2074?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14038264#comment-14038264 ] Jian He commented on YARN-2074: --- Seem to find a bug in

[jira] [Commented] (YARN-1709) Admission Control: Reservation subsystem

2014-06-19 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14038271#comment-14038271 ] Hadoop QA commented on YARN-1709: - {color:red}-1 overall{color}. Here are the results of

[jira] [Commented] (YARN-614) Retry attempts automatically for hardware failures or YARN issues and set default app retries to 1

2014-06-19 Thread Xuan Gong (JIRA)
[ https://issues.apache.org/jira/browse/YARN-614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14038297#comment-14038297 ] Xuan Gong commented on YARN-614: [~criccomini] Hey, Chris. Do you have any updates for this

[jira] [Commented] (YARN-2074) Preemption of AM containers shouldn't count towards AM failures

2014-06-19 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2074?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14038300#comment-14038300 ] Hadoop QA commented on YARN-2074: - {color:green}+1 overall{color}. Here are the results of

[jira] [Created] (YARN-2184) ResourceManager may fail due to name node in safe mode

2014-06-19 Thread Jeff Zhang (JIRA)
Jeff Zhang created YARN-2184: Summary: ResourceManager may fail due to name node in safe mode Key: YARN-2184 URL: https://issues.apache.org/jira/browse/YARN-2184 Project: Hadoop YARN Issue Type: