[jira] [Commented] (YARN-1055) Handle app recovery differently for AM failures and RM restart

2013-08-14 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1055?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13740446#comment-13740446 ] Robert Kanter commented on YARN-1055: - Another way of phrasing this: when the action's

[jira] [Commented] (YARN-2131) Add a way to format the RMStateStore

2014-07-18 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14067006#comment-14067006 ] Robert Kanter commented on YARN-2131: - Given that Karthik created YARN-2268 and we

[jira] [Updated] (YARN-2131) Add a way to format the RMStateStore

2014-07-22 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-2131: Attachment: YARN-2131_addendum2.patch Add a way to format the RMStateStore

[jira] [Updated] (YARN-1530) [Umbrella] Store, manage and serve per-framework application-timeline data

2014-09-05 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-1530: Attachment: ATS-Write-Pipeline-Design-Proposal.pdf Thanks Sangjin for posting those notes.

[jira] [Commented] (YARN-2461) Fix PROCFS_USE_SMAPS_BASED_RSS_ENABLED property in YarnConfiguration

2014-09-16 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14136184#comment-14136184 ] Robert Kanter commented on YARN-2461: - LGTM (non-binding) Fix

[jira] [Commented] (YARN-1530) [Umbrella] Store, manage and serve per-framework application-timeline data

2014-09-19 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14141644#comment-14141644 ] Robert Kanter commented on YARN-1530: - I also agree that providing reliability through

[jira] [Assigned] (YARN-2423) TimelineClient should wrap all GET APIs to facilitate Java users

2014-10-10 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter reassigned YARN-2423: --- Assignee: Robert Kanter TimelineClient should wrap all GET APIs to facilitate Java users

[jira] [Updated] (YARN-2423) TimelineClient should wrap all GET APIs to facilitate Java users

2014-10-14 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-2423: Attachment: YARN-2423.patch The patch adds the GET APIs. I modeled them after the get methods in

[jira] [Updated] (YARN-2423) TimelineClient should wrap all GET APIs to facilitate Java users

2014-10-16 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-2423: Attachment: YARN-2423.patch The new patch fixes the javadoc warnings and TestMemoryTimelineStore.

[jira] [Updated] (YARN-2423) TimelineClient should wrap all GET APIs to facilitate Java users

2014-10-16 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-2423: Attachment: (was: YARN-2423.patch) TimelineClient should wrap all GET APIs to facilitate Java

[jira] [Updated] (YARN-2423) TimelineClient should wrap all GET APIs to facilitate Java users

2014-10-16 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-2423: Attachment: YARN-2423.patch Oops, I generated the patch backwards. New patch is correct.

[jira] [Assigned] (YARN-2716) Refactor ZKRMStateStore retry code with Apache Curator

2014-10-20 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter reassigned YARN-2716: --- Assignee: Robert Kanter Refactor ZKRMStateStore retry code with Apache Curator

[jira] [Created] (YARN-1795) Oozie tests are flakey after YARN-713

2014-03-06 Thread Robert Kanter (JIRA)
Robert Kanter created YARN-1795: --- Summary: Oozie tests are flakey after YARN-713 Key: YARN-1795 URL: https://issues.apache.org/jira/browse/YARN-1795 Project: Hadoop YARN Issue Type: Bug

[jira] [Updated] (YARN-1795) Oozie tests are flakey after YARN-713

2014-03-06 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1795?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-1795: Attachment: syslog

[jira] [Commented] (YARN-1795) Oozie tests are flakey after YARN-713

2014-03-06 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13923552#comment-13923552 ] Robert Kanter commented on YARN-1795: - Looking at the printouts I added to the

[jira] [Created] (YARN-1811) Error 500 when clicking the Application Master link in the RM UI while a job is running with RM HA

2014-03-10 Thread Robert Kanter (JIRA)
Robert Kanter created YARN-1811: --- Summary: Error 500 when clicking the Application Master link in the RM UI while a job is running with RM HA Key: YARN-1811 URL: https://issues.apache.org/jira/browse/YARN-1811

[jira] [Updated] (YARN-1811) Error 500 when clicking the Application Master link in the RM UI while a job is running with RM HA

2014-03-10 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-1811: Attachment: YARN-1811.patch The basic problem was that WebAppUtils.getProxyHostAndPort(...) was not

[jira] [Commented] (YARN-1811) Error 500 when clicking the Application Master link in the RM UI while a job is running with RM HA

2014-03-10 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13926259#comment-13926259 ] Robert Kanter commented on YARN-1811: - I didn't write any tests because the problem

[jira] [Commented] (YARN-1811) Error 500 when clicking the Application Master link in the RM UI while a job is running with RM HA

2014-03-10 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13926407#comment-13926407 ] Robert Kanter commented on YARN-1811: - {quote} No matter we start proxy server

[jira] [Updated] (YARN-1811) Error 500 when clicking the Application Master link in the RM UI while a job is running with RM HA

2014-03-10 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-1811: Attachment: YARN-1811.patch New patch fixes the test failure. The problem was when submitting an

[jira] [Updated] (YARN-1822) Revisit AM link being broken for work preserving restart

2014-03-11 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-1822: Summary: Revisit AM link being broken for work preserving restart (was: Revisit AM link being

[jira] [Created] (YARN-1822) Revisit AM link being broken for RM restart

2014-03-11 Thread Robert Kanter (JIRA)
Robert Kanter created YARN-1822: --- Summary: Revisit AM link being broken for RM restart Key: YARN-1822 URL: https://issues.apache.org/jira/browse/YARN-1822 Project: Hadoop YARN Issue Type:

[jira] [Commented] (YARN-1811) RM HA: AM link broken if the AM is on nodes other than RM

2014-03-11 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13930802#comment-13930802 ] Robert Kanter commented on YARN-1811: - {quote} IAC, I think the correct fix is to

[jira] [Updated] (YARN-1811) RM HA: AM link broken if the AM is on nodes other than RM

2014-03-12 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-1811: Attachment: YARN-1811.patch New patch addresses Vinod's comments and adds/updates the tests

[jira] [Updated] (YARN-1795) After YARN-713, using FairScheduler can cause an InvalidToken Exception for NMTokens

2014-03-13 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1795?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-1795: Description: Running the Oozie unit tests against a Hadoop build with YARN-713 causes many of the

[jira] [Commented] (YARN-1811) RM HA: AM link broken if the AM is on nodes other than RM

2014-03-13 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13933825#comment-13933825 ] Robert Kanter commented on YARN-1811: - TestResourceTrackerService is flakey (and fails

[jira] [Commented] (YARN-1811) RM HA: AM link broken if the AM is on nodes other than RM

2014-03-13 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13933873#comment-13933873 ] Robert Kanter commented on YARN-1811: - I'll make those changes and put up a new patch.

[jira] [Updated] (YARN-1811) RM HA: AM link broken if the AM is on nodes other than RM

2014-03-13 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-1811: Attachment: YARN-1811.patch New patch addresses Karthik's comments. RM HA: AM link broken if the

[jira] [Resolved] (YARN-1822) Revisit AM link being broken for work preserving restart

2014-03-13 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter resolved YARN-1822. - Resolution: Invalid YARN-1811 is being done differently, and this is no longer needed Revisit

[jira] [Commented] (YARN-1811) RM HA: AM link broken if the AM is on nodes other than RM

2014-03-13 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13934181#comment-13934181 ] Robert Kanter commented on YARN-1811: - Both failures look untreated and already have

[jira] [Commented] (YARN-1811) RM HA: AM link broken if the AM is on nodes other than RM

2014-03-14 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13935329#comment-13935329 ] Robert Kanter commented on YARN-1811: - Ok, I'll make it {{@Public}} and put back the

[jira] [Commented] (YARN-1795) After YARN-713, using FairScheduler can cause an InvalidToken Exception for NMTokens

2014-03-14 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13935437#comment-13935437 ] Robert Kanter commented on YARN-1795: - Sorry, I didn't explain more specifically what I

[jira] [Commented] (YARN-1811) RM HA: AM link broken if the AM is on nodes other than RM

2014-03-14 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13935528#comment-13935528 ] Robert Kanter commented on YARN-1811: - {quote}If we still do the redirection, where you

[jira] [Updated] (YARN-1811) RM HA: AM link broken if the AM is on nodes other than RM

2014-03-14 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-1811: Attachment: YARN-1811.patch Updated patch based on Vinod's comments RM HA: AM link broken if the

[jira] [Commented] (YARN-1795) After YARN-713, using FairScheduler can cause an InvalidToken Exception for NMTokens

2014-03-14 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13935875#comment-13935875 ] Robert Kanter commented on YARN-1795: - Thanks for point out YARN-1839, [~jianhe]. That

[jira] [Commented] (YARN-1811) RM HA: AM link broken if the AM is on nodes other than RM

2014-03-17 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13938171#comment-13938171 ] Robert Kanter commented on YARN-1811: - [~vinodkv], can you take a look? RM HA: AM

[jira] [Commented] (YARN-1811) RM HA: AM link broken if the AM is on nodes other than RM

2014-03-17 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13938315#comment-13938315 ] Robert Kanter commented on YARN-1811: - TestResourceTrackerService failing is unrelated

[jira] [Resolved] (YARN-1795) After YARN-713, using FairScheduler can cause an InvalidToken Exception for NMTokens

2014-03-17 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1795?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter resolved YARN-1795. - Resolution: Duplicate Assignee: Robert Kanter (was: Karthik Kambatla) I tried the patch

[jira] [Updated] (YARN-1811) RM HA: AM link broken if the AM is on nodes other than RM

2014-03-17 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-1811: Attachment: YARN-1811.patch Updated patch to suppress deprecation warnings in TestAmFilter RM HA:

[jira] [Created] (YARN-1846) TestRM.testNMTokenSentForNormalContainer assumes CapacityScheduler

2014-03-17 Thread Robert Kanter (JIRA)
Robert Kanter created YARN-1846: --- Summary: TestRM.testNMTokenSentForNormalContainer assumes CapacityScheduler Key: YARN-1846 URL: https://issues.apache.org/jira/browse/YARN-1846 Project: Hadoop YARN

[jira] [Updated] (YARN-1846) TestRM.testNMTokenSentForNormalContainer assumes CapacityScheduler

2014-03-17 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1846?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-1846: Attachment: YARN-1846.patch The patch explicitly sets the Scheduler for the test to the

[jira] [Updated] (YARN-1811) RM HA: AM link broken if the AM is on nodes other than RM

2014-03-20 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-1811: Attachment: YARN-1811.patch New patch: rebased to fix conflicts and address Vinod's latest

[jira] [Updated] (YARN-1784) TestContainerAllocation assumes CapacityScheduler

2014-04-08 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1784?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-1784: Attachment: YARN-1784.patch The patch configures the tests to always use the CapcityScheduler

[jira] [Updated] (YARN-1784) TestContainerAllocation assumes CapacityScheduler

2014-04-08 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1784?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-1784: Attachment: YARN-1784.patch New patch uses a setup method. TestContainerAllocation assumes

[jira] [Created] (YARN-2015) HTTPS doesn't work properly for daemons (RM, JHS, NM)

2014-05-01 Thread Robert Kanter (JIRA)
Robert Kanter created YARN-2015: --- Summary: HTTPS doesn't work properly for daemons (RM, JHS, NM) Key: YARN-2015 URL: https://issues.apache.org/jira/browse/YARN-2015 Project: Hadoop YARN Issue

[jira] [Commented] (YARN-2015) HTTPS doesn't work properly for daemons (RM, JHS, NM)

2014-05-01 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13987241#comment-13987241 ] Robert Kanter commented on YARN-2015: - The problem appears to be that while HttpServer2

[jira] [Updated] (YARN-2015) HTTPS doesn't work properly for daemons (RM, JHS, NM)

2014-05-01 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-2015: Affects Version/s: 2.3.0 HTTPS doesn't work properly for daemons (RM, JHS, NM)

[jira] [Resolved] (YARN-2015) HTTPS doesn't work properly for daemons (RM, JHS, NM)

2014-05-01 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter resolved YARN-2015. - Resolution: Invalid Nevermind, this appears to be fixed by YARN-1553 HTTPS doesn't work

[jira] [Assigned] (YARN-2070) DistributedShell publishes unfriendly user information to the timeline server

2014-05-21 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2070?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter reassigned YARN-2070: --- Assignee: Robert Kanter DistributedShell publishes unfriendly user information to the

[jira] [Updated] (YARN-2070) DistributedShell publishes unfriendly user information to the timeline server

2014-05-21 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2070?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-2070: Attachment: YARN-2070.patch DistributedShell publishes unfriendly user information to the timeline

[jira] [Updated] (YARN-1877) ZK store: Add yarn.resourcemanager.zk-state-store.root-node.auth for root node auth

2014-05-29 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-1877: Attachment: YARN-1877.patch ZK store: Add yarn.resourcemanager.zk-state-store.root-node.auth for

[jira] [Commented] (YARN-1877) ZK store: Add yarn.resourcemanager.zk-state-store.root-node.auth for root node auth

2014-05-29 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14013148#comment-14013148 ] Robert Kanter commented on YARN-1877: - Discussed with Karthik offline. There's no need

[jira] [Updated] (YARN-2122) In AllocationFileLoaderService, the reloadThread should be created in init() and started in start()

2014-06-02 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-2122: Attachment: YARN-2122.patch In AllocationFileLoaderService, the reloadThread should be created in

[jira] [Updated] (YARN-2122) In AllocationFileLoaderService, the reloadThread should be created in init() and started in start()

2014-06-06 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-2122: Attachment: YARN-2122.patch The new patch addresses Karthik's comments; except that it checks if

[jira] [Updated] (YARN-2122) In AllocationFileLoaderService, the reloadThread should be created in init() and started in start()

2014-06-06 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-2122: Attachment: YARN-2122.patch Good point. The new patch does that. In AllocationFileLoaderService,

[jira] [Created] (YARN-2187) FairScheduler should have a way of disabling the max AM share check for launching new AMs

2014-06-20 Thread Robert Kanter (JIRA)
Robert Kanter created YARN-2187: --- Summary: FairScheduler should have a way of disabling the max AM share check for launching new AMs Key: YARN-2187 URL: https://issues.apache.org/jira/browse/YARN-2187

[jira] [Updated] (YARN-2187) FairScheduler: Disable max-AM-share check by default

2014-06-20 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2187?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-2187: Attachment: YARN-2187.patch FairScheduler: Disable max-AM-share check by default

[jira] [Created] (YARN-2199) FairScheduler: Allow max-AM-share to be specified in the root queue

2014-06-24 Thread Robert Kanter (JIRA)
Robert Kanter created YARN-2199: --- Summary: FairScheduler: Allow max-AM-share to be specified in the root queue Key: YARN-2199 URL: https://issues.apache.org/jira/browse/YARN-2199 Project: Hadoop YARN

[jira] [Updated] (YARN-2199) FairScheduler: Allow max-AM-share to be specified in the root queue

2014-06-24 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-2199: Attachment: YARN-2199.patch FairScheduler: Allow max-AM-share to be specified in the root queue

[jira] [Created] (YARN-2204) TestAMRestart#testAMRestartWithExistingContainers assumes CapacityScheduler

2014-06-24 Thread Robert Kanter (JIRA)
Robert Kanter created YARN-2204: --- Summary: TestAMRestart#testAMRestartWithExistingContainers assumes CapacityScheduler Key: YARN-2204 URL: https://issues.apache.org/jira/browse/YARN-2204 Project:

[jira] [Updated] (YARN-2204) TestAMRestart#testAMRestartWithExistingContainers assumes CapacityScheduler

2014-06-24 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-2204: Attachment: YARN-2204.patch The patch simply sets the scheduler to the CapacityScheduler, just like

[jira] [Updated] (YARN-2204) TestAMRestart#testAMRestartWithExistingContainers assumes CapacityScheduler

2014-06-25 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-2204: Attachment: YARN-2204_addendum.patch Makes sense. I've attached an addendum patch that's scheduler

[jira] [Updated] (YARN-2204) TestAMRestart#testAMRestartWithExistingContainers assumes CapacityScheduler

2014-06-26 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-2204: Attachment: YARN-2204_addendum.patch I re-looked at this and saw that I could do this much simpler

[jira] [Created] (YARN-2241) Show nicer messages when ZNodes already exist in ZKRMStateStore on startup

2014-07-01 Thread Robert Kanter (JIRA)
Robert Kanter created YARN-2241: --- Summary: Show nicer messages when ZNodes already exist in ZKRMStateStore on startup Key: YARN-2241 URL: https://issues.apache.org/jira/browse/YARN-2241 Project: Hadoop

[jira] [Updated] (YARN-2241) Show nicer messages when ZNodes already exist in ZKRMStateStore on startup

2014-07-01 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-2241: Component/s: resourcemanager Show nicer messages when ZNodes already exist in ZKRMStateStore on

[jira] [Updated] (YARN-2241) Show nicer messages when ZNodes already exist in ZKRMStateStore on startup

2014-07-01 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-2241: Attachment: YARN-2241.patch The Exception catching was simply in the wrong place; I moved it to the

[jira] [Updated] (YARN-2131) Add a way to nuke the RMStateStore

2014-07-01 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-2131: Attachment: YARN-2131.patch The patch adds a {{deleteStore()}} method to RMStateStore and

[jira] [Updated] (YARN-2241) ZKRMStateStore: On startup, show nicer messages when znodes already exist

2014-07-01 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-2241: Attachment: YARN-2241.patch You're right, it doesn't fail without the fix; I must have checked it

[jira] [Updated] (YARN-2131) Add a way to nuke the RMStateStore

2014-07-03 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-2131: Attachment: YARN-2131.patch Good point. I've uploaded a new patch that adds documentation Add a

[jira] [Assigned] (YARN-1524) Make aggregated logs of completed containers available via REST API

2014-07-08 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter reassigned YARN-1524: --- Assignee: Robert Kanter Make aggregated logs of completed containers available via REST API

[jira] [Commented] (YARN-2131) Add a way to format the RMStateStore

2014-07-10 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14057893#comment-14057893 ] Robert Kanter commented on YARN-2131: - Makes sense to me. I'll do an addendum patch to

[jira] [Updated] (YARN-2131) Add a way to format the RMStateStore

2014-07-10 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-2131: Attachment: YARN-2131_addendum.patch The addendum patch renames the command. However, I was

[jira] [Commented] (YARN-2131) Add a way to format the RMStateStore

2014-07-11 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14059320#comment-14059320 ] Robert Kanter commented on YARN-2131: - Are you sure that the NameNode uses a lock file

[jira] [Updated] (YARN-1245) org.apache.hadoop.yarn.server.resourcemanager.TestRMRestart.testRMRestart times out

2013-09-27 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-1245: Attachment: YARN-1245.patch

[jira] [Created] (YARN-1245) org.apache.hadoop.yarn.server.resourcemanager.TestRMRestart.testRMRestart times out

2013-09-27 Thread Robert Kanter (JIRA)
Robert Kanter created YARN-1245: --- Summary: org.apache.hadoop.yarn.server.resourcemanager.TestRMRestart.testRMRestart times out Key: YARN-1245 URL: https://issues.apache.org/jira/browse/YARN-1245

[jira] [Updated] (YARN-1245) org.apache.hadoop.yarn.server.resourcemanager.TestRMRestart.testRMRestart times out

2013-09-27 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-1245: Assignee: Robert Kanter

[jira] [Assigned] (YARN-1259) In Fair Scheduler web UI, queue num pending and num active apps switched

2013-10-14 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1259?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter reassigned YARN-1259: --- Assignee: Robert Kanter In Fair Scheduler web UI, queue num pending and num active apps

[jira] [Updated] (YARN-1259) In Fair Scheduler web UI, queue num pending and num active apps switched

2013-10-14 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1259?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-1259: Attachment: YARN-1259.patch I didn't write any tests because its a trivial change, but I can if

[jira] [Commented] (YARN-1390) Add applicationSource to ApplicationSubmissionContext and RMApp

2013-11-08 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13817586#comment-13817586 ] Robert Kanter commented on YARN-1390: - Ultimately, what we want is a way to tag jobs in

[jira] [Commented] (YARN-1399) Allow users to annotate an application with multiple tags

2013-12-04 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13839211#comment-13839211 ] Robert Kanter commented on YARN-1399: - {quote} I am tempted to accept all unicode

[jira] [Commented] (YARN-1399) Allow users to annotate an application with multiple tags

2013-12-30 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13859053#comment-13859053 ] Robert Kanter commented on YARN-1399: - {quote} Having GUID in the workflow ID to

[jira] [Updated] (YARN-1490) RM should optionally not kill all containers when an ApplicationMaster exits

2014-02-07 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1490?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-1490: Attachment: org.apache.oozie.service.TestRecoveryService_thread-dump.txt As reported in the

[jira] [Updated] (YARN-1731) ResourceManager should record killed ApplicationMasters for History

2014-02-13 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-1731: Attachment: YARN-1731.patch I’ve attached a preliminary version of the patch. Once we all agree on

[jira] [Created] (YARN-1731) ResourceManager should record killed ApplicationMasters for History

2014-02-13 Thread Robert Kanter (JIRA)
Robert Kanter created YARN-1731: --- Summary: ResourceManager should record killed ApplicationMasters for History Key: YARN-1731 URL: https://issues.apache.org/jira/browse/YARN-1731 Project: Hadoop YARN

[jira] [Updated] (YARN-1731) ResourceManager should record killed ApplicationMasters for History

2014-02-19 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-1731: Attachment: YARN-1731.patch Updated patch ResourceManager should record killed ApplicationMasters

[jira] [Commented] (YARN-1490) RM should optionally not kill all containers when an ApplicationMaster exits

2014-02-24 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1490?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13910615#comment-13910615 ] Robert Kanter commented on YARN-1490: - By the way, the issue I mentioned a few comments

[jira] [Created] (YARN-2766) [JDK 8] TestApplicationHistoryClientService fails

2014-10-28 Thread Robert Kanter (JIRA)
Robert Kanter created YARN-2766: --- Summary: [JDK 8] TestApplicationHistoryClientService fails Key: YARN-2766 URL: https://issues.apache.org/jira/browse/YARN-2766 Project: Hadoop YARN Issue

[jira] [Updated] (YARN-2766) [JDK 8] TestApplicationHistoryClientService fails

2014-10-28 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-2766: Attachment: YARN-2766.patch The patch creates a consistent ordering by sorting the Collection before

[jira] [Updated] (YARN-2766) [JDK 8] TestApplicationHistoryClientService fails

2014-10-29 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-2766: Attachment: YARN-2766.patch That makes sense. I wasn't able to trace the code back to

[jira] [Updated] (YARN-2766) [JDK 8] TestApplicationHistoryClientService fails

2014-10-29 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-2766: Attachment: YARN-2766.patch New patch fixes findbugs warnings [JDK 8]

[jira] [Assigned] (YARN-2604) Scheduler should consider max-allocation-* in conjunction with the largest node

2014-10-29 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter reassigned YARN-2604: --- Assignee: Robert Kanter (was: Karthik Kambatla) Scheduler should consider max-allocation-*

[jira] [Updated] (YARN-2766) ApplicationHistoryManager is expected to return a sorted list of apps/attempts/containers

2014-10-30 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-2766: Attachment: YARN-2766.patch Makes sense. The new patch updates

[jira] [Updated] (YARN-2604) Scheduler should consider max-allocation-* in conjunction with the largest node

2014-10-31 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-2604: Attachment: YARN-2604.patch Scheduler should consider max-allocation-* in conjunction with the

[jira] [Updated] (YARN-2604) Scheduler should consider max-allocation-* in conjunction with the largest node

2014-11-03 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-2604: Attachment: YARN-2604.patch The new patch fixes the test failures: - TestContainerAllocation: Minor

[jira] [Updated] (YARN-2604) Scheduler should consider max-allocation-* in conjunction with the largest node

2014-11-03 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-2604: Attachment: YARN-2604.patch The findbugs warning had to do with the lock I added. During

[jira] [Updated] (YARN-2423) TimelineClient should wrap all GET APIs to facilitate Java users

2014-11-10 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-2423: Attachment: YARN-2423.patch I rebased the new patch and fixed the test failures. For the bug I

[jira] [Updated] (YARN-2604) Scheduler should consider max-allocation-* in conjunction with the largest node

2014-11-20 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-2604: Attachment: YARN-2604.patch The new patch addresses Karthik's 2nd suggestion. That actually made it

[jira] [Updated] (YARN-2604) Scheduler should consider max-allocation-* in conjunction with the largest node

2014-11-20 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kanter updated YARN-2604: Attachment: YARN-2604.patch The new patch adds similar code for scores. I made some other minor

[jira] [Commented] (YARN-2604) Scheduler should consider max-allocation-* in conjunction with the largest node

2014-11-21 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14221169#comment-14221169 ] Robert Kanter commented on YARN-2604: - My last comment should have said vcores, not

[jira] [Commented] (YARN-2461) Fix PROCFS_USE_SMAPS_BASED_RSS_ENABLED property in YarnConfiguration

2014-12-05 Thread Robert Kanter (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14236026#comment-14236026 ] Robert Kanter commented on YARN-2461: - +1 Fix PROCFS_USE_SMAPS_BASED_RSS_ENABLED

  1   2   3   4   5   6   7   8   9   10   >