[jira] [Commented] (YARN-2592) Preemption can kill containers to fulfil need of already over-capacity queue.

2014-09-24 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14146391#comment-14146391 ] Jason Lowe commented on YARN-2592: -- +1 for at least allowing users to configure no

[jira] [Commented] (YARN-2592) Preemption can kill containers to fulfil need of already over-capacity queue.

2014-09-24 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14146757#comment-14146757 ] Jason Lowe commented on YARN-2592: -- IMHO users shouldn't be complaining if they are

[jira] [Commented] (YARN-2523) ResourceManager UI showing negative value for Decommissioned Nodes field

2014-09-24 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14146931#comment-14146931 ] Jason Lowe commented on YARN-2523: -- Thanks for updating the patch. I think it looks good

[jira] [Commented] (YARN-2604) Scheduler should consider max-allocation-* in conjunction with the largest node

2014-09-25 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14147899#comment-14147899 ] Jason Lowe commented on YARN-2604: -- How is this different from YARN-56? Also wondering

[jira] [Commented] (YARN-2604) Scheduler should consider max-allocation-* in conjunction with the largest node

2014-09-25 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14147942#comment-14147942 ] Jason Lowe commented on YARN-2604: -- Ah, I see, yes they're a little bit different. They'd

[jira] [Commented] (YARN-2550) TestAMRestart fails intermittently

2014-09-25 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14148376#comment-14148376 ] Jason Lowe commented on YARN-2550: -- This looks like a dup of YARN-2483. TestAMRestart

[jira] [Commented] (YARN-2523) ResourceManager UI showing negative value for Decommissioned Nodes field

2014-09-25 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14148401#comment-14148401 ] Jason Lowe commented on YARN-2523: -- +1 lgtm. Committing this. ResourceManager UI

[jira] [Commented] (YARN-1769) CapacityScheduler: Improve reservations

2014-09-29 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14151706#comment-14151706 ] Jason Lowe commented on YARN-1769: -- +1 lgtm. Committing this. CapacityScheduler:

[jira] [Commented] (YARN-90) NodeManager should identify failed disks becoming good back again

2014-09-29 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-90?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14152307#comment-14152307 ] Jason Lowe commented on YARN-90: Thanks for updating the patch, Varun. bq. I've changed it

[jira] [Commented] (YARN-2387) Resource Manager crashes with NPE due to lack of synchronization

2014-09-30 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14153888#comment-14153888 ] Jason Lowe commented on YARN-2387: -- +1 lgtm. Committing this. Resource Manager crashes

[jira] [Commented] (YARN-2610) Hamlet should close table tags

2014-09-30 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14153921#comment-14153921 ] Jason Lowe commented on YARN-2610: -- Looks like branch-2.6 was just cut as this was going

[jira] [Commented] (YARN-2179) Initial cache manager structure and context

2014-10-01 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2179?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14154851#comment-14154851 ] Jason Lowe commented on YARN-2179: -- The pom versions are incorrect in branch-2 from the

[jira] [Commented] (YARN-2624) Resource Localization fails on a cluster due to existing cache directories

2014-10-02 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2624?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14156824#comment-14156824 ] Jason Lowe commented on YARN-2624: -- Thanks for catching and fixing this, Anubhav! My

[jira] [Commented] (YARN-2414) RM web UI: app page will crash if app is failed before any attempt has been created

2014-10-02 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2414?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14157234#comment-14157234 ] Jason Lowe commented on YARN-2414: -- Ran into this as well. Any update, [~leftnoteasy]?

[jira] [Commented] (YARN-1680) availableResources sent to applicationMaster in heartbeat should exclude blacklistedNodes free memory.

2014-10-03 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1680?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14158680#comment-14158680 ] Jason Lowe commented on YARN-1680: -- bq. if an application had problems with a node and

[jira] [Commented] (YARN-2312) Marking ContainerId#getId as deprecated

2014-10-06 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14160481#comment-14160481 ] Jason Lowe commented on YARN-2312: -- Patch looks good overall. Just two minor nits: The

[jira] [Updated] (YARN-1915) ClientToAMTokenMasterKey should be provided to AM at launch time

2014-10-07 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-1915: - Attachment: YARN-1915v3.patch Refreshed patch to latest trunk. [~vinodkv] could you comment? I fully

[jira] [Updated] (YARN-2331) Distinguish shutdown during supervision vs. shutdown for rolling upgrade

2014-10-07 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2331?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-2331: - Attachment: YARN-2331.patch In the interest of getting something done for this in time for 2.6, here's a

[jira] [Commented] (YARN-1915) ClientToAMTokenMasterKey should be provided to AM at launch time

2014-10-07 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14162091#comment-14162091 ] Jason Lowe commented on YARN-1915: -- Test failure is unrelated, see YARN-2483.

[jira] [Updated] (YARN-2331) Distinguish shutdown during supervision vs. shutdown for rolling upgrade

2014-10-07 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2331?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-2331: - Attachment: YARN-2331v2.patch Updated patch to fix the unit tests. Distinguish shutdown during

[jira] [Commented] (YARN-2414) RM web UI: app page will crash if app is failed before any attempt has been created

2014-10-07 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2414?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14162366#comment-14162366 ] Jason Lowe commented on YARN-2414: -- Thanks, Wangda! Looks good overall. Nit: appMerics

[jira] [Commented] (YARN-2331) Distinguish shutdown during supervision vs. shutdown for rolling upgrade

2014-10-08 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2331?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14163633#comment-14163633 ] Jason Lowe commented on YARN-2331: -- Yes, the patch implements the [Another possible

[jira] [Commented] (YARN-2680) Node shouldn't be listed as RUNNING when NM daemon is stop even when recovery work is enabled.

2014-10-13 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2680?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14169289#comment-14169289 ] Jason Lowe commented on YARN-2680: -- [~djp] could you elaborate more on the use-case?

[jira] [Updated] (YARN-2667) Fix the release audit warning caused by hadoop-yarn-registry

2014-10-13 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-2667: - Target Version/s: 2.6.0 (was: 2.7.0) Affects Version/s: 2.6.0 Assignee: Yi Liu

[jira] [Resolved] (YARN-2665) Audit warning of registry project

2014-10-13 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe resolved YARN-2665. -- Resolution: Duplicate Closing this as a duplicate of YARN-2667, as that already has a patch. Audit

[jira] [Commented] (YARN-2377) Localization exception stack traces are not passed as diagnostic info

2014-10-13 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14169700#comment-14169700 ] Jason Lowe commented on YARN-2377: -- +1 latest patch lgtm. The audit failure is unrelated,

[jira] [Commented] (YARN-2314) ContainerManagementProtocolProxy can create thousands of threads for a large cluster

2014-10-13 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14170082#comment-14170082 ] Jason Lowe commented on YARN-2314: -- The patch effectively restores 0.23 behavior in this

[jira] [Assigned] (YARN-2314) ContainerManagementProtocolProxy can create thousands of threads for a large cluster

2014-10-14 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2314?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe reassigned YARN-2314: Assignee: Jason Lowe bq. Basically the cache doesn't have more functionalities other than just

[jira] [Updated] (YARN-2314) ContainerManagementProtocolProxy can create thousands of threads for a large cluster

2014-10-14 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2314?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-2314: - Attachment: YARN-2314.patch Attaching a patch that allows the existing

[jira] [Commented] (YARN-2314) ContainerManagementProtocolProxy can create thousands of threads for a large cluster

2014-10-14 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14171420#comment-14171420 ] Jason Lowe commented on YARN-2314: -- Yes, the patch sets the default to off since that

[jira] [Commented] (YARN-2314) ContainerManagementProtocolProxy can create thousands of threads for a large cluster

2014-10-14 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14171443#comment-14171443 ] Jason Lowe commented on YARN-2314: -- So Tez will automatically benefit on large clusters

[jira] [Commented] (YARN-2312) Marking ContainerId#getId as deprecated

2014-10-14 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14171470#comment-14171470 ] Jason Lowe commented on YARN-2312: -- Sorry for the late reply. +1 lgtm as well. I noticed

[jira] [Commented] (YARN-2314) ContainerManagementProtocolProxy can create thousands of threads for a large cluster

2014-10-14 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14171475#comment-14171475 ] Jason Lowe commented on YARN-2314: -- The only issue I can think of is the idle timeout

[jira] [Updated] (YARN-2314) ContainerManagementProtocolProxy can create thousands of threads for a large cluster

2014-10-15 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2314?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-2314: - Attachment: YARN-2314v2.patch Updated the patch to deprecate yarn.client.max-nodemanagers-proxies in favor

[jira] [Updated] (YARN-1915) ClientToAMTokenMasterKey should be provided to AM at launch time

2014-10-17 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-1915: - Priority: Blocker (was: Critical) Per offline discussion with [~vinodkv] marking this as a blocker for

[jira] [Updated] (YARN-90) NodeManager should identify failed disks becoming good again

2014-10-21 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-90?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-90: --- Summary: NodeManager should identify failed disks becoming good again (was: NodeManager should identify failed

[jira] [Commented] (YARN-2010) RM can't transition to active if it can't recover an app attempt

2014-10-22 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14180130#comment-14180130 ] Jason Lowe commented on YARN-2010: -- We recently ran into a case where an application tried

[jira] [Resolved] (YARN-2473) YARN never cleans up container directories from a full disk

2014-10-23 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe resolved YARN-2473. -- Resolution: Duplicate Closing as a duplicate of YARN-90. YARN never cleans up container directories

[jira] [Commented] (YARN-2314) ContainerManagementProtocolProxy can create thousands of threads for a large cluster

2014-10-23 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14181809#comment-14181809 ] Jason Lowe commented on YARN-2314: -- bq. IIUC, mayBeCloseProxy can be invoked by

[jira] [Commented] (YARN-24) Nodemanager fails to start if log aggregation enabled and namenode unavailable

2012-08-21 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-24?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13438840#comment-13438840 ] Jason Lowe commented on YARN-24: One thing we could consider is marking the node as UNHEALTHY

[jira] [Updated] (YARN-63) RMNodeImpl is missing valid transitions from the UNHEALTHY state

2012-08-30 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-63?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-63: --- Attachment: YARN-63.patch Straightforward patch to add support for REBOOTING, DECOMMISSION, and EXPIRE events

[jira] [Updated] (YARN-63) RMNodeImpl is missing valid transitions from the UNHEALTHY state

2012-08-30 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-63?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-63: --- Attachment: YARN-63-branch-0.23.patch Patch for branch-0.23 which includes the missed handling of CLEANUP_APP

[jira] [Commented] (YARN-63) RMNodeImpl is missing valid transitions from the UNHEALTHY state

2012-09-05 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-63?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13448866#comment-13448866 ] Jason Lowe commented on YARN-63: bq. I am a bit curious about the DECOMMISSIONED and REBOOTED

[jira] [Created] (YARN-87) NM ResourceLocalizationService can fail to create

2012-09-05 Thread Jason Lowe (JIRA)
Jason Lowe created YARN-87: -- Summary: NM ResourceLocalizationService can fail to create Key: YARN-87 URL: https://issues.apache.org/jira/browse/YARN-87 Project: Hadoop YARN Issue Type: Bug

[jira] [Updated] (YARN-87) NM ResourceLocalizationService does not set permissions of local cache directories

2012-09-05 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-87?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-87: --- Summary: NM ResourceLocalizationService does not set permissions of local cache directories (was: NM

[jira] [Updated] (YARN-87) NM ResourceLocalizationService does not set permissions of local cache directories

2012-09-05 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-87?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-87: --- Attachment: YARN-87.patch Quick patch to fix the permissions of the cache directories when they are created.

[jira] [Created] (YARN-88) DefaultContainerExecutor can fail to set proper permissions

2012-09-06 Thread Jason Lowe (JIRA)
Jason Lowe created YARN-88: -- Summary: DefaultContainerExecutor can fail to set proper permissions Key: YARN-88 URL: https://issues.apache.org/jira/browse/YARN-88 Project: Hadoop YARN Issue Type:

[jira] [Updated] (YARN-93) Diagnostics missing from applications that have finished but failed

2012-09-11 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-93?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-93: --- Attachment: YARN-93.patch Patch to add diagnostics for finished apps. This adds a new

[jira] [Commented] (YARN-106) Nodemanager needs to set permissions of local directories

2012-09-19 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13458761#comment-13458761 ] Jason Lowe commented on YARN-106: - How will public files in the distributed cache work

[jira] [Created] (YARN-112) Race in localization can cause containers to fail

2012-09-19 Thread Jason Lowe (JIRA)
Jason Lowe created YARN-112: --- Summary: Race in localization can cause containers to fail Key: YARN-112 URL: https://issues.apache.org/jira/browse/YARN-112 Project: Hadoop YARN Issue Type: Bug

[jira] [Commented] (YARN-112) Race in localization can cause containers to fail

2012-09-19 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-112?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13458799#comment-13458799 ] Jason Lowe commented on YARN-112: - Here's the localization error that appeared in the

[jira] [Updated] (YARN-88) DefaultContainerExecutor can fail to set proper permissions

2012-09-19 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-88?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-88: --- Attachment: YARN-88.patch Updated the patch to have the container and temp directory use the same permissions

[jira] [Commented] (YARN-106) Nodemanager needs to set permissions of local directories

2012-09-27 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13464702#comment-13464702 ] Jason Lowe commented on YARN-106: - Yep, you're right, since we explicitly remove the

[jira] [Commented] (YARN-22) Using URI for yarn.nodemanager log dirs fails

2012-09-27 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-22?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13464819#comment-13464819 ] Jason Lowe commented on YARN-22: Note that using a URI for the nodemanager local/log dirs

[jira] [Updated] (YARN-106) Nodemanager needs to set permissions of local directories

2012-09-27 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-106: Attachment: YARN-106.patch Failure was due to a bogus URI being used by TestContainerLogsPage that should

[jira] [Updated] (YARN-93) Diagnostics missing from applications that have finished but failed

2012-09-27 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-93?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-93: --- Attachment: YARN-93-branch-0.23.patch Patch for branch-0.23. Verified test case passes and manually verified

[jira] [Updated] (YARN-163) Retrieving container log via NM webapp can hang with multibyte characters in log

2012-10-16 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-163: Attachment: YARN-163.patch Patch to skip bytes instead of characters. Still needs a unit test, but I

[jira] [Created] (YARN-165) RM should point tracking URL to RM web page for app when AM fails

2012-10-17 Thread Jason Lowe (JIRA)
Jason Lowe created YARN-165: --- Summary: RM should point tracking URL to RM web page for app when AM fails Key: YARN-165 URL: https://issues.apache.org/jira/browse/YARN-165 Project: Hadoop YARN

[jira] [Commented] (YARN-171) NodeManager should serve logs directly if log-aggregation is not enabled

2012-10-18 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-171?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13479535#comment-13479535 ] Jason Lowe commented on YARN-171: - It's already doing this for AM logs today off of the RM

[jira] [Commented] (YARN-139) Interrupted Exception within AsyncDispatcher leads to user confusion

2012-10-24 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13483298#comment-13483298 ] Jason Lowe commented on YARN-139: - +1, thanks Vinod! I'll commit this shortly.

[jira] [Updated] (YARN-165) RM should point tracking URL to RM web page for app when AM fails

2012-10-26 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-165: Attachment: YARN-165.patch Patch to change the tracking URL to point to the RM app page when the AM fails.

[jira] [Commented] (YARN-165) RM should point tracking URL to RM web page for app when AM fails

2012-10-31 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13487843#comment-13487843 ] Jason Lowe commented on YARN-165: - Sorry I should have mentioned that I performed some

[jira] [Updated] (YARN-165) RM should point tracking URL to RM web page for app when AM fails

2012-10-31 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-165: Attachment: YARN-165-branch23.patch Patch for branch-0.23. RM should point tracking URL to

[jira] [Updated] (YARN-165) RM should point tracking URL to RM web page for app when AM fails

2012-10-31 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-165: Attachment: YARN-165-branch23.patch Sorry, my bad. I manually tested the fix just like I did for the trunk

[jira] [Assigned] (YARN-201) CapacityScheduler can take a very long time to schedule containers if requests are off cluster

2012-11-07 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe reassigned YARN-201: --- Assignee: Jason Lowe CapacityScheduler can take a very long time to schedule containers if

[jira] [Updated] (YARN-201) CapacityScheduler can take a very long time to schedule containers if requests are off cluster

2012-11-07 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-201: Priority: Critical (was: Major) CapacityScheduler can take a very long time to schedule containers if

[jira] [Updated] (YARN-201) CapacityScheduler can take a very long time to schedule containers if requests are off cluster

2012-11-07 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-201: Attachment: YARN-201.patch Thought a bit about filtering the AM's requests based on what resources are

[jira] [Created] (YARN-206) TestApplicationCleanup.testContainerCleanup occasionally fails

2012-11-07 Thread Jason Lowe (JIRA)
Jason Lowe created YARN-206: --- Summary: TestApplicationCleanup.testContainerCleanup occasionally fails Key: YARN-206 URL: https://issues.apache.org/jira/browse/YARN-206 Project: Hadoop YARN Issue

[jira] [Updated] (YARN-206) TestApplicationCleanup.testContainerCleanup occasionally fails

2012-11-07 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-206: Attachment: YARN-206.patch Patch to assert on the proper values. Also fixed where and how long the tests

[jira] [Commented] (YARN-212) NM state machine ignores an APPLICATION_CONTAINER_FINISHED event when it shouldn't

2012-11-12 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-212?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13495988#comment-13495988 ] Jason Lowe commented on YARN-212: - +1, looks good overall. Minor nit in ContainerImpl.java

[jira] [Commented] (YARN-144) MiniMRYarnCluster launches RM and JHS on default ports

2012-11-13 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13496687#comment-13496687 ] Jason Lowe commented on YARN-144: - +1, thanks for the update Rob.

[jira] [Commented] (YARN-244) Application Master Retries fail due to FileNotFoundException

2012-11-26 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13503832#comment-13503832 ] Jason Lowe commented on YARN-244: - could you provide a bit more detail from the AM logs when

[jira] [Commented] (YARN-243) Job Client doesn't give progress for Application Master Retries

2012-11-26 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13503863#comment-13503863 ] Jason Lowe commented on YARN-243: - I tried replicating this with a sleep job and manually

[jira] [Commented] (YARN-243) Job Client doesn't give progress for Application Master Retries

2012-11-26 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13503896#comment-13503896 ] Jason Lowe commented on YARN-243: - That doesn't sound like something to fix on the client

[jira] [Commented] (YARN-244) Application Master Retries fail due to FileNotFoundException

2012-11-30 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13507474#comment-13507474 ] Jason Lowe commented on YARN-244: - Devaraj, could you confirm if this is a case where the AM

[jira] [Commented] (YARN-257) NM should gracefully handle a full local disk

2012-12-05 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-257?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13510842#comment-13510842 ] Jason Lowe commented on YARN-257: - bq. Before the complete change, would it help if the NM

[jira] [Commented] (YARN-266) RM and JHS Web UIs are blank because AppsBlock is not escaping string properly

2012-12-11 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-266?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13529059#comment-13529059 ] Jason Lowe commented on YARN-266: - +1, will commit shortly. RM and JHS Web

[jira] [Commented] (YARN-205) yarn logs throws nullpointerexception for running application

2013-01-02 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13542209#comment-13542209 ] Jason Lowe commented on YARN-205: - I was not able to reproduce this on a recent 0.23 build.

[jira] [Commented] (YARN-293) Node Manager leaks LocalizerRunner object for every Container

2013-01-02 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13542488#comment-13542488 ] Jason Lowe commented on YARN-293: - +1, lgtm. Node Manager leaks

[jira] [Commented] (YARN-308) Improve documentation about what asks means in AMRMProtocol

2013-01-04 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13544136#comment-13544136 ] Jason Lowe commented on YARN-308: - To clarify, I believe it is a semi-delta protocol. An

[jira] [Commented] (YARN-308) Improve documentation about what asks means in AMRMProtocol

2013-01-04 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13544216#comment-13544216 ] Jason Lowe commented on YARN-308: - bq. Would changing it to be a pure absolute or pure delta

[jira] [Commented] (YARN-308) Improve documentation about what asks means in AMRMProtocol

2013-01-04 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13544263#comment-13544263 ] Jason Lowe commented on YARN-308: - bq. For pure absolute, we would only need to send the

[jira] [Commented] (YARN-324) Provide way to preserve container directories

2013-01-08 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13547191#comment-13547191 ] Jason Lowe commented on YARN-324: - The nodemanager currently supports this via the

[jira] [Updated] (YARN-236) RM should point tracking URL to RM web page when app fails to start

2013-01-16 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-236?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-236: Attachment: YARN-236.patch Patch to have the proxy servlet fallback to the RM's webpage for the app if no

[jira] [Updated] (YARN-227) Application expiration difficult to debug for end-users

2013-01-17 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-227: Attachment: YARN-227.patch Patch to add diagnostics to the expired attempt indicating it timed out. Also

[jira] [Assigned] (YARN-227) Application expiration difficult to debug for end-users

2013-01-17 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe reassigned YARN-227: --- Assignee: Jason Lowe Application expiration difficult to debug for end-users

[jira] [Updated] (YARN-354) WebAppProxyServer exits immediately after startup

2013-01-23 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-354?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-354: Priority: Blocker (was: Critical) Upgrading to Blocker since this should be fixed in the 0.23.6 respin.

[jira] [Commented] (YARN-345) Many InvalidStateTransitonException errors for ApplicationImpl in Node Manager

2013-01-25 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-345?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13563027#comment-13563027 ] Jason Lowe commented on YARN-345: - Devaraj, I see you picked up this ticket. Have you

[jira] [Created] (YARN-361) Web service tests are failing

2013-01-28 Thread Jason Lowe (JIRA)
Jason Lowe created YARN-361: --- Summary: Web service tests are failing Key: YARN-361 URL: https://issues.apache.org/jira/browse/YARN-361 Project: Hadoop YARN Issue Type: Bug Affects Versions:

[jira] [Resolved] (YARN-361) Web service tests are failing

2013-01-28 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-361?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe resolved YARN-361. - Resolution: Duplicate Web service tests are failing -

[jira] [Commented] (YARN-357) App submission should not be synchronized

2013-01-28 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13564688#comment-13564688 ] Jason Lowe commented on YARN-357: - TestRMWebServices failure is likely unrelated, see

[jira] [Commented] (YARN-362) Unexpected extra results when using the task attempt table search

2013-01-29 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13565522#comment-13565522 ] Jason Lowe commented on YARN-362: - Moved to the YARN project since that's where the fix

[jira] [Moved] (YARN-362) Unexpected extra results when using the task attempt table search

2013-01-29 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-362?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe moved MAPREDUCE-4960 to YARN-362: Component/s: (was: webapps) Affects Version/s: (was:

[jira] [Commented] (YARN-363) yarn proxyserver fails to find webapps/proxy directory on startup

2013-01-29 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13565562#comment-13565562 ] Jason Lowe commented on YARN-363: - The problem does not occur on trunk because trunk has the

[jira] [Created] (YARN-364) AggregatedLogDeletionService can take too long to delete logs

2013-01-30 Thread Jason Lowe (JIRA)
Jason Lowe created YARN-364: --- Summary: AggregatedLogDeletionService can take too long to delete logs Key: YARN-364 URL: https://issues.apache.org/jira/browse/YARN-364 Project: Hadoop YARN Issue

[jira] [Created] (YARN-376) Apps that have completed can appear as RUNNING on the NM UI

2013-02-04 Thread Jason Lowe (JIRA)
Jason Lowe created YARN-376: --- Summary: Apps that have completed can appear as RUNNING on the NM UI Key: YARN-376 URL: https://issues.apache.org/jira/browse/YARN-376 Project: Hadoop YARN Issue

[jira] [Commented] (YARN-376) Apps that have completed can appear as RUNNING on the NM UI

2013-02-04 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13570568#comment-13570568 ] Jason Lowe commented on YARN-376: - There appears to be a race condition in the RM's handling

[jira] [Commented] (YARN-236) RM should point tracking URL to RM web page when app fails to start

2013-02-07 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13573546#comment-13573546 ] Jason Lowe commented on YARN-236: - Unfortunately YARN-165 only handled the case where the AM

[jira] [Updated] (YARN-388) testContainerKillOnMemoryOverflow is failing

2013-02-07 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-388: Attachment: YARN-388.patch Patch to make the message pattern being checked a bit more lenient on the memory

[jira] [Commented] (YARN-377) Fix test failure for HADOOP-9252

2013-02-07 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13573722#comment-13573722 ] Jason Lowe commented on YARN-377: - Sorry, didn't see this when I filed YARN-388. Any ETA on

<    1   2   3   4   5   6   7   8   9   10   >