[jira] [Commented] (YARN-377) Fix test failure for HADOOP-9252

2013-02-07 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13573872#comment-13573872 ] Jason Lowe commented on YARN-377: - If the total fix is coming real soon, then no worries

[jira] [Commented] (YARN-362) Unexpected extra results when using the task attempt table search

2013-02-08 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13574555#comment-13574555 ] Jason Lowe commented on YARN-362: - Thanks for separating out the patch into YARN and

[jira] [Updated] (YARN-362) Unexpected extra results when using webUI table search

2013-02-08 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-362?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-362: Summary: Unexpected extra results when using webUI table search (was: Unexpected extra results when using

[jira] [Created] (YARN-400) RM can return null application resource usage report leading to NPE in client

2013-02-12 Thread Jason Lowe (JIRA)
Jason Lowe created YARN-400: --- Summary: RM can return null application resource usage report leading to NPE in client Key: YARN-400 URL: https://issues.apache.org/jira/browse/YARN-400 Project: Hadoop YARN

[jira] [Updated] (YARN-400) RM can return null application resource usage report leading to NPE in client

2013-02-12 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-400?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-400: Attachment: YARN-400.patch Patch to ensure application resource usage report is always provided when

[jira] [Updated] (YARN-400) RM can return null application resource usage report leading to NPE in client

2013-02-20 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-400?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-400: Attachment: YARN-400-branch-0.23.patch Thanks for the review, Tom. Here's the patch for branch-0.23.

[jira] [Resolved] (YARN-413) With log aggregation on, nodemanager dies on startup if it can't connect to HDFS

2013-02-21 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe resolved YARN-413. - Resolution: Duplicate With log aggregation on, nodemanager dies on startup if it can't connect to

[jira] [Updated] (YARN-376) Apps that have completed can appear as RUNNING on the NM UI

2013-02-22 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-376: Attachment: YARN-376.patch Patch that adds a new interface to RMNode so ResourceTrackingService can

[jira] [Updated] (YARN-376) Apps that have completed can appear as RUNNING on the NM UI

2013-02-22 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-376: Priority: Blocker (was: Major) Increasing to Blocker as this race can lead to lost logs since NM will not

[jira] [Updated] (YARN-376) Apps that have completed can appear as RUNNING on the NM UI

2013-02-22 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-376: Attachment: YARN-376.patch Updated patch so the test has a timeout. Apps that have

[jira] [Created] (YARN-426) Failure to download a public resource on a node prevents further downloads of the resource from that node

2013-02-25 Thread Jason Lowe (JIRA)
Jason Lowe created YARN-426: --- Summary: Failure to download a public resource on a node prevents further downloads of the resource from that node Key: YARN-426 URL: https://issues.apache.org/jira/browse/YARN-426

[jira] [Resolved] (YARN-428) YARNClientImpl logging too aggressively

2013-02-26 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-428?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe resolved YARN-428. - Resolution: Duplicate YARNClientImpl logging too aggressively

[jira] [Updated] (YARN-426) Failure to download a public resource on a node prevents further downloads of the resource from that node

2013-02-26 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-426?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-426: Attachment: YARN-426.patch Patch to ensure all queued attempts for a public resource are notified of a

[jira] [Updated] (YARN-376) Apps that have completed can appear as RUNNING on the NM UI

2013-02-27 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-376: Attachment: YARN-376.patch Thanks for the review, Sidd. I originally had it update the heartbeat since the

[jira] [Commented] (YARN-376) Apps that have completed can appear as RUNNING on the NM UI

2013-02-27 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13589136#comment-13589136 ] Jason Lowe commented on YARN-376: - The eclipse failure appears to be unrelated, as it builds

[jira] [Updated] (YARN-269) Resource Manager not logging the health_check_script result when taking it out

2013-02-28 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-269: Attachment: YARN-269.patch Thanks for the review, Kihwal. Here's an updated patch.

[jira] [Created] (YARN-445) Ability to signal containers

2013-03-02 Thread Jason Lowe (JIRA)
Jason Lowe created YARN-445: --- Summary: Ability to signal containers Key: YARN-445 URL: https://issues.apache.org/jira/browse/YARN-445 Project: Hadoop YARN Issue Type: New Feature

[jira] [Commented] (YARN-446) Container killed before hprof dumps profile.out

2013-03-04 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-446?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13592246#comment-13592246 ] Jason Lowe commented on YARN-446: - IMO the AM should always allow the task attempt time to

[jira] [Commented] (YARN-345) Many InvalidStateTransitonException errors for ApplicationImpl in Node Manager

2013-03-04 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-345?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13592654#comment-13592654 ] Jason Lowe commented on YARN-345: - +1, lgtm. Many

[jira] [Updated] (YARN-410) New lines in diagnostics for a failed app on the per-application page make it hard to read

2013-03-06 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-410?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-410: Fix Version/s: 0.23.7 Thanks, Omkar. I pulled this into branch-0.23. New lines in

[jira] [Created] (YARN-455) NM warns about stopping an unknown container under normal circumstances

2013-03-07 Thread Jason Lowe (JIRA)
Jason Lowe created YARN-455: --- Summary: NM warns about stopping an unknown container under normal circumstances Key: YARN-455 URL: https://issues.apache.org/jira/browse/YARN-455 Project: Hadoop YARN

[jira] [Commented] (YARN-463) Show explicitly excluded nodes on the UI

2013-03-08 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13597529#comment-13597529 ] Jason Lowe commented on YARN-463: - On a similar note, should we also be listing nodes that

[jira] [Created] (YARN-476) ProcfsBasedProcessTree info message confuses users

2013-03-13 Thread Jason Lowe (JIRA)
Jason Lowe created YARN-476: --- Summary: ProcfsBasedProcessTree info message confuses users Key: YARN-476 URL: https://issues.apache.org/jira/browse/YARN-476 Project: Hadoop YARN Issue Type: Bug

[jira] [Commented] (YARN-472) MR app master deletes staging dir when sent a reboot command from the RM

2013-03-18 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-472?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13605953#comment-13605953 ] Jason Lowe commented on YARN-472: - Another cause for the AM to receive a reboot command from

[jira] [Commented] (YARN-472) MR app master deletes staging dir when sent a reboot command from the RM

2013-03-19 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-472?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13606333#comment-13606333 ] Jason Lowe commented on YARN-472: - {{Runtime.halt}} would be one brutally efficient way to

[jira] [Created] (YARN-512) Log aggregation root directory check is more expensive than it needs to be

2013-03-28 Thread Jason Lowe (JIRA)
Jason Lowe created YARN-512: --- Summary: Log aggregation root directory check is more expensive than it needs to be Key: YARN-512 URL: https://issues.apache.org/jira/browse/YARN-512 Project: Hadoop YARN

[jira] [Commented] (YARN-515) Node Manager not getting the master key

2013-03-29 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-515?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13617665#comment-13617665 ] Jason Lowe commented on YARN-515: - +1 Node Manager not getting the master

[jira] [Updated] (YARN-548) Cover package org.apache.hadoop.yarn with unit tests

2013-04-05 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-548: Issue Type: Sub-task (was: Test) Parent: YARN-526 Cover package org.apache.hadoop.yarn with

[jira] [Moved] (YARN-548) Cover package org.apache.hadoop.yarn with unit tests

2013-04-05 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe moved HDFS-4528 to YARN-548: --- Affects Version/s: (was: 0.23.6) (was: 2.0.3-alpha)

[jira] [Updated] (YARN-548) Add tests for YarnUncaughtExceptionHandler

2013-04-05 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-548: Summary: Add tests for YarnUncaughtExceptionHandler (was: Cover package org.apache.hadoop.yarn with unit

[jira] [Assigned] (YARN-548) Add tests for YarnUncaughtExceptionHandler

2013-04-05 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe reassigned YARN-548: --- Assignee: Vadim Bondarev Add tests for YarnUncaughtExceptionHandler

[jira] [Commented] (YARN-445) Ability to signal containers

2013-04-15 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13631761#comment-13631761 ] Jason Lowe commented on YARN-445: - Yes, it's an enhancement request to the NM API. I filed

[jira] [Updated] (YARN-476) ProcfsBasedProcessTree info message confuses users

2013-04-18 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-476?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-476: Fix Version/s: 0.23.8 Thanks, Sandy. I also committed this to branch-0.23.

[jira] [Updated] (YARN-71) Ensure/confirm that the NodeManager cleans up local-dirs on restart

2013-04-18 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-71?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-71: --- Fix Version/s: 0.23.8 Thanks, Xuan. I committed this to branch-0.23 as well. Ensure/confirm

[jira] [Commented] (YARN-363) yarn proxyserver fails to find webapps/proxy directory on startup

2013-04-24 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13640835#comment-13640835 ] Jason Lowe commented on YARN-363: - By default it runs as part of the ResourceManager

[jira] [Created] (YARN-661) NM fails to cleanup local directories for users

2013-05-09 Thread Jason Lowe (JIRA)
Jason Lowe created YARN-661: --- Summary: NM fails to cleanup local directories for users Key: YARN-661 URL: https://issues.apache.org/jira/browse/YARN-661 Project: Hadoop YARN Issue Type: Bug

[jira] [Commented] (YARN-661) NM fails to cleanup local directories for users

2013-05-09 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-661?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13653198#comment-13653198 ] Jason Lowe commented on YARN-661: - Sample log of the failure: {noformat} 2013-05-09

[jira] [Created] (YARN-713) ResourceManager can exit unexpectedly if DNS is unavailable

2013-05-21 Thread Jason Lowe (JIRA)
Jason Lowe created YARN-713: --- Summary: ResourceManager can exit unexpectedly if DNS is unavailable Key: YARN-713 URL: https://issues.apache.org/jira/browse/YARN-713 Project: Hadoop YARN Issue

[jira] [Commented] (YARN-1704) Review LICENSE and NOTICE to reflect new levelDB releated libraries being used

2014-02-27 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1704?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13915040#comment-13915040 ] Jason Lowe commented on YARN-1704: -- Snappy and leveldb are included in leveldbjni-all.

[jira] [Commented] (YARN-1704) Review LICENSE and NOTICE to reflect new levelDB releated libraries being used

2014-02-27 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1704?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13915111#comment-13915111 ] Jason Lowe commented on YARN-1704: -- The jar contains binaries for: - linux32 - linux64 -

[jira] [Commented] (YARN-1704) Review LICENSE and NOTICE to reflect new levelDB releated libraries being used

2014-02-27 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1704?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13915119#comment-13915119 ] Jason Lowe commented on YARN-1704: -- bq. The license for snappy mentions test resources

[jira] [Commented] (YARN-1704) Review LICENSE and NOTICE to reflect new levelDB releated libraries being used

2014-02-28 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1704?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13916578#comment-13916578 ] Jason Lowe commented on YARN-1704: -- +1 lgtm. Review LICENSE and NOTICE to reflect new

[jira] [Commented] (YARN-1771) many getFileStatus calls made from node manager for localizing a public distributed cache resource

2014-03-03 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1771?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13918112#comment-13918112 ] Jason Lowe commented on YARN-1771: -- Here's a thought to possibly avoid checking each

[jira] [Commented] (YARN-1771) many getFileStatus calls made from node manager for localizing a public distributed cache resource

2014-03-03 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1771?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13918301#comment-13918301 ] Jason Lowe commented on YARN-1771: -- Yes, it would be a weaker condition check, but I'm

[jira] [Resolved] (YARN-1777) Nodemanager fails to detect Full disk and try to launch container

2014-03-03 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe resolved YARN-1777. -- Resolution: Duplicate This is a duplicate of YARN-257. Nodemanager fails to detect Full disk and try

[jira] [Commented] (YARN-1771) many getFileStatus calls made from node manager for localizing a public distributed cache resource

2014-03-03 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1771?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13918565#comment-13918565 ] Jason Lowe commented on YARN-1771: -- Today the public cache localizes as the NM user, so

[jira] [Commented] (YARN-1771) many getFileStatus calls made from node manager for localizing a public distributed cache resource

2014-03-03 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1771?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13918649#comment-13918649 ] Jason Lowe commented on YARN-1771: -- Agreed, a nobody account would make the check

[jira] [Commented] (YARN-1445) Separate FINISHING and FINISHED state in YarnApplicationState

2014-03-04 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13919440#comment-13919440 ] Jason Lowe commented on YARN-1445: -- bq. Then, it is possible that AM is unregistered, and

[jira] [Commented] (YARN-1781) NM should allow users to specify max disk utilization for local disks

2014-03-04 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1781?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13919750#comment-13919750 ] Jason Lowe commented on YARN-1781: -- Note that we may need to do more than just mark disks

[jira] [Updated] (YARN-1783) yarn application does not make any progress even when no other application is running when RM is being restarted in the background

2014-03-04 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1783?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-1783: - Target Version/s: 2.4.0 yarn application does not make any progress even when no other application is

[jira] [Updated] (YARN-1338) Recover localized resource cache state upon nodemanager restart

2014-03-05 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1338?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-1338: - Attachment: YARN-1338.patch Patch to recover the localized resource cache state when NM recovery is

[jira] [Resolved] (YARN-1791) Distributed cache issue using YARN

2014-03-06 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe resolved YARN-1791. -- Resolution: Invalid The distributed cache only preserves the basename of files and links them into the

[jira] [Updated] (YARN-1341) Recover NMTokens upon nodemanager restart

2014-03-06 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1341?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-1341: - Attachment: YARN-1341.patch Patch to enable the recovery of NMTokens. Like YARN-1338 it uses leveldb as

[jira] [Updated] (YARN-1341) Recover NMTokens upon nodemanager restart

2014-03-06 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1341?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-1341: - Attachment: YARN-1341v2.patch Revised patch without the addition of the state store to the NM context

[jira] [Updated] (YARN-1342) Recover container tokens upon nodemanager restart

2014-03-06 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1342?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-1342: - Attachment: YARN-1342.patch Patch to recover container tokens after a restart. This is very similar to

[jira] [Commented] (YARN-1800) YARN NodeManager with java.util.concurrent.RejectedExecutionException

2014-03-07 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13923926#comment-13923926 ] Jason Lowe commented on YARN-1800: -- This is the aftermath of an earlier error that

[jira] [Created] (YARN-1801) NPE in public localizer

2014-03-07 Thread Jason Lowe (JIRA)
Jason Lowe created YARN-1801: Summary: NPE in public localizer Key: YARN-1801 URL: https://issues.apache.org/jira/browse/YARN-1801 Project: Hadoop YARN Issue Type: Bug Components:

[jira] [Commented] (YARN-1800) YARN NodeManager with java.util.concurrent.RejectedExecutionException

2014-03-07 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13923929#comment-13923929 ] Jason Lowe commented on YARN-1800: -- Filed YARN-1801 to track NPE in public localizer.

[jira] [Updated] (YARN-1800) YARN NodeManager with java.util.concurrent.RejectedExecutionException

2014-03-07 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-1800: - Target Version/s: 2.4.0 YARN NodeManager with java.util.concurrent.RejectedExecutionException

[jira] [Commented] (YARN-1800) YARN NodeManager with java.util.concurrent.RejectedExecutionException

2014-03-07 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13924169#comment-13924169 ] Jason Lowe commented on YARN-1800: -- Do we really want to catch any kind of Throwable and

[jira] [Commented] (YARN-1789) ApplicationSummary does not escape newlines in the app name

2014-03-10 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1789?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13925779#comment-13925779 ] Jason Lowe commented on YARN-1789: -- RMAppManager.ApplicationSummary is logging

[jira] [Commented] (YARN-1810) YARN RM Webapp Application Filter Issue

2014-03-10 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1810?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13926182#comment-13926182 ] Jason Lowe commented on YARN-1810: -- Had you used a search filter in the past? I'm

[jira] [Updated] (YARN-1339) Recover DeletionService state upon nodemanager restart

2014-03-10 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1339?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-1339: - Attachment: YARN-1339.patch Patch to allow recovery of deletion service state after an NM restart. Like

[jira] [Resolved] (YARN-1820) I can not run mapreduce!!

2014-03-11 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe resolved YARN-1820. -- Resolution: Invalid Please use [mailto:u...@hadoop.apache.org] to ask for help with getting a cluster

[jira] [Commented] (YARN-1818) When mapreduce.jobhistory.intermediate-done-dir isn't writable, application fails with generic error

2014-03-11 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1818?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13930397#comment-13930397 ] Jason Lowe commented on YARN-1818: -- Moving this to the MAPREDUCE project since the

[jira] [Commented] (YARN-1810) YARN RM Webapp Application Filter Issue

2014-03-11 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1810?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13930661#comment-13930661 ] Jason Lowe commented on YARN-1810: -- Does it also occur if you simply click anywhere on the

[jira] [Commented] (YARN-1789) ApplicationSummary does not escape newlines in the app name

2014-03-12 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1789?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13932541#comment-13932541 ] Jason Lowe commented on YARN-1789: -- +1 lgtm. Committing this. ApplicationSummary does

[jira] [Created] (YARN-1844) yarn.log.server.url should have a default value

2014-03-17 Thread Jason Lowe (JIRA)
Jason Lowe created YARN-1844: Summary: yarn.log.server.url should have a default value Key: YARN-1844 URL: https://issues.apache.org/jira/browse/YARN-1844 Project: Hadoop YARN Issue Type:

[jira] [Commented] (YARN-1842) InvalidApplicationMasterRequestException raised during AM-requested shutdown

2014-03-17 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13937809#comment-13937809 ] Jason Lowe commented on YARN-1842: -- Wondering if this is a case where the NM or AM somehow

[jira] [Updated] (YARN-500) ResourceManager webapp is using next port if configured port is already in use

2014-03-17 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-500?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-500: Fix Version/s: 0.23.11 Thanks, Kenji! I committed this to branch-0.23 as well. ResourceManager webapp is

[jira] [Commented] (YARN-1206) AM container log link broken on NM web page if log-aggregation is disabled.

2014-03-17 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13938468#comment-13938468 ] Jason Lowe commented on YARN-1206: -- Note I'm not sure this problem is localized to just

[jira] [Commented] (YARN-1847) YARN application always exits with FAILED state

2014-03-18 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13939271#comment-13939271 ] Jason Lowe commented on YARN-1847: -- The expired transition occurs when the application

[jira] [Commented] (YARN-1847) YARN application always exits with FAILED state

2014-03-18 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13939348#comment-13939348 ] Jason Lowe commented on YARN-1847: -- bq. No, its not expiring. If it's not expiring, then

[jira] [Resolved] (YARN-1847) YARN application always exits with FAILED state

2014-03-18 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1847?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe resolved YARN-1847. -- Resolution: Invalid YARN application always exits with FAILED state

[jira] [Commented] (YARN-1847) YARN application always exits with FAILED state

2014-03-18 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13939772#comment-13939772 ] Jason Lowe commented on YARN-1847: -- bq. Again, the real issue is the API which does not

[jira] [Commented] (YARN-1888) Not add NodeManager to inactiveRMNodes when reboot NodeManager which have different port

2014-04-01 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13956690#comment-13956690 ] Jason Lowe commented on YARN-1888: -- I agree with [~kasha] on this. A nodemanager coming

[jira] [Resolved] (YARN-1862) Change mapreduce.jobhistory.done-dir by command line arg seems not working

2014-04-01 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe resolved YARN-1862. -- Resolution: Invalid This is a question best asked on the [user@ mailing

[jira] [Commented] (YARN-1901) All tasks restart during RM failover on Hive

2014-04-04 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1901?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13960026#comment-13960026 ] Jason Lowe commented on YARN-1901: -- This appears to be a duplicate of HIVE-6638. As

[jira] [Commented] (YARN-1769) CapacityScheduler: Improve reservations

2014-04-04 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13960541#comment-13960541 ] Jason Lowe commented on YARN-1769: -- The patch no longer applies cleanly after YARN-1512.

[jira] [Updated] (YARN-1757) Auxiliary service support for nodemanager recovery

2014-04-07 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-1757: - Attachment: YARN-1757-v2.patch Thanks for the review, Karthik! bq. Nit: YarnConfiguration: We might want

[jira] [Commented] (YARN-1769) CapacityScheduler: Improve reservations

2014-04-07 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13962325#comment-13962325 ] Jason Lowe commented on YARN-1769: -- Thanks, Tom. Latest changes look good to me. Waiting

[jira] [Commented] (YARN-1913) Cluster logjam when all resources are consumed by AM

2014-04-08 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13963143#comment-13963143 ] Jason Lowe commented on YARN-1913: -- Which scheduler are you using? The CapacityScheduler

[jira] [Updated] (YARN-1341) Recover NMTokens upon nodemanager restart

2014-04-08 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1341?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-1341: - Attachment: YARN-1341v3.patch Updating patch after YARN-1757 was committed. Recover NMTokens upon

[jira] [Updated] (YARN-1342) Recover container tokens upon nodemanager restart

2014-04-08 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1342?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-1342: - Attachment: YARN-1342v2.patch Updating patch after YARN-1757 was committed. Recover container tokens

[jira] [Updated] (YARN-1339) Recover DeletionService state upon nodemanager restart

2014-04-08 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1339?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-1339: - Attachment: YARN-1339v2.patch Updating patch after YARN-1757 was committed. Recover DeletionService

[jira] [Updated] (YARN-1338) Recover localized resource cache state upon nodemanager restart

2014-04-09 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1338?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-1338: - Attachment: YARN-1338v2.patch Updating patch after YARN-1757 and other recent changes on trunk. Recover

[jira] [Created] (YARN-1930) HostUtil.getTaskLogUrl is not backwards binary compatible with 2.3

2014-04-11 Thread Jason Lowe (JIRA)
Jason Lowe created YARN-1930: Summary: HostUtil.getTaskLogUrl is not backwards binary compatible with 2.3 Key: YARN-1930 URL: https://issues.apache.org/jira/browse/YARN-1930 Project: Hadoop YARN

[jira] [Commented] (YARN-1553) Do not use HttpConfig.isSecure() in YARN

2014-04-11 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13966574#comment-13966574 ] Jason Lowe commented on YARN-1553: -- This broke source and binary backwards-compatibility

[jira] [Comment Edited] (YARN-1553) Do not use HttpConfig.isSecure() in YARN

2014-04-11 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13966574#comment-13966574 ] Jason Lowe edited comment on YARN-1553 at 4/11/14 2:36 PM: --- This

[jira] [Assigned] (YARN-1355) Recover application ACLs upon nodemanager restart

2014-04-14 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1355?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe reassigned YARN-1355: Assignee: Jason Lowe Recover application ACLs upon nodemanager restart

[jira] [Assigned] (YARN-1354) Recover applications upon nodemanager restart

2014-04-14 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1354?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe reassigned YARN-1354: Assignee: Jason Lowe Recover applications upon nodemanager restart

[jira] [Assigned] (YARN-1352) Recover LogAggregationService upon nodemanager restart

2014-04-14 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe reassigned YARN-1352: Assignee: Jason Lowe Recover LogAggregationService upon nodemanager restart

[jira] [Assigned] (YARN-1337) Recover active container state upon nodemanager restart

2014-04-14 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe reassigned YARN-1337: Assignee: Jason Lowe Recover active container state upon nodemanager restart

[jira] [Updated] (YARN-1354) Recover applications upon nodemanager restart

2014-04-15 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1354?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-1354: - Attachment: YARN-1354-v1.patch Patch that persists applications to a leveldb state store when recovery is

[jira] [Resolved] (YARN-1355) Recover application ACLs upon nodemanager restart

2014-04-15 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1355?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe resolved YARN-1355. -- Resolution: Duplicate Recover application ACLs upon nodemanager restart

[jira] [Commented] (YARN-1355) Recover application ACLs upon nodemanager restart

2014-04-15 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1355?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13969626#comment-13969626 ] Jason Lowe commented on YARN-1355: -- YARN-1354 is recovering application ACLs as part of

[jira] [Commented] (YARN-1942) ConverterUtils should not be Private

2014-04-15 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1942?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13969787#comment-13969787 ] Jason Lowe commented on YARN-1942: -- +1 to moving the conversion-from-string utilities into

[jira] [Commented] (YARN-1940) deleteAsUser() terminates early without deleting more files on error

2014-04-16 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13971784#comment-13971784 ] Jason Lowe commented on YARN-1940: -- Thanks for posting a patch, Rushabh. The patch will

[jira] [Commented] (YARN-1932) Javascript injection on the job status page

2014-04-18 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1932?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13974227#comment-13974227 ] Jason Lowe commented on YARN-1932: -- +1 lgtm. I'll commit this later today unless there

[jira] [Commented] (YARN-1940) deleteAsUser() terminates early without deleting more files on error

2014-04-18 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13974336#comment-13974336 ] Jason Lowe commented on YARN-1940: -- +1 lgtm, committing this deleteAsUser() terminates
