[jira] [Commented] (YARN-512) Log aggregation root directory check is more expensive than it needs to be

2013-05-29 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-512?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13669227#comment-13669227 ] Jason Lowe commented on YARN-512: - +1, will commit shortly. Log

[jira] [Commented] (YARN-713) ResourceManager can exit unexpectedly if DNS is unavailable

2013-05-31 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-713?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13671879#comment-13671879 ] Jason Lowe commented on YARN-713: - The visibility of setTokenServiceUseIp does not need to

[jira] [Updated] (YARN-742) Log aggregation causes a lot of redundant setPermission calls

2013-06-03 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-742: Attachment: YARN-742.patch Patch to walk back up the app log dir path to check for the existence of a

[jira] [Updated] (YARN-742) Log aggregation causes a lot of redundant setPermission calls

2013-06-04 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-742: Attachment: YARN-742-1.branch-0.23.patch Thanks for the review, Kihwal, and apologies with the patch issues.

[jira] [Assigned] (YARN-760) NodeManager throws AvroRuntimeException on failed start

2013-06-05 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe reassigned YARN-760: --- Assignee: Jason Lowe NodeManager throws AvroRuntimeException on failed start

[jira] [Assigned] (YARN-760) NodeManager throws AvroRuntimeException on failed start

2013-06-05 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe reassigned YARN-760: --- Assignee: Niranjan Singh (was: Jason Lowe) NodeManager throws AvroRuntimeException on failed

[jira] [Resolved] (YARN-346) InvalidStateTransitonException: Invalid event: INIT_CONTAINER at DONE for ContainerImpl in Node Manager

2013-06-06 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe resolved YARN-346. - Resolution: Duplicate This was fixed by YARN-212. InvalidStateTransitonException:

[jira] [Commented] (YARN-769) Add metrics for number of containers

2013-06-06 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13677100#comment-13677100 ] Jason Lowe commented on YARN-769: - I think one scenario where it could be useful is the case

[jira] [Commented] (YARN-760) NodeManager throws AvroRuntimeException on failed start

2013-06-06 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13677156#comment-13677156 ] Jason Lowe commented on YARN-760: - Is changing the unit test the right fix, or should we not

[jira] [Commented] (YARN-769) Add metrics for number of containers

2013-06-06 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13677215#comment-13677215 ] Jason Lowe commented on YARN-769: - Yes, there is the misconfig issue, but I was also

[jira] [Commented] (YARN-775) stream jobs are not cleaning the Yarn local-dirs after container is released

2013-06-06 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-775?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13677528#comment-13677528 ] Jason Lowe commented on YARN-775: - Is yarn.nodemanager.delete.debug-delay-sec configured to

[jira] [Commented] (YARN-778) Failures in container launches due to issues like disk failure are difficult to diagnose

2013-06-06 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-778?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13677702#comment-13677702 ] Jason Lowe commented on YARN-778: - Dup of YARN-257? Failures in container

[jira] [Commented] (YARN-760) NodeManager throws AvroRuntimeException on failed start

2013-06-07 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13678110#comment-13678110 ] Jason Lowe commented on YARN-760: - +1, will commit shortly NodeManager

[jira] [Commented] (YARN-295) Resource Manager throws InvalidStateTransitonException: Invalid event: CONTAINER_FINISHED at ALLOCATED for RMAppAttemptImpl

2013-06-14 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-295?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13683479#comment-13683479 ] Jason Lowe commented on YARN-295: - Is it guaranteed that if we get CONTAINER_FINISHED in the

[jira] [Commented] (YARN-649) Make container logs available over HTTP in plain text

2013-06-14 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-649?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13683639#comment-13683639 ] Jason Lowe commented on YARN-649: - Couple of other comments: * In

[jira] [Commented] (YARN-694) Start using NMTokens to authenticate all communication with NM

2013-06-19 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13688083#comment-13688083 ] Jason Lowe commented on YARN-694: - This broke TestContainerLauncherImpl -- it's now hanging

[jira] [Resolved] (YARN-892) Resource Manager throws InvalidStateTransitonException: Invalid event: CONTAINER_FINISHED at ALLOCATED

2013-07-01 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe resolved YARN-892. - Resolution: Duplicate Resource Manager throws InvalidStateTransitonException: Invalid event:

[jira] [Commented] (YARN-917) Job can fail when RM restarts after staging dir is cleaned but before MR successfully unregister with RM

2013-07-15 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13708537#comment-13708537 ] Jason Lowe commented on YARN-917: - I think one way to solve this is to move the removal of

[jira] [Commented] (YARN-321) Generic application history service

2013-07-15 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13709262#comment-13709262 ] Jason Lowe commented on YARN-321: - bq. Is there a reason to embed this inside the RM? I

[jira] [Resolved] (YARN-929) 2 MRAppMaster running parallely for same Job Id

2013-07-16 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe resolved YARN-929. - Resolution: Duplicate This is an issue with the MRAppMaster, currently tracked by MAPREDUCE-5396.

[jira] [Created] (YARN-950) Ability to limit or avoid aggregating logs beyond a certain size

2013-07-22 Thread Jason Lowe (JIRA)
Jason Lowe created YARN-950: --- Summary: Ability to limit or avoid aggregating logs beyond a certain size Key: YARN-950 URL: https://issues.apache.org/jira/browse/YARN-950 Project: Hadoop YARN

[jira] [Assigned] (YARN-949) Failed log aggregation can leave a file open.

2013-07-23 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-949?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe reassigned YARN-949: --- Assignee: Kihwal Lee The change is OK in the sense that it makes it more robust to errors occurring

[jira] [Updated] (YARN-949) Failed log aggregation can leave a file open.

2013-07-24 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-949?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-949: Target Version/s: 0.23.10 Affects Version/s: (was: 2.1.0-beta) Sounds good to me. Adjusting

[jira] [Resolved] (YARN-949) Failed log aggregation can leave a file open.

2013-07-25 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-949?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe resolved YARN-949. - Resolution: Fixed Fix Version/s: 0.23.10 Hadoop Flags: Reviewed Thanks, Kihwal. I committed

[jira] [Commented] (YARN-107) ClientRMService.forceKillApplication() should handle the non-RUNNING applications properly

2013-07-26 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13720845#comment-13720845 ] Jason Lowe commented on YARN-107: - bq. I think the easiest way to differentiate the error is

[jira] [Commented] (YARN-981) YARN/MR2/Job history /logs and /metrics link do not have correct content

2013-07-29 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13722512#comment-13722512 ] Jason Lowe commented on YARN-981: - Some overlap with YARN-783.

[jira] [Commented] (YARN-981) YARN/MR2/Job history /logs and /metrics link do not have correct content

2013-07-29 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13722513#comment-13722513 ] Jason Lowe commented on YARN-981: - Also appears to be a full duplicate of MAPREDUCE-3841.

[jira] [Resolved] (YARN-991) YARN/MR2 /logs and /metrics link do not have correct content

2013-07-29 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-991?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe resolved YARN-991. - Resolution: Duplicate Duplicate of YARN-981. YARN/MR2 /logs and /metrics link do not

[jira] [Commented] (YARN-917) Job can fail when RM restarts after staging dir is cleaned but before MR successfully unregister with RM

2013-07-30 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13723891#comment-13723891 ] Jason Lowe commented on YARN-917: - Yes, that's exactly what I was proposing with my first

[jira] [Commented] (YARN-993) job can not recovery after restart resourcemanager

2013-07-30 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13723970#comment-13723970 ] Jason Lowe commented on YARN-993: - This looks more like a MAPREDUCE issue to me. The MR AM

[jira] [Commented] (YARN-403) Node Manager throws java.io.IOException: Verification of the hashReply failed

2013-07-30 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13724199#comment-13724199 ] Jason Lowe commented on YARN-403: - This sounds like a duplicate of MAPREDUCE-5042 which was

[jira] [Commented] (YARN-107) ClientRMService.forceKillApplication() should handle the non-RUNNING applications properly

2013-07-31 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13725237#comment-13725237 ] Jason Lowe commented on YARN-107: - I still think throwing an exception for this is a mistake

[jira] [Commented] (YARN-573) Shared data structures in Public Localizer and Private Localizer are not Thread safe.

2013-07-31 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-573?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13725544#comment-13725544 ] Jason Lowe commented on YARN-573: - bq. I thought about it earlier but we are using iterator

[jira] [Commented] (YARN-972) Allow requests and scheduling for fractional virtual cores

2013-08-01 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13726452#comment-13726452 ] Jason Lowe commented on YARN-972: - bq. What could work is for a YARN app to be able to say

[jira] [Commented] (YARN-573) Shared data structures in Public Localizer and Private Localizer are not Thread safe.

2013-08-01 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-573?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13726776#comment-13726776 ] Jason Lowe commented on YARN-573: - +1, lgtm as well. Committing this.

[jira] [Updated] (YARN-573) Shared data structures in Public Localizer and Private Localizer are not Thread safe.

2013-08-01 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-573: Fix Version/s: (was: 2.1.0-beta) 2.1.1-beta Shared data structures in Public

[jira] [Commented] (YARN-1020) Resource Localization using Groups as a new Localization Type

2013-08-02 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13728110#comment-13728110 ] Jason Lowe commented on YARN-1020: -- A container could modify a localized resource if the

[jira] [Commented] (YARN-1024) Define a virtual core unambigiously

2013-08-05 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1024?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13729895#comment-13729895 ] Jason Lowe commented on YARN-1024: -- Agree that the example posed by [~sandyr] shows that a

[jira] [Commented] (YARN-1031) JQuery UI components reference external css in branch-23

2013-08-05 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13729922#comment-13729922 ] Jason Lowe commented on YARN-1031: -- +1, lgtm. JQuery UI components

[jira] [Commented] (YARN-1016) Define a HDFS based repository that allows YARN services to share resources

2013-08-07 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13732610#comment-13732610 ] Jason Lowe commented on YARN-1016: -- YARN already provides a cache of localized resources,

[jira] [Updated] (YARN-337) RM handles killed application tracking URL poorly

2013-08-08 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-337: Attachment: YARN-337.patch Patch that sets the tracking URL to the RM app page when an AM attempt is

[jira] [Commented] (YARN-1036) Distributed Cache gives inconsistent result if cache files get deleted from task tracker

2013-08-13 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1036?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13738337#comment-13738337 ] Jason Lowe commented on YARN-1036: -- Agree with Ravi that we should focus on porting the

[jira] [Commented] (YARN-1036) Distributed Cache gives inconsistent result if cache files get deleted from task tracker

2013-08-13 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1036?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13738754#comment-13738754 ] Jason Lowe commented on YARN-1036: -- +1 lgtm as well. Committing this.

[jira] [Updated] (YARN-573) Shared data structures in Public Localizer and Private Localizer are not Thread safe.

2013-08-13 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-573: Fix Version/s: 0.23.10 +1 lgtm as well, thanks Mit and Omkar! I committed this to branch-0.23.

[jira] [Commented] (YARN-1071) ResourceManager's decommissioned and lost node count is 0 after restart

2013-08-16 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1071?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13742469#comment-13742469 ] Jason Lowe commented on YARN-1071: -- The NM counts are only for NMs that have connected to

[jira] [Commented] (YARN-194) Log handling in case of NM restart.

2013-08-19 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-194?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13743902#comment-13743902 ] Jason Lowe commented on YARN-194: - The NM waits not only for the container to complete but

[jira] [Commented] (YARN-917) Job can fail when RM restarts after staging dir is cleaned but before MR successfully unregister with RM

2013-08-21 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13746747#comment-13746747 ] Jason Lowe commented on YARN-917: - Patch looks OK to me. Shouldn't be too hard to write a

[jira] [Resolved] (YARN-1091) All containers localization fails in NM when any one of the configured nm local-dir disk becomes full

2013-08-22 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe resolved YARN-1091. -- Resolution: Duplicate Duplicate of YARN-257. All containers localization fails in NM

[jira] [Commented] (YARN-707) Add user info in the YARN ClientToken

2013-08-23 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-707?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13748691#comment-13748691 ] Jason Lowe commented on YARN-707: - Tested this on a secure cluster along with the original

[jira] [Commented] (YARN-707) Add user info in the YARN ClientToken

2013-08-23 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-707?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13749037#comment-13749037 ] Jason Lowe commented on YARN-707: - Yep, talked with Daryn offline and he's OK with it. We

[jira] [Reopened] (YARN-707) Add user info in the YARN ClientToken

2013-08-23 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe reopened YARN-707: - Reopening this, as the client token is always the app submitter. That means the AM always sees the user as

[jira] [Assigned] (YARN-707) Add user info in the YARN ClientToken

2013-08-26 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe reassigned YARN-707: --- Assignee: Jason Lowe (was: Vinod Kumar Vavilapalli) In order for YARN applications to implement their

[jira] [Commented] (YARN-707) Add user info in the YARN ClientToken

2013-08-27 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-707?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13751312#comment-13751312 ] Jason Lowe commented on YARN-707: - bq. Can you please verify that this is the case and RM

[jira] [Commented] (YARN-707) Add user info in the YARN ClientToken

2013-08-27 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-707?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13751379#comment-13751379 ] Jason Lowe commented on YARN-707: - I don't believe they do, as

[jira] [Commented] (YARN-707) Add user info in the YARN ClientToken

2013-08-27 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-707?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13751428#comment-13751428 ] Jason Lowe commented on YARN-707: - bq. Would it be possible to continue using the app

[jira] [Updated] (YARN-707) Add user info in the YARN ClientToken

2013-08-27 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-707: Attachment: YARN-707-20130827.txt Patch to change the client-to-AM token user to be the user that requested

[jira] [Created] (YARN-1108) Always use tokens for client-to-AM connections

2013-08-27 Thread Jason Lowe (JIRA)
Jason Lowe created YARN-1108: Summary: Always use tokens for client-to-AM connections Key: YARN-1108 URL: https://issues.apache.org/jira/browse/YARN-1108 Project: Hadoop YARN Issue Type:

[jira] [Commented] (YARN-1113) Job failing when one of the NM local dir got filled

2013-08-28 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1113?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13752417#comment-13752417 ] Jason Lowe commented on YARN-1113: -- This is related to, and possibly just a duplicate of,

[jira] [Resolved] (YARN-1114) Resource Manager Failure Due to Unreachable DNS

2013-08-28 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe resolved YARN-1114. -- Resolution: Duplicate This is a duplicate of YARN-713. Resource Manager Failure Due

[jira] [Commented] (YARN-707) Add user info in the YARN ClientToken

2013-08-28 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-707?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13752573#comment-13752573 ] Jason Lowe commented on YARN-707: - Thanks for the review, Daryn. bq. Technically you should

[jira] [Updated] (YARN-707) Add user info in the YARN ClientToken

2013-08-28 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-707: Attachment: YARN-707-20130828.txt Updated patch to change Tokens to Credentials in method names.

[jira] [Commented] (YARN-707) Add user info in the YARN ClientToken

2013-08-28 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-707?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13752765#comment-13752765 ] Jason Lowe commented on YARN-707: - bq. there isn't a version ID in the token to bump. Will

[jira] [Updated] (YARN-707) Add user info in the YARN ClientToken

2013-08-28 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-707: Attachment: YARN-707-20130828-2.txt Realized I never changed the ClientToAMTokenIdentifier method and field

[jira] [Commented] (YARN-707) Add user info in the YARN ClientToken

2013-08-28 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-707?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13753029#comment-13753029 ] Jason Lowe commented on YARN-707: - Thanks for the review, Vinod. I manually tested this on

[jira] [Commented] (YARN-1107) Job submitted with Delegation token in secured environment causes RM to fail during RM restart

2013-08-29 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13754059#comment-13754059 ] Jason Lowe commented on YARN-1107: -- bq. Jason Lowe and Daryn Sharp, can you confirm if

[jira] [Updated] (YARN-707) Add user info in the YARN ClientToken

2013-08-29 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-707: Attachment: YARN-707-20130829.txt Patch to pass the client user name to getApplicationReport and enhance the

[jira] [Commented] (YARN-896) Roll up for long lived YARN

2013-08-30 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-896?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13754812#comment-13754812 ] Jason Lowe commented on YARN-896: - bq. Chris, feel free to file a JIRA for rolling of stdout

[jira] [Assigned] (YARN-305) Too many 'Node offerred to app:... messages in RM

2013-09-03 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-305?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe reassigned YARN-305: --- Assignee: Lohit Vijayarenu Too many 'Node offerred to app:... messages in RM

[jira] [Commented] (YARN-540) Race condition causing RM to potentially relaunch already unregistered AMs on RM restart

2013-09-04 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-540?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13757839#comment-13757839 ] Jason Lowe commented on YARN-540: - Sorry for arriving late, but why wouldn't we want to

[jira] [Commented] (YARN-707) Add user info in the YARN ClientToken

2013-09-04 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-707?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13757909#comment-13757909 ] Jason Lowe commented on YARN-707: - bq. Ug, the RM and AM are abusing the same secret manager

[jira] [Moved] (YARN-1145) Potential file handler leak in JobHistoryServer web ui.

2013-09-04 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe moved MAPREDUCE-5486 to YARN-1145: - Component/s: (was: jobhistoryserver) Assignee:

[jira] [Assigned] (YARN-1145) Potential file handler leak in JobHistoryServer web ui.

2013-09-04 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe reassigned YARN-1145: Assignee: Rohith Sharma K S Potential file handler leak in JobHistoryServer web ui.

[jira] [Updated] (YARN-1145) Potential file handle leak in aggregated logs web ui

2013-09-04 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-1145: - Target Version/s: 0.23.10, 2.1.1-beta (was: 2.1.1-beta) Affects Version/s: 0.23.9

[jira] [Commented] (YARN-1145) Potential file handle leak in aggregated logs web ui

2013-09-04 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758047#comment-13758047 ] Jason Lowe commented on YARN-1145: -- I think the logic behind calling close on the

[jira] [Commented] (YARN-540) Race condition causing RM to potentially relaunch already unregistered AMs on RM restart

2013-09-04 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-540?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758096#comment-13758096 ] Jason Lowe commented on YARN-540: - Yes, I realize that 1) and 2) are at a high level

[jira] [Commented] (YARN-707) Add user info in the YARN ClientToken

2013-09-04 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-707?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13758209#comment-13758209 ] Jason Lowe commented on YARN-707: - Thanks for the review, Daryn. bq.

[jira] [Updated] (YARN-707) Add user info in the YARN ClientToken

2013-09-04 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-707: Attachment: YARN-707-20130904.branch-0.23.txt Updated patch for branch-0.23 to add isEmpty() check on client

[jira] [Updated] (YARN-707) Add user info in the YARN ClientToken

2013-09-04 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-707: Fix Version/s: 0.23.10 I committed this to branch-0.23. Add user info in the YARN

[jira] [Commented] (YARN-540) Race condition causing RM to potentially relaunch already unregistered AMs on RM restart

2013-09-05 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-540?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13759081#comment-13759081 ] Jason Lowe commented on YARN-540: - bq. Once work-preserving restart is implemented, this

[jira] [Commented] (YARN-540) Race condition causing RM to potentially relaunch already unregistered AMs on RM restart

2013-09-05 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-540?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13759395#comment-13759395 ] Jason Lowe commented on YARN-540: - Ah, after the RM restarts, the NM can notify the RM that

[jira] [Commented] (YARN-540) Race condition causing RM to potentially relaunch already unregistered AMs on RM restart

2013-09-05 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-540?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13759384#comment-13759384 ] Jason Lowe commented on YARN-540: - Unless I'm missing something, it does require a behavior

[jira] [Commented] (YARN-540) Race condition causing RM to potentially relaunch already unregistered AMs on RM restart

2013-09-05 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-540?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13759412#comment-13759412 ] Jason Lowe commented on YARN-540: - bq. Then RM will also need to somehow remember that

[jira] [Created] (YARN-1152) Invalid key to HMAC computation error when getting application report for completed app attempt

2013-09-05 Thread Jason Lowe (JIRA)
Jason Lowe created YARN-1152: Summary: Invalid key to HMAC computation error when getting application report for completed app attempt Key: YARN-1152 URL: https://issues.apache.org/jira/browse/YARN-1152

[jira] [Commented] (YARN-1152) Invalid key to HMAC computation error when getting application report for completed app attempt

2013-09-05 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1152?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13759576#comment-13759576 ] Jason Lowe commented on YARN-1152: -- Stack trace: {noformat} Problem accessing

[jira] [Assigned] (YARN-1152) Invalid key to HMAC computation error when getting application report for completed app attempt

2013-09-05 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1152?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe reassigned YARN-1152: Assignee: Jason Lowe Invalid key to HMAC computation error when getting application report for

[jira] [Updated] (YARN-1152) Invalid key to HMAC computation error when getting application report for completed app attempt

2013-09-05 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1152?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-1152: - Attachment: YARN-1152.txt Patch to move client token creation from RMAppImpl to RMAppAttemptImpl. Also

[jira] [Updated] (YARN-1152) Invalid key to HMAC computation error when getting application report for completed app attempt

2013-09-06 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1152?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-1152: - Target Version/s: 2.1.1-beta (was: 0.23.10, 2.1.1-beta) Affects Version/s: (was: 0.23.10) Turns

[jira] [Commented] (YARN-1152) Invalid key to HMAC computation error when getting application report for completed app attempt

2013-09-06 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1152?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13760281#comment-13760281 ] Jason Lowe commented on YARN-1152: -- I also manually tested this on a secure cluster.

[jira] [Updated] (YARN-1152) Invalid key to HMAC computation error when getting application report for completed app attempt

2013-09-09 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1152?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-1152: - Attachment: YARN-1152-2.txt Thanks for the review, Vinod. bq. We had a

[jira] [Commented] (YARN-1175) LogLength shown in $ yarn logs is 1 character longer than actual stdout

2013-09-09 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1175?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13762370#comment-13762370 ] Jason Lowe commented on YARN-1175: -- I believe there's a trailing newline character in the

[jira] [Commented] (YARN-1175) LogLength shown in $ yarn logs is 1 character longer than actual stdout

2013-09-09 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1175?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13762388#comment-13762388 ] Jason Lowe commented on YARN-1175: -- It is consistent with the contents of the log. If

[jira] [Commented] (YARN-1179) add ApplicationConstants option to define base dir of the installed application

2013-09-11 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1179?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13764278#comment-13764278 ] Jason Lowe commented on YARN-1179: -- Doesn't {{$PWD}} already expand to the current working

[jira] [Commented] (YARN-540) Race condition causing RM to potentially relaunch already unregistered AMs on RM restart

2013-09-11 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-540?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13764287#comment-13764287 ] Jason Lowe commented on YARN-540: - bq. The solution is to not report success to user until

[jira] [Commented] (YARN-540) Race condition causing RM to potentially relaunch already unregistered AMs on RM restart

2013-09-11 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-540?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13764689#comment-13764689 ] Jason Lowe commented on YARN-540: - JobClient is the standard APIs. I don't mean to imply we

[jira] [Commented] (YARN-1185) FileSystemRMStateStore doesn't use temporary files when writing data

2013-09-12 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13765622#comment-13765622 ] Jason Lowe commented on YARN-1185: -- Also, couldn't it be left with zero-length files if it

[jira] [Commented] (YARN-1185) FileSystemRMStateStore doesn't use temporary files when writing data

2013-09-12 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13765620#comment-13765620 ] Jason Lowe commented on YARN-1185: -- Ah I see. That's an assumption based on a specific

[jira] [Created] (YARN-1185) FileSystemRMStateStore doesn't use temporary files when writing data

2013-09-12 Thread Jason Lowe (JIRA)
Jason Lowe created YARN-1185: Summary: FileSystemRMStateStore doesn't use temporary files when writing data Key: YARN-1185 URL: https://issues.apache.org/jira/browse/YARN-1185 Project: Hadoop YARN

[jira] [Created] (YARN-1189) NMTokenSecretManagerInNM is not being told when applications have finished

2013-09-12 Thread Jason Lowe (JIRA)
Jason Lowe created YARN-1189: Summary: NMTokenSecretManagerInNM is not being told when applications have finished Key: YARN-1189 URL: https://issues.apache.org/jira/browse/YARN-1189 Project: Hadoop YARN

[jira] [Updated] (YARN-1185) FileSystemRMStateStore can leave partial files that prevent subsequent recovery

2013-09-13 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated YARN-1185: - Summary: FileSystemRMStateStore can leave partial files that prevent subsequent recovery (was:

[jira] [Commented] (YARN-1194) TestContainerLogsPage test fails on trunk

2013-09-13 Thread Jason Lowe (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1194?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13766516#comment-13766516 ] Jason Lowe commented on YARN-1194: -- +1, lgtm. TestContainerLogsPage test

  1   2   3   4   5   6   7   8   9   10   >