[jira] [Commented] (YARN-1779) Handle AMRMTokens across RM failover

2014-07-23 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1779?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14071772#comment-14071772 ] Rohith commented on YARN-1779: -- This is critical issue for work preserving restart feature. AM

[jira] [Commented] (YARN-2349) InvalidStateTransitonException after RM switch

2014-07-24 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2349?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14073158#comment-14073158 ] Rohith commented on YARN-2349: -- This is basically configurations in capacity-scheduler.xml of

[jira] [Assigned] (YARN-2349) InvalidStateTransitonException after RM switch

2014-07-24 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith reassigned YARN-2349: Assignee: Rohith InvalidStateTransitonException after RM switch

[jira] [Commented] (YARN-2350) TestApplicationMasterServiceOnHA fails with InvalidToken exception

2014-07-24 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14073973#comment-14073973 ] Rohith commented on YARN-2350: -- This issue is because of YARN-2208 check in. As a wholse

[jira] [Commented] (YARN-2209) Replace allocate#resync command with ApplicationMasterNotRegisteredException to indicate AM to re-register on RM restart

2014-07-25 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14074366#comment-14074366 ] Rohith commented on YARN-2209: -- Hi [~jianhe], I reviewed patch and found some comments 1.

[jira] [Commented] (YARN-2209) Replace allocate#resync command with ApplicationMasterNotRegisteredException to indicate AM to re-register on RM restart

2014-07-27 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14075877#comment-14075877 ] Rohith commented on YARN-2209: -- Thanks Jian He for updating patch. It looks good overall to

[jira] [Commented] (YARN-2209) Replace AM resync/shutdown command with corresponding exceptions

2014-07-28 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14075973#comment-14075973 ] Rohith commented on YARN-2209: -- +1 patch looks good to me Replace AM resync/shutdown command

[jira] [Assigned] (YARN-2409) InvalidStateTransitonException in ResourceManager after job recovery

2014-08-13 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith reassigned YARN-2409: Assignee: Rohith InvalidStateTransitonException in ResourceManager after job recovery

[jira] [Commented] (YARN-2409) InvalidStateTransitonException in ResourceManager after job recovery

2014-08-13 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2409?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14095466#comment-14095466 ] Rohith commented on YARN-2409: -- I looked into issue (got logs from [~nishan] offline), there

[jira] [Updated] (YARN-2409) InvalidStateTransitonException in ResourceManager after job recovery

2014-08-13 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith updated YARN-2409: - Attachment: YARN-2409.patch Attached the patch. Please review.. I have verified patch for 1. Thread Leak :

[jira] [Updated] (YARN-2409) InvalidStateTransitonException in ResourceManager after job recovery

2014-08-13 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith updated YARN-2409: - Attachment: (was: YARN-2409.patch) InvalidStateTransitonException in ResourceManager after job recovery

[jira] [Updated] (YARN-2409) InvalidStateTransitonException in ResourceManager after job recovery

2014-08-13 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith updated YARN-2409: - Attachment: YARN-2409.patch InvalidStateTransitonException in ResourceManager after job recovery

[jira] [Updated] (YARN-2409) Active to StandBy transition does not stop rmDispatcher that causes 1 AsyncDispatcher thread leak.

2014-08-13 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith updated YARN-2409: - Priority: Critical (was: Major) Active to StandBy transition does not stop rmDispatcher that causes 1

[jira] [Updated] (YARN-2409) Active to StandBy transition does not stop rmDispatcher that causes 1 AsyncDispatcher thread leak.

2014-08-13 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith updated YARN-2409: - Summary: Active to StandBy transition does not stop rmDispatcher that causes 1 AsyncDispatcher thread leak.

[jira] [Commented] (YARN-2409) Active to StandBy transition does not stop rmDispatcher that causes 1 AsyncDispatcher thread leak.

2014-08-19 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2409?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14103382#comment-14103382 ] Rohith commented on YARN-2409: -- Thanks [~eepayne] and [~jianhe] for review.:-) Active to

[jira] [Commented] (YARN-1879) Mark Idempotent/AtMostOnce annotations to ApplicationMasterProtocol

2014-08-21 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14105276#comment-14105276 ] Rohith commented on YARN-1879: -- Hi, Any update on this issue? Mark Idempotent/AtMostOnce

[jira] [Assigned] (YARN-2442) ResourceManager JMX UI does not give HA State

2014-08-27 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith reassigned YARN-2442: Assignee: Rohith ResourceManager JMX UI does not give HA State

[jira] [Commented] (YARN-2442) ResourceManager JMX UI does not give HA State

2014-08-27 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14112134#comment-14112134 ] Rohith commented on YARN-2442: -- This can be taken as Improvement, not really a bug. In

[jira] [Assigned] (YARN-2523) ResourceManager UI showing negative value for Decommissioned Nodes field

2014-09-08 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith reassigned YARN-2523: Assignee: Rohith ResourceManager UI showing negative value for Decommissioned Nodes field

[jira] [Commented] (YARN-2523) ResourceManager UI showing negative value for Decommissioned Nodes field

2014-09-08 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14126580#comment-14126580 ] Rohith commented on YARN-2523: -- Decommissioned Node metrics are set by NodeListManager. If

[jira] [Updated] (YARN-2523) ResourceManager UI showing negative value for Decommissioned Nodes field

2014-09-13 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith updated YARN-2523: - Attachment: YARN-2523.patch uploaded patch to fix this issue. Test details : 1. Recured using test, and applied

[jira] [Updated] (YARN-2523) ResourceManager UI showing negative value for Decommissioned Nodes field

2014-09-13 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith updated YARN-2523: - Attachment: YARN-2523.patch Verified the fix again, decommissioned nodes should not be decremented again

[jira] [Commented] (YARN-2523) ResourceManager UI showing negative value for Decommissioned Nodes field

2014-09-13 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14132688#comment-14132688 ] Rohith commented on YARN-2523: -- Attached udpated patch. Please review ResourceManager UI

[jira] [Created] (YARN-2550) TestAMRestart fails intermittently

2014-09-13 Thread Rohith (JIRA)
Rohith created YARN-2550: Summary: TestAMRestart fails intermittently Key: YARN-2550 URL: https://issues.apache.org/jira/browse/YARN-2550 Project: Hadoop YARN Issue Type: Bug Components:

[jira] [Commented] (YARN-2523) ResourceManager UI showing negative value for Decommissioned Nodes field

2014-09-13 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14132773#comment-14132773 ] Rohith commented on YARN-2523: -- Test failure checked, it is not related to fix. I raised new

[jira] [Created] (YARN-2579) Both RM's state is Active , but 1 RM is not really active.

2014-09-22 Thread Rohith (JIRA)
Rohith created YARN-2579: Summary: Both RM's state is Active , but 1 RM is not really active. Key: YARN-2579 URL: https://issues.apache.org/jira/browse/YARN-2579 Project: Hadoop YARN Issue Type: Bug

[jira] [Commented] (YARN-2579) Both RM's state is Active , but 1 RM is not really active.

2014-09-22 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2579?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14143200#comment-14143200 ] Rohith commented on YARN-2579: -- This scenario could ocure if 2 thread trying to access

[jira] [Assigned] (YARN-2579) Both RM's state is Active , but 1 RM is not really active.

2014-09-23 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2579?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith reassigned YARN-2579: Assignee: Rohith Both RM's state is Active , but 1 RM is not really active.

[jira] [Commented] (YARN-2579) Both RM's state is Active , but 1 RM is not really active.

2014-09-23 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2579?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14144550#comment-14144550 ] Rohith commented on YARN-2579: -- For fixing this, approaches I can think of are 1. we can call

[jira] [Created] (YARN-2588) Standby RM does not transitionToActive if previous transitionToActive is failed with ZK exception.

2014-09-23 Thread Rohith (JIRA)
Rohith created YARN-2588: Summary: Standby RM does not transitionToActive if previous transitionToActive is failed with ZK exception. Key: YARN-2588 URL: https://issues.apache.org/jira/browse/YARN-2588

[jira] [Assigned] (YARN-2588) Standby RM does not transitionToActive if previous transitionToActive is failed with ZK exception.

2014-09-23 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith reassigned YARN-2588: Assignee: Rohith Standby RM does not transitionToActive if previous transitionToActive is failed with ZK

[jira] [Commented] (YARN-2588) Standby RM does not transitionToActive if previous transitionToActive is failed with ZK exception.

2014-09-23 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14144604#comment-14144604 ] Rohith commented on YARN-2588: -- Consider RM initially in standby. 1. StandBy RM 2. StandBy

[jira] [Commented] (YARN-2588) Standby RM does not transitionToActive if previous transitionToActive is failed with ZK exception.

2014-09-23 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14144612#comment-14144612 ] Rohith commented on YARN-2588: -- This is basically problem in not recreating active services if

[jira] [Commented] (YARN-2588) Standby RM does not transitionToActive if previous transitionToActive is failed with ZK exception.

2014-09-23 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14145822#comment-14145822 ] Rohith commented on YARN-2588: -- [~cindy2012] RM zkClient got session expired. Immediately

[jira] [Commented] (YARN-2523) ResourceManager UI showing negative value for Decommissioned Nodes field

2014-09-24 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14146083#comment-14146083 ] Rohith commented on YARN-2523: -- Thank you Jason Lowe for your suggestion. Considering your

[jira] [Updated] (YARN-2523) ResourceManager UI showing negative value for Decommissioned Nodes field

2014-09-24 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith updated YARN-2523: - Attachment: YARN-2523.1.patch Updated the patch for handling tests mentioned in my previous comment. Please

[jira] [Updated] (YARN-2588) Standby RM does not transitionToActive if previous transitionToActive is failed with ZK exception.

2014-09-24 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith updated YARN-2588: - Attachment: YARN-2588.patch Updated the patch for fixing issue. Please review.. Standby RM does not

[jira] [Commented] (YARN-2601) RMs(HA RMS) can't enter active state

2014-09-24 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2601?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14147308#comment-14147308 ] Rohith commented on YARN-2601: -- Hi [~cindy2012], This looks to be similar of YARN-2588. In

[jira] [Commented] (YARN-2349) InvalidStateTransitonException after RM switch

2014-09-24 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2349?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14147325#comment-14147325 ] Rohith commented on YARN-2349: -- Hi [~nishan], would you confirm that this issue you are still

[jira] [Commented] (YARN-1703) Too many connections are opened for proxy server when applicationMaster UI is accessed.

2014-09-24 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14147329#comment-14147329 ] Rohith commented on YARN-1703: -- Hi all, Could anyone review the patch please? Too many

[jira] [Commented] (YARN-2523) ResourceManager UI showing negative value for Decommissioned Nodes field

2014-09-24 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14147369#comment-14147369 ] Rohith commented on YARN-2523: -- Thanks Jian He for looking into patch. I will relook into test

[jira] [Commented] (YARN-2523) ResourceManager UI showing negative value for Decommissioned Nodes field

2014-09-25 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14148636#comment-14148636 ] Rohith commented on YARN-2523: -- Thanks [~jianhe] and [~jlowe] for review and committing

[jira] [Commented] (YARN-2625) Problems with CLASSPATH in Job Submission REST API

2014-10-06 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2625?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14160170#comment-14160170 ] Rohith commented on YARN-2625: -- While submitting appliclaiton from REST api, it expects

[jira] [Updated] (YARN-2579) Both RM's state is Active , but 1 RM is not really active.

2014-10-07 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2579?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith updated YARN-2579: - Attachment: YARN-2579.patch Both RM's state is Active , but 1 RM is not really active.

[jira] [Commented] (YARN-2579) Both RM's state is Active , but 1 RM is not really active.

2014-10-07 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2579?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14161774#comment-14161774 ] Rohith commented on YARN-2579: -- Considering 1 st approach as feasible, I attached patch.

[jira] [Commented] (YARN-2655) AllocatedGB/AvailableGB in nodemanager JMX showing only integer values

2014-10-08 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14164679#comment-14164679 ] Rohith commented on YARN-2655: -- Keeping values in MB's is good approach that make consistency

[jira] [Commented] (YARN-2010) RM can't transition to active if it can't recover an app attempt

2014-10-08 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14164692#comment-14164692 ] Rohith commented on YARN-2010: -- Changing assignee to [~kasha] since he has done all the work.

[jira] [Assigned] (YARN-2010) RM can't transition to active if it can't recover an app attempt

2014-10-08 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith reassigned YARN-2010: Assignee: Karthik Kambatla (was: Rohith) RM can't transition to active if it can't recover an app attempt

[jira] [Commented] (YARN-2588) Standby RM does not transitionToActive if previous transitionToActive is failed with ZK exception.

2014-10-14 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14170587#comment-14170587 ] Rohith commented on YARN-2588: -- Hi [~jianhe] , could you please review the patch whenever you

[jira] [Updated] (YARN-2588) Standby RM does not transitionToActive if previous transitionToActive is failed with ZK exception.

2014-10-15 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith updated YARN-2588: - Attachment: YARN-2588.1.patch Standby RM does not transitionToActive if previous transitionToActive is failed

[jira] [Commented] (YARN-2588) Standby RM does not transitionToActive if previous transitionToActive is failed with ZK exception.

2014-10-15 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14172284#comment-14172284 ] Rohith commented on YARN-2588: -- I updated the patch with above changes.Please review Standby

[jira] [Updated] (YARN-2579) Both RM's state is Active , but 1 RM is not really active.

2014-10-15 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2579?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith updated YARN-2579: - Attachment: YARN-2579.patch Both RM's state is Active , but 1 RM is not really active.

[jira] [Commented] (YARN-2579) Both RM's state is Active , but 1 RM is not really active.

2014-10-15 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2579?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14172369#comment-14172369 ] Rohith commented on YARN-2579: -- I updated the patch with test that simulates

[jira] [Commented] (YARN-2579) Both RM's state is Active , but 1 RM is not really active.

2014-10-15 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2579?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14173273#comment-14173273 ] Rohith commented on YARN-2579: -- Hi [~vinodkv], [~kasha], [~jianhe] Can this issue fix

[jira] [Commented] (YARN-2588) Standby RM does not transitionToActive if previous transitionToActive is failed with ZK exception.

2014-10-16 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14174673#comment-14174673 ] Rohith commented on YARN-2588: -- In RMWebApp, we have code below where ApplicationACLManager

[jira] [Updated] (YARN-2588) Standby RM does not transitionToActive if previous transitionToActive is failed with ZK exception.

2014-10-16 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith updated YARN-2588: - Attachment: YARN-2588.2.patch Standby RM does not transitionToActive if previous transitionToActive is failed

[jira] [Commented] (YARN-2588) Standby RM does not transitionToActive if previous transitionToActive is failed with ZK exception.

2014-10-16 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14174748#comment-14174748 ] Rohith commented on YARN-2588: -- bq. But anyways, this doesn't matter too much? as these two

[jira] [Commented] (YARN-2588) Standby RM does not transitionToActive if previous transitionToActive is failed with ZK exception.

2014-10-16 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14174749#comment-14174749 ] Rohith commented on YARN-2588: -- bq. add Assert.fail() after

[jira] [Commented] (YARN-2398) TestResourceTrackerOnHA crashes

2014-10-17 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14174943#comment-14174943 ] Rohith commented on YARN-2398: -- [~ozawa] The attached log and reported issue both are

[jira] [Created] (YARN-2702) TestResourceTrackerOnHA fails with NPE

2014-10-17 Thread Rohith (JIRA)
Rohith created YARN-2702: Summary: TestResourceTrackerOnHA fails with NPE Key: YARN-2702 URL: https://issues.apache.org/jira/browse/YARN-2702 Project: Hadoop YARN Issue Type: Bug

[jira] [Resolved] (YARN-2702) TestResourceTrackerOnHA fails with NPE

2014-10-17 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2702?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith resolved YARN-2702. -- Resolution: Duplicate ok TestResourceTrackerOnHA fails with NPE --

[jira] [Commented] (YARN-2588) Standby RM does not transitionToActive if previous transitionToActive is failed with ZK exception.

2014-10-17 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14175329#comment-14175329 ] Rohith commented on YARN-2588: -- Thanks 'Jian He' for review and committing patch:-) Standby

[jira] [Updated] (YARN-2691) User level API support for priority label

2014-10-20 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith updated YARN-2691: - Attachment: YARN-2691.patch User level API support for priority label -

[jira] [Commented] (YARN-2579) Both RM's state is Active , but 1 RM is not really active.

2014-10-20 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-2579?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14177886#comment-14177886 ] Rohith commented on YARN-2579: -- bq. Under what conditions, can resetDispatcher be called by

[jira] [Updated] (YARN-1752) Unexpected Unregistered event at Attempt Launched state

2014-02-28 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1752?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith updated YARN-1752: - Attachment: YARN-1752.3.patch Attaching patch for addressing comments. Please review. Unexpected Unregistered

[jira] [Commented] (YARN-1206) Container logs link is broken on RM web UI after application finished

2014-03-03 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13918064#comment-13918064 ] Rohith commented on YARN-1206: -- I am able to reproduce this issue in today's trunk with log

[jira] [Updated] (YARN-1752) Unexpected Unregistered event at Attempt Launched state

2014-03-03 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1752?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith updated YARN-1752: - Attachment: YARN-1752.4.patch Attaching patch for fixing comments.Please review. Unexpected Unregistered event

[jira] [Assigned] (YARN-1206) Container logs link is broken on RM web UI after application finished

2014-03-04 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith reassigned YARN-1206: Assignee: Rohith Container logs link is broken on RM web UI after application finished

[jira] [Updated] (YARN-1206) Container logs link is broken on RM web UI after application finished

2014-03-04 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith updated YARN-1206: - Attachment: YARN-1206.patch Attaching patch for fixing this issue. Please review Container logs link is broken

[jira] [Commented] (YARN-1752) Unexpected Unregistered event at Attempt Launched state

2014-03-04 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1752?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13919548#comment-13919548 ] Rohith commented on YARN-1752: -- Previous Hadoop QA failure is not because of patch. What is

[jira] [Updated] (YARN-1752) Unexpected Unregistered event at Attempt Launched state

2014-03-04 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1752?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith updated YARN-1752: - Attachment: YARN-1752.5.patch bq. but just that there's still a typo in the code comment: tries to register more

[jira] [Commented] (YARN-1705) Cluster metrics are off after failover

2014-03-10 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13929905#comment-13929905 ] Rohith commented on YARN-1705: -- Hi Karthik, I started verifying RM HA in trunk. I got issue

[jira] [Commented] (YARN-1705) Cluster metrics are off after failover

2014-03-10 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13929921#comment-13929921 ] Rohith commented on YARN-1705: -- Thank you for offering :-) I will take up this Jira. Cluster

[jira] [Assigned] (YARN-1705) Cluster metrics are off after failover

2014-03-10 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1705?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith reassigned YARN-1705: Assignee: Rohith (was: Karthik Kambatla) Cluster metrics are off after failover

[jira] [Commented] (YARN-1705) Cluster metrics are off after failover

2014-03-11 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13930309#comment-13930309 ] Rohith commented on YARN-1705: -- For understaing detail scope of Jira, 1. Currently , on

[jira] [Updated] (YARN-1705) Cluster metrics are off after failover

2014-03-13 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1705?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith updated YARN-1705: - Attachment: YARN-1705.1.patch Hi, I attached patch that handles 1. transtion

[jira] [Commented] (YARN-1206) Container logs link is broken on RM web UI after application finished

2014-03-16 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13937439#comment-13937439 ] Rohith commented on YARN-1206: -- Hi Jian, Thank you for looking into patch. There are 2

[jira] [Updated] (YARN-1206) Container logs link is broken on RM web UI after application finished

2014-03-17 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith updated YARN-1206: - Attachment: YARN-1206.1.patch I added comment in ContainerLogsUtils.getContainerLogDirs() as below. It is not

[jira] [Updated] (YARN-1705) Cluster metrics are off after failover

2014-03-18 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1705?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith updated YARN-1705: - Attachment: YARN-1705.2.patch Cluster metrics are off after failover --

[jira] [Commented] (YARN-1705) Cluster metrics are off after failover

2014-03-18 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13939010#comment-13939010 ] Rohith commented on YARN-1705: -- Attached patch for addressing comment. Please review.

[jira] [Created] (YARN-1852) Application recovery throws InvalidStateTransitonException for FAILED and KILLED jobs

2014-03-19 Thread Rohith (JIRA)
Rohith created YARN-1852: Summary: Application recovery throws InvalidStateTransitonException for FAILED and KILLED jobs Key: YARN-1852 URL: https://issues.apache.org/jira/browse/YARN-1852 Project: Hadoop

[jira] [Commented] (YARN-1852) Application recovery throws InvalidStateTransitonException for FAILED and KILLED jobs

2014-03-19 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13940394#comment-13940394 ] Rohith commented on YARN-1852: -- Here is the exception stack trace.. For Killed application

[jira] [Assigned] (YARN-1854) TestRMHA#testStartAndTransitions Fails

2014-03-19 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith reassigned YARN-1854: Assignee: Rohith TestRMHA#testStartAndTransitions Fails --

[jira] [Commented] (YARN-1854) TestRMHA#testStartAndTransitions Fails

2014-03-19 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13941364#comment-13941364 ] Rohith commented on YARN-1854: -- I will look into Test Case Failure.

[jira] [Updated] (YARN-1854) TestRMHA#testStartAndTransitions Fails

2014-03-20 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith updated YARN-1854: - Attachment: YARN-1854.patch TestRMHA#testStartAndTransitions Fails --

[jira] [Commented] (YARN-1854) TestRMHA#testStartAndTransitions Fails

2014-03-20 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13941462#comment-13941462 ] Rohith commented on YARN-1854: -- I ran multiple times in Linux and windows.I didnt fint any

[jira] [Updated] (YARN-1852) Application recovery throws InvalidStateTransitonException for FAILED and KILLED jobs

2014-03-20 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith updated YARN-1852: - Attachment: YARN-1852.patch +1 Jian, Attaching patch for handling KILLED/FAILED applications during recovery. I

[jira] [Commented] (YARN-1198) Capacity Scheduler headroom calculation does not work as expected

2014-03-20 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13941797#comment-13941797 ] Rohith commented on YARN-1198: -- Does this Jira handles scenario mentioned in YARN-1680 for

[jira] [Commented] (YARN-1198) Capacity Scheduler headroom calculation does not work as expected

2014-03-20 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13942731#comment-13942731 ] Rohith commented on YARN-1198: -- bq. It's kind of related to New node is added/removed from the

[jira] [Commented] (YARN-1854) TestRMHA#testStartAndTransitions Fails

2014-03-20 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13942760#comment-13942760 ] Rohith commented on YARN-1854: -- Thank you [~vinodkv] for going through patch. I agree that

[jira] [Updated] (YARN-1854) TestRMHA#testStartAndTransitions Fails

2014-03-21 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith updated YARN-1854: - Attachment: YARN-1854.1.patch Attaching patch. Please review.. I changed verifyClusterMetrics for retrying 5

[jira] [Updated] (YARN-1852) Application recovery throws InvalidStateTransitonException for FAILED and KILLED jobs

2014-03-24 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith updated YARN-1852: - Attachment: YARN-1852.3patch bq. We may check against RMApp.recoveredFinalState state instead? Done Test is

[jira] [Updated] (YARN-1852) Application recovery throws InvalidStateTransitonException for FAILED and KILLED jobs

2014-03-24 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith updated YARN-1852: - Attachment: (was: YARN-1852.3patch) Application recovery throws InvalidStateTransitonException for FAILED

[jira] [Updated] (YARN-1852) Application recovery throws InvalidStateTransitonException for FAILED and KILLED jobs

2014-03-24 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith updated YARN-1852: - Attachment: YARN-1852.3.patch Application recovery throws InvalidStateTransitonException for FAILED and KILLED

[jira] [Commented] (YARN-1854) TestRMHA#testStartAndTransitions Fails

2014-03-25 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13946487#comment-13946487 ] Rohith commented on YARN-1854: -- [~mitdesai], I checked attached logs for while. It is very

[jira] [Updated] (YARN-1854) Race condition in TestRMHA#testStartAndTransitions

2014-03-25 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith updated YARN-1854: - Description: There is race in test. TestRMHA#testStartAndTransitions calls verifyClusterMetrics() immediately

[jira] [Commented] (YARN-1854) Race condition in TestRMHA#testStartAndTransitions

2014-03-25 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13947510#comment-13947510 ] Rohith commented on YARN-1854: -- bq. Rohith : The logs that I have submitted already has the

[jira] [Commented] (YARN-1885) yarn logs command does not provide the application logs for some applications

2014-03-27 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13950342#comment-13950342 ] Rohith commented on YARN-1885: -- [~arpitgupta] can you please describe more on the issue. 1.

[jira] [Updated] (YARN-1703) Too many connections are opened for proxy server when applicationMaster UI is accessed.

2014-03-28 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1703?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith updated YARN-1703: - Summary: Too many connections are opened for proxy server when applicationMaster UI is accessed. (was: There

[jira] [Updated] (YARN-1703) Too many connections are opened for proxy server when applicationMaster UI is accessed.

2014-03-28 Thread Rohith (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1703?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohith updated YARN-1703: - Priority: Critical (was: Major) Too many connections are opened for proxy server when applicationMaster UI is

  1   2   3   4   5   6   7   8   9   10   >