[jira] [Commented] (YARN-2096) Race in TestRMRestart#testQueueMetricsOnRMRestart

2014-05-27 Thread Tsuyoshi OZAWA (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-2096?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14009807#comment-14009807
 ] 

Tsuyoshi OZAWA commented on YARN-2096:
--

Good news: TestRMRestart with Anubhav's patch works well. After running the 
tests hundreds of times, I saw no failures. Good job :-)

 Race in TestRMRestart#testQueueMetricsOnRMRestart
 -

 Key: YARN-2096
 URL: https://issues.apache.org/jira/browse/YARN-2096
 Project: Hadoop YARN
  Issue Type: Bug
Reporter: Anubhav Dhoot
Assignee: Anubhav Dhoot
 Fix For: 2.5.0

 Attachments: YARN-2096.patch


 org.apache.hadoop.yarn.server.resourcemanager.TestRMRestart.testQueueMetricsOnRMRestart
  fails randomly because of a race condition.
 The test validates that metrics are incremented, but does not wait for all 
 transitions to finish before checking for the values.
 It also resets metrics after kicking off recovery of the second RM. The 
 metrics that need to be incremented race with this reset, causing the test 
 to fail randomly.
 We need to wait for the right transitions.
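
To illustrate the ordering problem described above, here is a minimal, self-contained Java sketch. It is not the code from YARN-2096.patch: the class `WaitForMetricsSketch`, the helper `waitFor`, and the counter `appsSubmitted` are hypothetical stand-ins for the test, its polling logic, and a queue metric. It shows two ideas, assuming the asynchronous recovery can be modeled as a background thread: reset the counter before starting the asynchronous work, and poll until the expected value appears instead of asserting immediately.

```java
import java.util.concurrent.atomic.AtomicInteger;
import java.util.function.BooleanSupplier;

public class WaitForMetricsSketch {

    /** Polls until the condition holds or the timeout elapses; returns whether it held. */
    public static boolean waitFor(BooleanSupplier condition, long timeoutMs, long intervalMs)
            throws InterruptedException {
        long start = System.nanoTime();
        long timeoutNanos = timeoutMs * 1_000_000L;
        while (!condition.getAsBoolean()) {
            if (System.nanoTime() - start >= timeoutNanos) {
                return false;
            }
            Thread.sleep(intervalMs);
        }
        return true;
    }

    /** Resets the counter, starts the simulated recovery, waits, and returns the counter. */
    public static int runScenario() throws InterruptedException {
        // Hypothetical stand-in for a queue metric; starts with a stale value.
        AtomicInteger appsSubmitted = new AtomicInteger(7);

        // Reset BEFORE starting the asynchronous work, so the reset
        // cannot overwrite an increment made by the background thread.
        appsSubmitted.set(0);

        Thread recovery = new Thread(() -> {
            try {
                Thread.sleep(50); // simulated delay of a state transition
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
                return;
            }
            appsSubmitted.incrementAndGet();
        });
        recovery.start();

        // Wait for the transition to take effect instead of asserting right away.
        if (!waitFor(() -> appsSubmitted.get() == 1, 5000, 10)) {
            throw new AssertionError("metric was not updated within the timeout");
        }
        recovery.join();
        return appsSubmitted.get();
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println("appsSubmitted=" + runScenario()); // prints "appsSubmitted=1"
    }
}
```

If the reset were moved after `recovery.start()`, the increment could land before the reset and be lost, which is the kind of intermittent failure the description reports.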



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (YARN-2096) Race in TestRMRestart#testQueueMetricsOnRMRestart

2014-05-26 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-2096?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14008759#comment-14008759
 ] 

Hudson commented on YARN-2096:
--

FAILURE: Integrated in Hadoop-Yarn-trunk #564 (See 
[https://builds.apache.org/job/Hadoop-Yarn-trunk/564/])
YARN-2096. Race in TestRMRestart#testQueueMetricsOnRMRestart. (Anubhav Dhoot 
via kasha) (kasha: 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1597223)
* /hadoop/common/trunk/hadoop-yarn-project/CHANGES.txt
* /hadoop/common/trunk/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/TestRMRestart.java


[jira] [Commented] (YARN-2096) Race in TestRMRestart#testQueueMetricsOnRMRestart

2014-05-26 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-2096?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14008916#comment-14008916
 ] 

Hudson commented on YARN-2096:
--

FAILURE: Integrated in Hadoop-Mapreduce-trunk #1782 (See 
[https://builds.apache.org/job/Hadoop-Mapreduce-trunk/1782/])
YARN-2096. Race in TestRMRestart#testQueueMetricsOnRMRestart. (Anubhav Dhoot 
via kasha) (kasha: 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1597223)
* /hadoop/common/trunk/hadoop-yarn-project/CHANGES.txt
* /hadoop/common/trunk/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/TestRMRestart.java


[jira] [Commented] (YARN-2096) Race in TestRMRestart#testQueueMetricsOnRMRestart

2014-05-25 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-2096?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14008410#comment-14008410
 ] 

Hudson commented on YARN-2096:
--

FAILURE: Integrated in Hadoop-trunk-Commit #5609 (See 
[https://builds.apache.org/job/Hadoop-trunk-Commit/5609/])
YARN-2096. Race in TestRMRestart#testQueueMetricsOnRMRestart. (Anubhav Dhoot 
via kasha) (kasha: 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1597223)
* /hadoop/common/trunk/hadoop-yarn-project/CHANGES.txt
* /hadoop/common/trunk/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/TestRMRestart.java


[jira] [Commented] (YARN-2096) Race in TestRMRestart#testQueueMetricsOnRMRestart

2014-05-23 Thread Tsuyoshi OZAWA (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-2096?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14007319#comment-14007319
 ] 

Tsuyoshi OZAWA commented on YARN-2096:
--

Thank you for taking this JIRA, Anubhav. I also hit this problem when 
reviewing YARN-1365. I'll run the tests repeatedly with your patch.


[jira] [Commented] (YARN-2096) Race in TestRMRestart#testQueueMetricsOnRMRestart

2014-05-23 Thread Karthik Kambatla (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-2096?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14007338#comment-14007338
 ] 

Karthik Kambatla commented on YARN-2096:


Looks good to me. +1. 


[jira] [Commented] (YARN-2096) Race in TestRMRestart#testQueueMetricsOnRMRestart

2014-05-23 Thread Tsuyoshi OZAWA (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-2096?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14007363#comment-14007363
 ] 

Tsuyoshi OZAWA commented on YARN-2096:
--

The change looks good to me too (non-binding).


[jira] [Commented] (YARN-2096) Race in TestRMRestart#testQueueMetricsOnRMRestart

2014-05-23 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-2096?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14007419#comment-14007419
 ] 

Hadoop QA commented on YARN-2096:
-

+1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12646464/YARN-2096.patch
  against trunk revision .

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 1 new or modified test files.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 javadoc.  There were no new javadoc warning messages.

+1 eclipse:eclipse.  The patch built with eclipse:eclipse.

+1 findbugs.  The patch does not introduce any new Findbugs (version 1.3.9) 
warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

+1 core tests.  The patch passed unit tests in 
hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager.

+1 contrib tests.  The patch passed contrib unit tests.

Test results: https://builds.apache.org/job/PreCommit-YARN-Build/3795//testReport/
Console output: https://builds.apache.org/job/PreCommit-YARN-Build/3795//console

This message is automatically generated.


[jira] [Commented] (YARN-2096) Race in TestRMRestart#testQueueMetricsOnRMRestart

2014-05-23 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-2096?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14007876#comment-14007876
 ] 

Hadoop QA commented on YARN-2096:
-

-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12646464/YARN-2096.patch
  against trunk revision .

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 1 new or modified test files.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 javadoc.  There were no new javadoc warning messages.

+1 eclipse:eclipse.  The patch built with eclipse:eclipse.

+1 findbugs.  The patch does not introduce any new Findbugs (version 1.3.9) 
warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

-1 core tests.  The patch failed these unit tests in 
hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager:

  org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.TestFairScheduler

+1 contrib tests.  The patch passed contrib unit tests.

Test results: https://builds.apache.org/job/PreCommit-YARN-Build/3814//testReport/
Console output: https://builds.apache.org/job/PreCommit-YARN-Build/3814//console

This message is automatically generated.
