[ 
https://issues.apache.org/jira/browse/YARN-7314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16203127#comment-16203127
 ] 

Sunil G edited comment on YARN-7314 at 10/13/17 6:53 AM:
---------------------------------------------------------

Adding more analysis from my end here.

I could see these test cases fails in trunk with out any patch. 
https://builds.apache.org/job/PreCommit-YARN-Build/17888/testReport/

When I ran these in my local machine on trunk i can see same errors as well. 
There are some scheduler state pblm in FS side. And some -ve values are also 
coming. [~templedf] [~yufeigu], could you please pool in to see why this is 
happening as test cases failures are in FS. cc/ [~rohithsharma] and 
[~leftnoteasy]

{code}
Running 
org.apache.hadoop.yarn.server.resourcemanager.TestApplicationMasterService
Tests run: 1, Failures: 1, Errors: 0, Skipped: 0, Time elapsed: 47.665 sec <<< 
FAILURE! - in 
org.apache.hadoop.yarn.server.resourcemanager.TestApplicationMasterService
testResourceTypes(org.apache.hadoop.yarn.server.resourcemanager.TestApplicationMasterService)
  Time elapsed: 47.522 sec  <<< FAILURE!
java.lang.AssertionError: Attempt state is not correct (timeout). 
expected:<ALLOCATED> but was:<SCHEDULED>
        at 
org.apache.hadoop.yarn.server.resourcemanager.TestApplicationMasterService.testResourceTypes(TestApplicationMasterService.java:520)

Running 
org.apache.hadoop.yarn.server.resourcemanager.TestWorkPreservingRMRestart
Tests run: 38, Failures: 2, Errors: 0, Skipped: 0, Time elapsed: 68.306 sec <<< 
FAILURE! - in 
org.apache.hadoop.yarn.server.resourcemanager.TestWorkPreservingRMRestart
testSchedulerRecovery[FAIR](org.apache.hadoop.yarn.server.resourcemanager.TestWorkPreservingRMRestart)
  Time elapsed: 0.434 sec  <<< FAILURE!
java.lang.AssertionError: expected:<<memory:6144, vCores:6>> but 
was:<<memory:6144, vCores:-3>>
        at 
org.apache.hadoop.yarn.server.resourcemanager.TestWorkPreservingRMRestart.checkFSQueue(TestWorkPreservingRMRestart.java:484)
        at 
org.apache.hadoop.yarn.server.resourcemanager.TestWorkPreservingRMRestart.testSchedulerRecovery(TestWorkPreservingRMRestart.java:243)

testDynamicQueueRecovery[FAIR](org.apache.hadoop.yarn.server.resourcemanager.TestWorkPreservingRMRestart)
  Time elapsed: 0.471 sec  <<< FAILURE!
java.lang.AssertionError: expected:<<memory:6144, vCores:6>> but 
was:<<memory:6144, vCores:-3>>
        at 
org.apache.hadoop.yarn.server.resourcemanager.TestWorkPreservingRMRestart.checkFSQueue(TestWorkPreservingRMRestart.java:484)
        at 
org.apache.hadoop.yarn.server.resourcemanager.TestWorkPreservingRMRestart.testDynamicQueueRecovery(TestWorkPreservingRMRestart.java:387)
{code}


was (Author: sunilg):
Adding more analysis from end here.

I could these test cases fails in trunk with out any patch. 
https://builds.apache.org/job/PreCommit-YARN-Build/17888/testReport/

When I ran these in my local machine on trunk i can see same errors as well. 
There are some scheduler state pblm in FS side. And some -ve values are also 
coming. [~templedf] [~yufeigu], could you please pool in to see why this is 
happening as test cases failures are in FS. cc/ [~rohithsharma] and 
[~leftnoteasy]

{code}
Running 
org.apache.hadoop.yarn.server.resourcemanager.TestApplicationMasterService
Tests run: 1, Failures: 1, Errors: 0, Skipped: 0, Time elapsed: 47.665 sec <<< 
FAILURE! - in 
org.apache.hadoop.yarn.server.resourcemanager.TestApplicationMasterService
testResourceTypes(org.apache.hadoop.yarn.server.resourcemanager.TestApplicationMasterService)
  Time elapsed: 47.522 sec  <<< FAILURE!
java.lang.AssertionError: Attempt state is not correct (timeout). 
expected:<ALLOCATED> but was:<SCHEDULED>
        at 
org.apache.hadoop.yarn.server.resourcemanager.TestApplicationMasterService.testResourceTypes(TestApplicationMasterService.java:520)

Running 
org.apache.hadoop.yarn.server.resourcemanager.TestWorkPreservingRMRestart
Tests run: 38, Failures: 2, Errors: 0, Skipped: 0, Time elapsed: 68.306 sec <<< 
FAILURE! - in 
org.apache.hadoop.yarn.server.resourcemanager.TestWorkPreservingRMRestart
testSchedulerRecovery[FAIR](org.apache.hadoop.yarn.server.resourcemanager.TestWorkPreservingRMRestart)
  Time elapsed: 0.434 sec  <<< FAILURE!
java.lang.AssertionError: expected:<<memory:6144, vCores:6>> but 
was:<<memory:6144, vCores:-3>>
        at 
org.apache.hadoop.yarn.server.resourcemanager.TestWorkPreservingRMRestart.checkFSQueue(TestWorkPreservingRMRestart.java:484)
        at 
org.apache.hadoop.yarn.server.resourcemanager.TestWorkPreservingRMRestart.testSchedulerRecovery(TestWorkPreservingRMRestart.java:243)

testDynamicQueueRecovery[FAIR](org.apache.hadoop.yarn.server.resourcemanager.TestWorkPreservingRMRestart)
  Time elapsed: 0.471 sec  <<< FAILURE!
java.lang.AssertionError: expected:<<memory:6144, vCores:6>> but 
was:<<memory:6144, vCores:-3>>
        at 
org.apache.hadoop.yarn.server.resourcemanager.TestWorkPreservingRMRestart.checkFSQueue(TestWorkPreservingRMRestart.java:484)
        at 
org.apache.hadoop.yarn.server.resourcemanager.TestWorkPreservingRMRestart.testDynamicQueueRecovery(TestWorkPreservingRMRestart.java:387)
{code}

> Multiple Resource manager test cases failing
> --------------------------------------------
>
>                 Key: YARN-7314
>                 URL: https://issues.apache.org/jira/browse/YARN-7314
>             Project: Hadoop YARN
>          Issue Type: Bug
>            Reporter: lovekesh bansal
>            Priority: Critical
>
> Following 14 unit tests are failing in the trunk:
>  
> org.apache.hadoop.yarn.server.resourcemanager.TestApplicationMasterService.testResourceTypes
>          
>  
> org.apache.hadoop.yarn.server.resourcemanager.TestWorkPreservingRMRestart.testSchedulerRecovery[FAIR]
>                 
>  
> org.apache.hadoop.yarn.server.resourcemanager.TestWorkPreservingRMRestart.testDynamicQueueRecovery[FAIR]
>              
>  
> org.apache.hadoop.yarn.server.resourcemanager.ahs.TestRMApplicationHistoryWriter.testRMWritingMassiveHistoryForFairSche
>               
>  
> org.apache.hadoop.yarn.server.resourcemanager.ahs.TestRMApplicationHistoryWriter.testRMWritingMassiveHistoryForCapacitySche
>           
>  
> org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.TestFSAppAttempt.testHeadroomWithBlackListedNodes
>                
>  
> org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.TestFairScheduler.testQueueMaxAMShare
>            
>  
> org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.TestFairScheduler.testMultipleCompletedEvent
>             
>  
> org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.TestFairScheduler.testRefreshQueuesWhenRMHA
>              
>  
> org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.TestFairScheduler.testQueueMaxAMShareWithContainerReservation
>            
>  
> org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.TestFairScheduler.testComputeMaxAMResource
>               
>  
> org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.TestFairScheduler.testQueueMaxAMShareDefault
>             
>  
> org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.TestFairScheduler.testReservationMetrics
>                 
>  
> org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.TestFairScheduler.testRequestAMResourceInZeroFairShareQueue
>              



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to