[ 
https://issues.apache.org/jira/browse/SOLR-12200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16437848#comment-16437848
 ] 

Mikhail Khludnev commented on SOLR-12200:
-----------------------------------------

Continuing to adding more debug and observing leak failures. Here is how one 
test is finishing   
{quote}
  2> 70559 DEBUG (TEST-ZkControllerTest.testGetHostName-seed#[21CB3E792F7FAB5]) 
[n:127.0.0.1:8983_solr    ] o.a.s.c.a.ScheduledTriggers Shutting down action 
executor now
  2> 70592 WARN  (OverseerExitThread) [    ] o.a.s.c.Overseer I'm exiting, but 
I'm still the leader
..
  2> 70594 WARN  
(OverseerAutoScalingTriggerThread-72100555224645634-127.0.0.1:8983_solr-n_0000000000)
 [    ] o.a.s.c.a.OverseerTriggerThread OverseerTriggerThread has been closed, 
exiting.
  2> 70594 INFO  (TEST-ZkControllerTest.testGetHostName-seed#[21CB3E792F7FAB5]) 
[n:127.0.0.1:8983_solr    ] o.a.s.c.u.ObjectReleaseTracker releasing 
Overseer@1511021387 id=72100555224645634-127.0.0.1:8983_solr-n_0000000000 
closed=true
.. 
2> 70596 INFO  (OverseerExitThread) [    ] o.a.s.c.Overseer 
org.apache.solr.cloud.Overseer$ClusterStateUpdater@72cdd73e is *NOT* shutting 
down, Then it needs to rejoin election
..
  2> 70602 INFO  (OverseerExitThread) [    ] o.a.s.c.Overseer turning 
Overseer@1511021387 id=72100555224645634-127.0.0.1:8983_solr-n_0000000000 
closed=true to Overseer@1511021387 
id=72100555224645634-127.0.0.1:8983_solr-n_0000000001 closed=true
  2> 70602 INFO  (OverseerExitThread) [    ] o.a.s.c.Overseer 
Overseer@1511021387 id=72100555224645634-127.0.0.1:8983_solr-n_0000000001 
closed=false is starting
  2> 70612 INFO  (OverseerExitThread) [    ] o.a.s.c.Overseer tracking 
Overseer@1511021387 id=72100555224645634-127.0.0.1:8983_solr-n_0000000001 
closed=false
// *leak*
   2> 70612 INFO  (OverseerExitThread) [    ] o.a.s.c.u.ObjectReleaseTracker 
tracking Overseer@1511021387 
id=72100555224645634-127.0.0.1:8983_solr-n_0000000001 
closed=false=>org.apache.solr.common.util.ObjectReleaseTracker$ObjectTrackerException:
 org.apache.solr.cloud.Overseer
  2>    at 
org.apache.solr.common.util.ObjectReleaseTracker.track(ObjectReleaseTracker.java:41)
  2>    at org.apache.solr.cloud.Overseer.start(Overseer.java:548)
  2>    at 
org.apache.solr.cloud.OverseerElectionContext.runLeaderProcess(ElectionContext.java:851)
  2>    at 
org.apache.solr.cloud.LeaderElector.runIamLeaderProcess(LeaderElector.java:170)
  2>    at 
org.apache.solr.cloud.LeaderElector.checkIfIamLeader(LeaderElector.java:135)
  2>    at 
org.apache.solr.cloud.LeaderElector.joinElection(LeaderElector.java:307)
  2>    at 
org.apache.solr.cloud.LeaderElector.retryElection(LeaderElector.java:393)
  2>    at 
org.apache.solr.cloud.ZkController.rejoinOverseerElection(ZkController.java:2069)
  2>    at 
org.apache.solr.cloud.Overseer$ClusterStateUpdater.checkIfIamStillLeader(Overseer.java:331)
  2>    at java.lang.Thread.run(Thread.java:745)

  2> 70616 INFO  (TEST-ZkControllerTest.testGetHostName-seed#[21CB3E792F7FAB5]) 
[n:127.0.0.1:8983_solr    ] o.a.s.c.CoreContainer Shutting down CoreContainer 
instance=1058782550
{quote}
The most suspicious thing is that  *is NOT shutting down, Then it needs to 
rejoin election* while test is definitely is shutting down. 

> ZkControllerTest failure. Leaking Overseer
> ------------------------------------------
>
>                 Key: SOLR-12200
>                 URL: https://issues.apache.org/jira/browse/SOLR-12200
>             Project: Solr
>          Issue Type: Bug
>      Security Level: Public(Default Security Level. Issues are Public) 
>          Components: SolrCloud
>            Reporter: Mikhail Khludnev
>            Priority: Major
>         Attachments: SOLR-12200.patch, tests-failures.txt, 
> tests-failures.txt.gz, zk.fail.txt.gz
>
>
> Failure seems suspiciously the same. 
>    [junit4]   2> 499919 INFO  
> (TEST-ZkControllerTest.testReadConfigName-seed#[BC856CC565039E77]) 
> [n:127.0.0.1:8983_solr    ] o.a.s.c.Overseer Overseer 
> (id=73578760132362243-127.0.0.1:8983_solr-n_0000000000) closing
>    [junit4]   2> 499920 INFO  
> (OverseerStateUpdate-73578760132362243-127.0.0.1:8983_solr-n_0000000000) [    
> ] o.a.s.c.Overseer Overseer Loop exiting : 127.0.0.1:8983_solr
>    [junit4]   2> 499920 ERROR 
> (OverseerCollectionConfigSetProcessor-73578760132362243-127.0.0.1:8983_solr-n_0000000000)
>  [    ] o.a.s.c.OverseerTaskProcessor Unable to prioritize overseer
>    [junit4]   2> java.lang.InterruptedException: null
>    [junit4]   2>        at java.lang.Object.wait(Native Method) ~[?:1.8.0_152]
>    [junit4]   2>        at java.lang.Object.wait(Object.java:502) 
> ~[?:1.8.0_152]
>    [junit4]   2>        at 
> org.apache.zookeeper.ClientCnxn.submitRequest(ClientCnxn.java:1409) 
> ~[zookeeper-3.4.11.jar:3.4
> then it spins in SessionExpiredException, all tests pass but suite fails due 
> to leaking Overseer. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to