[ 
https://issues.apache.org/jira/browse/SOLR-12200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16439895#comment-16439895
 ] 

Mikhail Khludnev edited comment on SOLR-12200 at 4/16/18 7:33 PM:
------------------------------------------------------------------

[~varunthacker], I suppose so, but hardly able to prove it.
Meanwhile beasting reproduced the stackoverflow, it seems like 
{quote}
 "chunk": "947195 INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[376DCF4DA82652A6]) [    ] 
o.a.s.c.OverseerElectionContext I am going to be the leader 
127.0.0.1:64482_solr%0A"

 "chunk": "947196 ERROR 
(TEST-OverseerTest.testOverseerStatsReset-seed#[376DCF4DA82652A6]) [    ] 
o.a.s.c.LeaderElector node 
exists%0Aorg.apache.zookeeper.KeeperException$NodeExistsException: 
KeeperErrorCode = NodeExists for /overseer_elect/leader%0A%09at 
org.apache.zookeeper.KeeperException.create(KeeperException.java:122) 
~[zookeeper-3.4.11.jar:3.4.11-37e277162d567b55a07d1755f0b31c32e93c01a0]%0A%09at 
org.apache.zookeeper.KeeperException.create(KeeperException.java:54) 
~[zookeeper-3.4.11.jar:3.4.11-37e277162d567b55a07d1755f0b31c32e93c01a0]%0A%09at 
org.apache.zookeeper.ZooKeeper.create(ZooKeeper.java:786) 
~[zookeeper-3.4.11.jar:3.4.11-37e277162d567b55a07d1755f0b31c32e93c01a0]%0A%09at 
org.apache.solr.common.cloud.SolrZkClient.lambda$makePath$8(SolrZkClient.java:496)
 ~[java/:?]%0A%09at 
org.apache.solr.common.cloud.ZkCmdExecutor.retryOperation(ZkCmdExecutor.java:60)
 ~[java/:?]%0A%09at 
org.apache.solr.common.cloud.SolrZkClient.makePath(SolrZkClient.java:495) 
~[java/:?]%0A%09at 
org.apache.solr.common.cloud.SolrZkClient.makePath(SolrZkClient.java:438) 
~[java/:?]%0A%09at 
org.apache.solr.common.cloud.SolrZkClient.makePath(SolrZkClient.java:425) 
~[java/:?]%0A%09at 
org.apache.solr.cloud.OverseerElectionContext.runLeaderProcess(ElectionContext.java:840)
 ~[java/:?]%0A%09at 
org.apache.solr.cloud.LeaderElector.runIamLeaderProcess(LeaderElector.java:170) 
~[java/:?]at 
org.apache.solr.cloud.LeaderElector.checkIfIamLeader(LeaderElector.java:138) 
[java/:?]%0A%09at 
org.apache.solr.cloud.LeaderElector.joinElection(LeaderElector.java:307) 
[java/:?]%0A%09at 
org.apache.solr.cloud.LeaderElector.retryElection(LeaderElector.java:393) 
[java/:?]%0A%09at 
org.apache.solr.cloud.LeaderElector.checkIfIamLeader(LeaderElector.java:138) 
[java/:?]%0A%09at 
org.apache.solr.cloud.LeaderElector.joinElection(LeaderElector.java:307) 
[java/:?]%0A%09at 
org.apache.solr.cloud.LeaderElector.joinElection(LeaderElector.java:216) 
[java/:?]%0A%09at 
org.apache.solr.cloud.OverseerTest.testOverseerStatsReset(OverseerTest.java:712)
 
{quote}

This stack tends to grow until it hits the stack boundary. I don't know how 
it's related to the patch. 
Ok. at least I reproduced it on patched code with 

bq. ant beast -Dbeast.iters=100 -Dtests.dups=10  
-Dtests.class=org.apache.solr.cloud.OverseerTest 
-Dtests.method=testOverseerStatsReset -Dtests.badapples=false 
-Dtests.ifNoTests=ignore



was (Author: mkhludnev):
[~varunthacker], I suppose so, but hardly able to prove it.
Meanwhile beasting reproduced the stackoverflow, it seems like 
{quote}
 "chunk": "947195 INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[376DCF4DA82652A6]) [    ] 
o.a.s.c.OverseerElectionContext I am going to be the leader 
127.0.0.1:64482_solr%0A"

 "chunk": "947196 ERROR 
(TEST-OverseerTest.testOverseerStatsReset-seed#[376DCF4DA82652A6]) [    ] 
o.a.s.c.LeaderElector node 
exists%0Aorg.apache.zookeeper.KeeperException$NodeExistsException: 
KeeperErrorCode = NodeExists for /overseer_elect/leader%0A%09at 
org.apache.zookeeper.KeeperException.create(KeeperException.java:122) 
~[zookeeper-3.4.11.jar:3.4.11-37e277162d567b55a07d1755f0b31c32e93c01a0]%0A%09at 
org.apache.zookeeper.KeeperException.create(KeeperException.java:54) 
~[zookeeper-3.4.11.jar:3.4.11-37e277162d567b55a07d1755f0b31c32e93c01a0]%0A%09at 
org.apache.zookeeper.ZooKeeper.create(ZooKeeper.java:786) 
~[zookeeper-3.4.11.jar:3.4.11-37e277162d567b55a07d1755f0b31c32e93c01a0]%0A%09at 
org.apache.solr.common.cloud.SolrZkClient.lambda$makePath$8(SolrZkClient.java:496)
 ~[java/:?]%0A%09at 
org.apache.solr.common.cloud.ZkCmdExecutor.retryOperation(ZkCmdExecutor.java:60)
 ~[java/:?]%0A%09at 
org.apache.solr.common.cloud.SolrZkClient.makePath(SolrZkClient.java:495) 
~[java/:?]%0A%09at 
org.apache.solr.common.cloud.SolrZkClient.makePath(SolrZkClient.java:438) 
~[java/:?]%0A%09at 
org.apache.solr.common.cloud.SolrZkClient.makePath(SolrZkClient.java:425) 
~[java/:?]%0A%09at 
org.apache.solr.cloud.OverseerElectionContext.runLeaderProcess(ElectionContext.java:840)
 ~[java/:?]%0A%09at 
org.apache.solr.cloud.LeaderElector.runIamLeaderProcess(LeaderElector.java:170) 
~[java/:?]at 
org.apache.solr.cloud.LeaderElector.checkIfIamLeader(LeaderElector.java:138) 
[java/:?]%0A%09at 
org.apache.solr.cloud.LeaderElector.joinElection(LeaderElector.java:307) 
[java/:?]%0A%09at 
org.apache.solr.cloud.LeaderElector.retryElection(LeaderElector.java:393) 
[java/:?]%0A%09at 
org.apache.solr.cloud.LeaderElector.checkIfIamLeader(LeaderElector.java:138) 
[java/:?]%0A%09at 
org.apache.solr.cloud.LeaderElector.joinElection(LeaderElector.java:307) 
[java/:?]%0A%09at 
org.apache.solr.cloud.LeaderElector.joinElection(LeaderElector.java:216) 
[java/:?]%0A%09at 
org.apache.solr.cloud.OverseerTest.testOverseerStatsReset(OverseerTest.java:712)
 
{quote}

This stack tends to grow until it hits the stack boundary. I don't know how 
it's related to the patch. 

> ZkControllerTest failure. Leaking Overseer
> ------------------------------------------
>
>                 Key: SOLR-12200
>                 URL: https://issues.apache.org/jira/browse/SOLR-12200
>             Project: Solr
>          Issue Type: Bug
>      Security Level: Public(Default Security Level. Issues are Public) 
>          Components: SolrCloud
>            Reporter: Mikhail Khludnev
>            Priority: Major
>         Attachments: SOLR-12200.patch, SOLR-12200.patch, 
> patch-unit-solr_core.zip, tests-failures.txt, tests-failures.txt.gz, 
> zk.fail.txt.gz
>
>
> Failure seems suspiciously the same. 
>    [junit4]   2> 499919 INFO  
> (TEST-ZkControllerTest.testReadConfigName-seed#[BC856CC565039E77]) 
> [n:127.0.0.1:8983_solr    ] o.a.s.c.Overseer Overseer 
> (id=73578760132362243-127.0.0.1:8983_solr-n_0000000000) closing
>    [junit4]   2> 499920 INFO  
> (OverseerStateUpdate-73578760132362243-127.0.0.1:8983_solr-n_0000000000) [    
> ] o.a.s.c.Overseer Overseer Loop exiting : 127.0.0.1:8983_solr
>    [junit4]   2> 499920 ERROR 
> (OverseerCollectionConfigSetProcessor-73578760132362243-127.0.0.1:8983_solr-n_0000000000)
>  [    ] o.a.s.c.OverseerTaskProcessor Unable to prioritize overseer
>    [junit4]   2> java.lang.InterruptedException: null
>    [junit4]   2>        at java.lang.Object.wait(Native Method) ~[?:1.8.0_152]
>    [junit4]   2>        at java.lang.Object.wait(Object.java:502) 
> ~[?:1.8.0_152]
>    [junit4]   2>        at 
> org.apache.zookeeper.ClientCnxn.submitRequest(ClientCnxn.java:1409) 
> ~[zookeeper-3.4.11.jar:3.4
> then it spins in SessionExpiredException, all tests pass but suite fails due 
> to leaking Overseer. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to