[ 
https://issues.apache.org/jira/browse/SOLR-7736?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16421394#comment-16421394
 ] 

Mikhail Khludnev commented on SOLR-7736:
----------------------------------------

the problem looks as follows: we have OverseerAutoScalingTriggerThread 
[spining|https://github.com/apache/lucene-solr/blob/master/solr/core/src/java/org/apache/solr/cloud/autoscaling/OverseerTriggerThread.java#L142]
 in the same error 
{code}
   [junit4]   2> 2650914 ERROR 
(OverseerAutoScalingTriggerThread-72373776421355522-127.0.0.1:8983_solr-n_0000000001)
 [    ] o.a.s.c.a.OverseerTriggerThread A ZK error has occurred
   [junit4]   2> java.io.IOException: 
org.apache.zookeeper.KeeperException$SessionExpiredException: KeeperErrorCode = 
Session expired for /autoscaling.json
   [junit4]   2>     at 
org.apache.solr.client.solrj.impl.ZkDistribStateManager.getAutoScalingConfig(ZkDistribStateManager.java:183)
 ~[java/:?]
   [junit4]   2>     at 
org.apache.solr.client.solrj.cloud.DistribStateManager.getAutoScalingConfig(DistribStateManager.java:83)
 ~[java/:?]
   [junit4]   2>     at 
org.apache.solr.cloud.autoscaling.OverseerTriggerThread.run(OverseerTriggerThread.java:127)
 [java/:?]
   [junit4]   2>     at java.lang.Thread.run(Thread.java:748) [?:1.8.0_152]
   [junit4]   2> Caused by: 
org.apache.zookeeper.KeeperException$SessionExpiredException: KeeperErrorCode = 
Session expired for /autoscaling.json
   [junit4]   2>     at 
org.apache.zookeeper.KeeperException.create(KeeperException.java:130) 
~[zookeeper-3.4.11.jar:3.4.11-37e277162d567b55a07d1755f0b31c32e93c01a0]
   [junit4]   2>     at 
org.apache.zookeeper.KeeperException.create(KeeperException.java:54) 
~[zookeeper-3.4.11.jar:3.4.11-37e277162d567b55a07d1755f0b31c32e93c01a0]
   [junit4]   2>     at 
org.apache.zookeeper.ZooKeeper.getData(ZooKeeper.java:1215) 
~[zookeeper-3.4.11.jar:3.4.11-37e277162d567b55a07d1755f0b31c32e93c01a0]
   [junit4]   2>     at 
org.apache.solr.common.cloud.SolrZkClient.lambda$getData$5(SolrZkClient.java:340)
 ~[java/:?]
   [junit4]   2>     at 
org.apache.solr.common.cloud.ZkCmdExecutor.retryOperation(ZkCmdExecutor.java:60)
 ~[java/:?]
   [junit4]   2>     at 
org.apache.solr.common.cloud.SolrZkClient.getData(SolrZkClient.java:340) 
~[java/:?]
   [junit4]   2>     at 
org.apache.solr.client.solrj.impl.ZkDistribStateManager.getAutoScalingConfig(ZkDistribStateManager.java:176)
 ~[java/:?]
   [junit4]   2>     ... 3 more
{code}
That Zk session expiration is a separate question. As I remember in some places 
a session is restored, there are a lot of methods where SessionExpiredException 
fallen through, and I don't fully understand why. There a other questions about 
root case. I want to tackle just interruption problem so far. It seems like 
this spin can not be interrupted by the thread lingering.  
    

> Add a test for ZkController.publishAndWaitForDownStates
> -------------------------------------------------------
>
>                 Key: SOLR-7736
>                 URL: https://issues.apache.org/jira/browse/SOLR-7736
>             Project: Solr
>          Issue Type: Test
>          Components: SolrCloud, Tests
>            Reporter: Shalin Shekhar Mangar
>            Assignee: Shalin Shekhar Mangar
>            Priority: Minor
>             Fix For: 7.4, master (8.0)
>
>         Attachments: SOLR-7736.patch, consoleFull-2462-ZkControllerTest.txt.gz
>
>
> Add a test for ZkController.publishAndWaitForDownStates so that bugs like 
> SOLR-6665 do not occur again. A test exists but it is not correct and 
> currently disabled via AwaitsFix.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to