Created https://issues.apache.org/jira/browse/SOLR-17420

Thanks,
Jerry

________________________________
From: Jerry Chung
Sent: Tuesday, August 20, 2024 3:47 PM
To: dev@solr.apache.org <dev@solr.apache.org>
Subject: Solr does not load some cores occasionally on startup

Hi Solr Devs,

We've noticed this issue quite a bit. If it happens, the replica is marked as 
down. Workaround seems to be restarting Solr service, but this is quite random 
and it might not be feasible.

Today I noticed that it seemed to be hanging while loading the replica. When 
the service stopped, these messages were logged.

2024-08-20 18:27:15.827 INFO  
(coreLoadExecutor-17-thread-1-processing-ip-100-65-231-167.ec2.internal:8983_solr)
 [c:1_80084c8562132c
47_2d076556_1914ca95bf9__8000I18454740_a2f8_5f48_a1d7_9ecfea41540d s:shard1 
r:core_node18 x:1_80084c8562132c47_2d076556_1914ca95bf9_
_8000I18454740_a2f8_5f48_a1d7_9ecfea41540d_shard1_replica_n17] o.a.s.c.SolrCore 
Interrupted waiting for searcherLock => java.lang.In
terruptedException
        at java.base/java.lang.Object.wait(Native Method)
java.lang.InterruptedException: null
        at java.lang.Object.wait(Native Method) ~[?:?]
        at java.lang.Object.wait(Object.java:338) ~[?:?]
        at org.apache.solr.core.SolrCore.getSearcher(SolrCore.java:2538) 
~[solr-core-9.4.0.jar:9.4.0 71e101bb37497f730078d9afe1991b6
0d10bfe96 - stillalex - 2023-10-10 19:10:39]
        at org.apache.solr.core.SolrCore.initSearcher(SolrCore.java:1290) 
~[solr-core-9.4.0.jar:9.4.0 71e101bb37497f730078d9afe1991b
60d10bfe96 - stillalex - 2023-10-10 19:10:39]
        at org.apache.solr.core.SolrCore.<init>(SolrCore.java:1175) 
~[solr-core-9.4.0.jar:9.4.0 71e101bb37497f730078d9afe1991b60d10b
fe96 - stillalex - 2023-10-10 19:10:39]
        at org.apache.solr.core.SolrCore.<init>(SolrCore.java:1056) 
~[solr-core-9.4.0.jar:9.4.0 71e101bb37497f730078d9afe1991b60d10b
fe96 - stillalex - 2023-10-10 19:10:39]
        at 
org.apache.solr.core.CoreContainer.createFromDescriptor(CoreContainer.java:1705)
 ~[solr-core-9.4.0.jar:9.4.0 71e101bb3749
7f730078d9afe1991b60d10bfe96 - stillalex - 2023-10-10 19:10:39]
        at 
org.apache.solr.core.CoreContainer.lambda$loadInternal$12(CoreContainer.java:1043)
 ~[solr-core-9.4.0.jar:9.4.0 71e101bb37
497f730078d9afe1991b60d10bfe96 - stillalex - 2023-10-10 19:10:39]
        at 
com.codahale.metrics.InstrumentedExecutorService$InstrumentedCallable.call(InstrumentedExecutorService.java:234)
 ~[metric
s-core-4.2.20.jar:4.2.20]
        at java.util.concurrent.FutureTask.run(FutureTask.java:264) ~[?:?]

There were tons of threads waiting for the lock as well:

2024-08-20 18:28:16.012 INFO  (qtp1768242710-971) [] o.a.s.c.SolrCore 
Interrupted waiting for searcherLock => java.lang.InterruptedException
        at java.base/java.lang.Object.wait(Native Method)
java.lang.InterruptedException: null
        at java.lang.Object.wait(Native Method) ~[?:?]
        at java.lang.Object.wait(Object.java:338) ~[?:?]
        at org.apache.solr.core.SolrCore.getSearcher(SolrCore.java:2538) 
~[solr-core-9.4.0.jar:9.4.0 71e101bb37497f730078d9afe1991b60d10bfe96 - 
stillalex - 2023-10-10 19:10:39]
        at org.apache.solr.core.SolrCore.getSearcher(SolrCore.java:2281) 
~[solr-core-9.4.0.jar:9.4.0 71e101bb37497f730078d9afe1991b60d10bfe96 - 
stillalex - 2023-10-10 19:10:39]
        at org.apache.solr.core.SolrCore.getSearcher(SolrCore.java:2116) 
~[solr-core-9.4.0.jar:9.4.0 71e101bb37497f730078d9afe1991b60d10bfe96 - 
stillalex - 2023-10-10 19:10:39]
        at org.apache.solr.core.SolrCore.withSearcher(SolrCore.java:2134) 
~[solr-core-9.4.0.jar:9.4.0 71e101bb37497f730078d9afe1991b60d10bfe96 - 
stillalex - 2023-10-10 19:10:39]
        at org.apache.solr.core.SolrCore.getSegmentCount(SolrCore.java:539) 
~[solr-core-9.4.0.jar:9.4.0 71e101bb37497f730078d9afe1991b60d10bfe96 - 
stillalex - 2023-10-10 19:10:39]
        at 
org.apache.solr.core.SolrCore.lambda$initializeMetrics$11(SolrCore.java:1360) 
~[solr-core-9.4.0.jar:9.4.0 71e101bb37497f730078d9afe1991b60d10bfe96 - 
stillalex - 2023-10-10 19:10:39]
        at 
org.apache.solr.util.stats.MetricUtils.convertGauge(MetricUtils.java:656) 
~[solr-core-9.4.0.jar:9.4.0 71e101bb37497f730078d9afe1991b60d10bfe96 - 
stillalex - 2023-10-10 19:10:39]
        at 
org.apache.solr.util.stats.MetricUtils.convertMetric(MetricUtils.java:355) 
~[solr-core-9.4.0.jar:9.4.0 71e101bb37497f730078d9afe1991b60d10bfe96 - 
stillalex - 2023-10-10 19:10:39]

This looks like a serious bug to me. How do you think of this? Can I log a 
defect for this?

We are using Solr 9.4 as you can see. If further information is needed, please 
let me know.

Thanks,
Jerry Chung

Reply via email to