[jira] [Commented] (SOLR-14504) ZkController LiveNodesListener has NullPointerException in startup race
[ https://issues.apache.org/jira/browse/SOLR-14504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17176291#comment-17176291 ]

Colvin Cowie commented on SOLR-14504:
-------------------------------------

[~ab] this did make it into 8.6, right? But it's not been closed/

> ZkController LiveNodesListener has NullPointerException in startup race
> -----------------------------------------------------------------------
>
> Key: SOLR-14504
> URL: https://issues.apache.org/jira/browse/SOLR-14504
> Project: Solr
> Issue Type: Bug
> Affects Versions: 7.7, 7.7.1, 7.7.2, 8.0, 8.1, 8.2, 7.7.3, 8.1.1, 8.3, 8.4, 8.3.1, 8.5, 8.4.1, 8.5.1
> Reporter: Colvin Cowie
> Priority: Minor
> Attachments: SOLR-14504.patch
>
>
> If a NODELOST event happens before the cloudManager is initialized then a NullPointerException will occur on this line
> [https://github.com/apache/lucene-solr/blob/c18666ad05afc02979c150aacd4810cff02e43f3/solr/core/src/java/org/apache/solr/cloud/ZkController.java#L1020]
> {code:java}
> byte[] json = Utils.toJSON(Collections.singletonMap("timestamp", cloudManager.getTimeSource().getEpochTimeNs()));
> {code}
> Rather than accessing cloudManager directly, getSolrCloudManager() should be called.
>
> This happens very rarely, but if it happens it will stop Solr starting, resulting in "CoreContainer is either not initialized or shutting down". Snippet from 8.3.1
> {noformat}
> 2020-05-19 03:44:40.241 INFO (main) [ ] o.a.s.c.c.ConnectionManager Waiting for client to connect to ZooKeeper
> 2020-05-19 03:44:40.245 INFO (zkConnectionManagerCallback-11-thread-1) [ ] o.a.s.c.c.ConnectionManager zkClient has connected
> 2020-05-19 03:44:40.245 INFO (main) [ ] o.a.s.c.c.ConnectionManager Client is connected to ZooKeeper
> 2020-05-19 03:44:40.359 INFO (main) [ ] o.a.s.c.c.ConnectionManager Waiting for client to connect to ZooKeeper
> 2020-05-19 03:44:40.361 INFO (zkConnectionManagerCallback-13-thread-1) [ ] o.a.s.c.c.ConnectionManager zkClient has connected
> 2020-05-19 03:44:40.361 INFO (main) [ ] o.a.s.c.c.ConnectionManager Client is connected to ZooKeeper
> 2020-05-19 03:44:40.417 INFO (main) [ ] o.a.s.c.c.ZkStateReader Updated live nodes from ZooKeeper... (0) -> (1)
> 2020-05-19 03:44:56.606 INFO (zkCallback-12-thread-2) [ ] o.a.s.c.c.ZkStateReader Updated live nodes from ZooKeeper... (1) -> (0)
> 2020-05-19 03:44:56.614 ERROR (main) [ ] o.a.s.s.SolrDispatchFilter Could not start Solr. Check solr/home property and the logs
> 2020-05-19 03:44:56.639 ERROR (main) [ ] o.a.s.c.SolrCore null:java.lang.NullPointerException
>     at org.apache.solr.cloud.ZkController.lambda$registerLiveNodesListener$10(ZkController.java:1020)
>     at org.apache.solr.common.cloud.ZkStateReader.registerLiveNodesListener(ZkStateReader.java:880)
>     at org.apache.solr.cloud.ZkController.registerLiveNodesListener(ZkController.java:1035)
>     at org.apache.solr.cloud.ZkController.init(ZkController.java:917)
>     at org.apache.solr.cloud.ZkController.<init>(ZkController.java:473)
>     at org.apache.solr.core.ZkContainer.initZooKeeper(ZkContainer.java:115)
>     at org.apache.solr.core.CoreContainer.load(CoreContainer.java:631)
> {noformat}

--
This message was sent by Atlassian Jira (v8.3.4#803005)

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org
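The race in the issue description can be illustrated with a minimal sketch. It assumes, as the description implies but this thread does not show, that getSolrCloudManager() creates the manager on first use. Every name below (LazyManagerSketch, CloudManager, getCloudManager, onNodeLostUnsafe, onNodeLostSafe) is a hypothetical stand-in, not a Solr class or method.

```java
import java.util.Collections;
import java.util.Map;

/** Hypothetical stand-in for a controller with a lazily created manager; not Solr code. */
public class LazyManagerSketch {

    /** Hypothetical stand-in for the cloud manager. */
    static class CloudManager {
        long getEpochTimeNs() {
            return System.currentTimeMillis() * 1_000_000L;
        }
    }

    // Assigned lazily, so it is still null while startup is in progress.
    private volatile CloudManager cloudManager;

    /** Creates the manager on first use, so a listener can call it at any point. */
    CloudManager getCloudManager() {
        CloudManager local = cloudManager;
        if (local == null) {
            synchronized (this) {
                local = cloudManager;
                if (local == null) {
                    local = new CloudManager();
                    cloudManager = local;
                }
            }
        }
        return local;
    }

    /** Buggy shape: dereferences the field directly and throws if the event arrives early. */
    Map<String, Long> onNodeLostUnsafe() {
        return Collections.singletonMap("timestamp", cloudManager.getEpochTimeNs());
    }

    /** Fixed shape: goes through the getter instead of the field. */
    Map<String, Long> onNodeLostSafe() {
        return Collections.singletonMap("timestamp", getCloudManager().getEpochTimeNs());
    }

    public static void main(String[] args) {
        LazyManagerSketch controller = new LazyManagerSketch();
        try {
            controller.onNodeLostUnsafe();
        } catch (NullPointerException e) {
            System.out.println("direct field access failed before initialization");
        }
        System.out.println("via getter: " + controller.onNodeLostSafe().keySet());
    }
}
```

The point of the sketch is only that the two handler shapes differ in one expression, which is the size of the change proposed in the description.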
[jira] [Commented] (SOLR-14504) ZkController LiveNodesListener has NullPointerException in startup race
[ https://issues.apache.org/jira/browse/SOLR-14504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17115157#comment-17115157 ]

ASF subversion and git services commented on SOLR-14504:
--------------------------------------------------------

Commit 57703a11cfc41ca74744549e1f08713772fb5b02 in lucene-solr's branch refs/heads/branch_8x from Andrzej Bialecki
[ https://gitbox.apache.org/repos/asf?p=lucene-solr.git;h=57703a1 ]

SOLR-14504: ZkController LiveNodesListener has NullPointerException in startup race.
[jira] [Commented] (SOLR-14504) ZkController LiveNodesListener has NullPointerException in startup race
[ https://issues.apache.org/jira/browse/SOLR-14504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17113687#comment-17113687 ]

ASF subversion and git services commented on SOLR-14504:
--------------------------------------------------------

Commit 0728ef06e98cee5a278b8d75054d0f0c9d33a5ac in lucene-solr's branch refs/heads/SOLR-14461-fileupload from Andrzej Bialecki
[ https://gitbox.apache.org/repos/asf?p=lucene-solr.git;h=0728ef0 ]

SOLR-14504: ZkController LiveNodesListener has NullPointerException in startup race.
[jira] [Commented] (SOLR-14504) ZkController LiveNodesListener has NullPointerException in startup race
[ https://issues.apache.org/jira/browse/SOLR-14504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17113336#comment-17113336 ]

ASF subversion and git services commented on SOLR-14504:
--------------------------------------------------------

Commit 0728ef06e98cee5a278b8d75054d0f0c9d33a5ac in lucene-solr's branch refs/heads/master from Andrzej Bialecki
[ https://gitbox.apache.org/repos/asf?p=lucene-solr.git;h=0728ef0 ]

SOLR-14504: ZkController LiveNodesListener has NullPointerException in startup race.
[jira] [Commented] (SOLR-14504) ZkController LiveNodesListener has NullPointerException in startup race
[ https://issues.apache.org/jira/browse/SOLR-14504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17113324#comment-17113324 ]

Andrzej Bialecki commented on SOLR-14504:
-----------------------------------------

Right, I tried to come up with a unit test, too, but it's such an awkward place that it would require some restructuring to do it. I'm not happy about it but the fix is simple and makes sense, so let's do it.
[jira] [Commented] (SOLR-14504) ZkController LiveNodesListener has NullPointerException in startup race
[ https://issues.apache.org/jira/browse/SOLR-14504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17113272#comment-17113272 ]

Lucene/Solr QA commented on SOLR-14504:
---------------------------------------

| (x) *-1 overall* |

|| Vote || Subsystem || Runtime || Comment ||
|| || || || Prechecks ||
| -1 | test4tests | 0m 0s | The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch. |
|| || || || master Compile Tests ||
| +1 | compile | 1m 8s | master passed |
|| || || || Patch Compile Tests ||
| +1 | compile | 1m 3s | the patch passed |
| +1 | javac | 1m 3s | the patch passed |
| +1 | Release audit (RAT) | 1m 3s | the patch passed |
| +1 | Check forbidden APIs | 1m 3s | the patch passed |
| +1 | Validate source patterns | 1m 3s | the patch passed |
|| || || || Other Tests ||
| +1 | unit | 46m 24s | core in the patch passed. |
| | | 50m 31s | |

|| Subsystem || Report/Notes ||
| JIRA Issue | SOLR-14504 |
| JIRA Patch URL | https://issues.apache.org/jira/secure/attachment/13003634/SOLR-14504.patch |
| Optional Tests | compile javac unit ratsources checkforbiddenapis validatesourcepatterns |
| uname | Linux lucene1-us-west 4.15.0-54-generic #58-Ubuntu SMP Mon Jun 24 10:55:24 UTC 2019 x86_64 x86_64 x86_64 GNU/Linux |
| Build tool | ant |
| Personality | /home/jenkins/jenkins-slave/workspace/PreCommit-SOLR-Build/sourcedir/dev-tools/test-patch/lucene-solr-yetus-personality.sh |
| git revision | master / 9c066f60f18 |
| ant | version: Apache Ant(TM) version 1.10.5 compiled on March 28 2019 |
| Default Java | LTS |
| Test Results | https://builds.apache.org/job/PreCommit-SOLR-Build/753/testReport/ |
| modules | C: solr/core U: solr/core |
| Console output | https://builds.apache.org/job/PreCommit-SOLR-Build/753/console |
| Powered by | Apache Yetus 0.7.0 http://yetus.apache.org |

This message was automatically generated.
[jira] [Commented] (SOLR-14504) ZkController LiveNodesListener has NullPointerException in startup race
[ https://issues.apache.org/jira/browse/SOLR-14504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17113118#comment-17113118 ]

Andrzej Bialecki commented on SOLR-14504:
-----------------------------------------

The proposed fix makes sense, I'll commit it shortly - thanks!
[jira] [Commented] (SOLR-14504) ZkController LiveNodesListener has NullPointerException in startup race
[ https://issues.apache.org/jira/browse/SOLR-14504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17113064#comment-17113064 ]

Colvin Cowie commented on SOLR-14504:
-------------------------------------

Without making big changes to ZkController's actual implementation, I don't really see a way to write a useful (automated) regression test for this.

[~ab] what are your thoughts on just fixing it without a test? Or can you see a good way to test it?
[jira] [Commented] (SOLR-14504) ZkController LiveNodesListener has NullPointerException in startup race
[ https://issues.apache.org/jira/browse/SOLR-14504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17113034#comment-17113034 ]

Colvin Cowie commented on SOLR-14504:
-------------------------------------

So with breakpoints on in a debugger it's easy to reproduce this, but I'm not seeing a _nice_ way to write a test for it, at least not in the ZkControllerTest.

There's a lot of stuff happening in the constructor of the ZkController (both registerLiveNodesListener and getSolrCloudManager are called, so the race only exists between those calls) and registerLiveNodesListener is private. I've gotten part way to testing it by constructing the ZkController on another thread, but then the NullPointerException is just lost in a callback thread's stack...

The NullPointerException causing the Core container load to fail can only happen when a node lost event occurs in the initial {{at org.apache.solr.cloud.ZkController.registerLiveNodesListener(ZkController.java:1035)}}
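The remark above about an exception being lost on another thread can be illustrated in plain Java. This is a minimal sketch of one general way to carry a background failure back to the calling thread through a Future; it is not taken from the patch or from ZkControllerTest, and whether it could be applied to ZkController without the restructuring discussed in this thread is not established here. The names BackgroundFailureSketch, manager, failingTask and captureFailure are hypothetical.

```java
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

/** Hypothetical illustration; none of these names exist in Solr. */
public class BackgroundFailureSketch {

    // Stands in for a field that has not been initialized yet.
    static Object manager;

    /** Work that dereferences the uninitialized field on a background thread. */
    static String failingTask() {
        return manager.toString(); // throws NullPointerException while manager is null
    }

    /** Runs the task on another thread and returns whatever it threw, or null on success. */
    static Throwable captureFailure() {
        ExecutorService executor = Executors.newSingleThreadExecutor();
        try {
            Callable<String> task = BackgroundFailureSketch::failingTask;
            Future<String> result = executor.submit(task);
            result.get(); // rethrows the task's exception wrapped in ExecutionException
            return null;
        } catch (ExecutionException e) {
            return e.getCause(); // the original exception from the background thread
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return e;
        } finally {
            executor.shutdownNow();
        }
    }

    public static void main(String[] args) {
        System.out.println("captured: " + captureFailure());
    }
}
```

A test built this way can assert on the returned Throwable instead of relying on log output from the background thread.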