[
https://issues.apache.org/jira/browse/GEODE-3588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16317002#comment-16317002
]
Bruce Schuchardt commented on GEODE-3588:
-----------------------------------------
The second time the locator is restarted it sends a join request to the
coordinator but the request is ignored. The locator times out the join attempt
and starts up in a split-brain.
{noformat}
locator1/locator1.log: [info 2018/01/08 11:29:12.251 PST locator1 <main>
tid=0x1] This member is hosting a locator will be preferred as a membership
coordinator
locator1/locator1.log: [info 2018/01/08 11:29:12.252 PST locator1 <Geode
Failure Detection Server thread 0> tid=0x20] Started failure detection server
thread on /10.118.20.12:4847.
locator1/locator1.log: [info 2018/01/08 11:29:12.253 PST locator1 <main>
tid=0x1] Peer locator is connecting to local membership services with ID
10.118.20.12(locator1:54429:locator)<ec>:1024
locator1/locator1.log: [info 2018/01/08 11:29:12.270 PST locator1 <main>
tid=0x1] Attempting to join the distributed system through coordinator
10.118.20.12(server1:54371)<v1>:1025 using address
10.118.20.12(locator1:54429:locator)<ec>:1024
server1/server1.log: [info 2018/01/08 11:29:12.277 PST server1 <unicast
receiver,bruces-mbpro-48625> tid=0x1b] received join request from
10.118.20.12(locator1:54429:locator)<ec>:1024
server1/server1.log: [info 2018/01/08 11:29:12.277 PST server1 <unicast
receiver,bruces-mbpro-48625> tid=0x1b] Received a join request from
10.118.20.12(locator1:54429:locator)<ec>:1024
locator1/locator1.log: [info 2018/01/08 11:29:17.281 PST locator1 <main>
tid=0x1] This member is becoming the membership coordinator with address
10.118.20.12(locator1:54429:locator)<ec>:1024
locator1/locator1.log: [info 2018/01/08 11:29:17.281 PST locator1 <main>
tid=0x1] received new view:
View[10.118.20.12(locator1:54429:locator)<ec><v0>:1024|0] members:
[10.118.20.12(locator1:54429:locator)<ec><v0>:1024]
old view is: null
locator1/locator1.log: [info 2018/01/08 11:29:17.282 PST locator1 <main>
tid=0x1] Peer locator received new membership view:
View[10.118.20.12(locator1:54429:locator)<ec><v0>:1024|0] members:
[10.118.20.12(locator1:54429:locator)<ec><v0>:1024]
locator1/locator1.log: [info 2018/01/08 11:29:17.291 PST locator1 <main>
tid=0x1] ViewCreator starting
on:10.118.20.12(locator1:54429:locator)<ec><v0>:1024
{noformat}
> Restarting a Locator Loses Cluster Configuration
> ------------------------------------------------
>
> Key: GEODE-3588
> URL: https://issues.apache.org/jira/browse/GEODE-3588
> Project: Geode
> Issue Type: Bug
> Components: locator, membership
> Reporter: Michael Martell
> Priority: Minor
> Attachments: RunLocalhostGemfire.ps1
>
>
> Restarting the locator and listing members works properly. Repeating the
> process a second time shows no cluster members.
> To repro:
> 1) start a locator
> 2) start a server
> 3) list members (shows locator and server)
> 4) stop the locator
> 5) restart the locator (same as step 1)
> 6) list members (shows locator and server)
> 7) stop the locator a second time
> 8) restart the locator a second time (same as step 1)
> 9) list members shows only the locator
> Notes:
> - This was done on a Windows machine using latest geode/develop (as of
> 9/11/2017)
> - Was tried with and without using protobuf=true flag, and fails in both
> cases)
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)