[ 
https://issues.apache.org/jira/browse/GEODE-3588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16317002#comment-16317002
 ] 

Bruce Schuchardt commented on GEODE-3588:
-----------------------------------------

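The second time the locator is restarted, it sends a join request to the 
coordinator (server1). The coordinator logs that it received the request but 
never acts on it. About five seconds later the locator times out the join 
attempt, makes itself the membership coordinator, and starts up in a 
split-brain.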

{noformat}
locator1/locator1.log: [info 2018/01/08 11:29:12.251 PST locator1 <main> 
tid=0x1] This member is hosting a locator will be preferred as a membership 
coordinator

locator1/locator1.log: [info 2018/01/08 11:29:12.252 PST locator1 <Geode 
Failure Detection Server thread 0> tid=0x20] Started failure detection server 
thread on /10.118.20.12:4847.

locator1/locator1.log: [info 2018/01/08 11:29:12.253 PST locator1 <main> 
tid=0x1] Peer locator is connecting to local membership services with ID 
10.118.20.12(locator1:54429:locator)<ec>:1024

locator1/locator1.log: [info 2018/01/08 11:29:12.270 PST locator1 <main> 
tid=0x1] Attempting to join the distributed system through coordinator 
10.118.20.12(server1:54371)<v1>:1025 using address 
10.118.20.12(locator1:54429:locator)<ec>:1024


server1/server1.log: [info 2018/01/08 11:29:12.277 PST server1 <unicast 
receiver,bruces-mbpro-48625> tid=0x1b] received join request from 
10.118.20.12(locator1:54429:locator)<ec>:1024

server1/server1.log: [info 2018/01/08 11:29:12.277 PST server1 <unicast 
receiver,bruces-mbpro-48625> tid=0x1b] Received a join request from 
10.118.20.12(locator1:54429:locator)<ec>:1024



locator1/locator1.log: [info 2018/01/08 11:29:17.281 PST locator1 <main> 
tid=0x1] This member is becoming the membership coordinator with address 
10.118.20.12(locator1:54429:locator)<ec>:1024

locator1/locator1.log: [info 2018/01/08 11:29:17.281 PST locator1 <main> 
tid=0x1] received new view: 
View[10.118.20.12(locator1:54429:locator)<ec><v0>:1024|0] members: 
[10.118.20.12(locator1:54429:locator)<ec><v0>:1024]
  old view is: null

locator1/locator1.log: [info 2018/01/08 11:29:17.282 PST locator1 <main> 
tid=0x1] Peer locator received new membership view: 
View[10.118.20.12(locator1:54429:locator)<ec><v0>:1024|0] members: 
[10.118.20.12(locator1:54429:locator)<ec><v0>:1024]

locator1/locator1.log: [info 2018/01/08 11:29:17.291 PST locator1 <main> 
tid=0x1] ViewCreator starting 
on:10.118.20.12(locator1:54429:locator)<ec><v0>:1024
{noformat}
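
To make the timing in the excerpt explicit, here is a minimal Python sketch 
that measures how long the locator waited between its join attempt and the 
moment it made itself coordinator. The helper names (parse_timestamp, 
find_timestamp, join_gap_seconds) are made up for illustration and are not 
part of Geode. The two log lines are copied from the excerpt with the email 
line wrapping undone, and the parsing assumes the timestamp layout seen there.

```python
from datetime import datetime

# Two lines copied from the locator1 log excerpt (email wrapping undone).
LOG_LINES = [
    "[info 2018/01/08 11:29:12.270 PST locator1 <main> tid=0x1] "
    "Attempting to join the distributed system through coordinator "
    "10.118.20.12(server1:54371)<v1>:1025 using address "
    "10.118.20.12(locator1:54429:locator)<ec>:1024",
    "[info 2018/01/08 11:29:17.281 PST locator1 <main> tid=0x1] "
    "This member is becoming the membership coordinator with address "
    "10.118.20.12(locator1:54429:locator)<ec>:1024",
]


def parse_timestamp(line):
    """Parse the date and time fields that follow the log level."""
    # Assumed layout: "[info", "2018/01/08", "11:29:12.270", "PST", ...
    fields = line.split()
    return datetime.strptime(fields[1] + " " + fields[2],
                             "%Y/%m/%d %H:%M:%S.%f")


def find_timestamp(lines, marker):
    """Return the timestamp of the first line containing marker, or None."""
    for line in lines:
        if marker in line:
            return parse_timestamp(line)
    return None


def join_gap_seconds(lines):
    """Seconds between the join attempt and the self-promotion to coordinator."""
    start = find_timestamp(lines, "Attempting to join the distributed system")
    end = find_timestamp(lines, "becoming the membership coordinator")
    if start is None or end is None:
        return None
    return (end - start).total_seconds()


if __name__ == "__main__":
    print("join gap in seconds:", join_gap_seconds(LOG_LINES))
```

For the two lines above the gap is a little over five seconds, which matches 
the window in which server1 logged the join request but sent no new view.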

> Restarting a Locator Loses Cluster Configuration
> ------------------------------------------------
>
>                 Key: GEODE-3588
>                 URL: https://issues.apache.org/jira/browse/GEODE-3588
>             Project: Geode
>          Issue Type: Bug
>          Components: locator, membership
>            Reporter: Michael Martell
>            Priority: Minor
>         Attachments: RunLocalhostGemfire.ps1
>
>
> Restarting the locator and listing members works properly. Repeating the 
> process a second time shows no cluster members.
> To repro:
> 1) start a locator
> 2) start a server
> 3) list members (shows locator and server)
> 4) stop the locator
> 5) restart the locator (same as step 1)
> 6) list members (shows locator and server)
> 7) stop the locator a second time
> 8) restart the locator a second time (same as step 1)
> 9) list members (shows only the locator)
> Notes:
> - This was done on a Windows machine using the latest geode/develop (as of 
> 9/11/2017)
> - Was tried with and without the protobuf=true flag, and fails in both 
> cases
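
The nine repro steps quoted above can be sketched as a sequence of gfsh 
invocations. This is only an illustration and is not the attached 
RunLocalhostGemfire.ps1 script. The member names, the default locator port 
10334, the use of "stop locator --dir" with the locator's working directory, 
and the helper names (repro_steps, to_argv) are all assumptions. The script 
only prints the command lines; it does not start anything.

```python
def repro_steps(locator="locator1", server="server1", port=10334):
    """Return the nine repro steps, each as a list of gfsh commands."""
    endpoint = f"localhost[{port}]"
    start_locator = [f"start locator --name={locator} --port={port}"]
    start_server = [f"start server --name={server} --locators={endpoint}"]
    # "list members" needs a connected gfsh session, so connect first.
    list_members = [f"connect --locator={endpoint}", "list members"]
    # Assumes the locator's working directory is ./<locator name>.
    stop_locator = [f"stop locator --dir={locator}"]

    steps = [start_locator, start_server, list_members]   # steps 1-3
    for _ in range(2):                                     # steps 4-6, 7-9
        steps += [stop_locator, start_locator, list_members]
    return steps


def to_argv(commands):
    """Build one gfsh command line that runs the commands in order via -e."""
    argv = ["gfsh"]
    for command in commands:
        argv += ["-e", command]
    return argv


if __name__ == "__main__":
    for number, commands in enumerate(repro_steps(), start=1):
        print(number, " ".join(repr(arg) for arg in to_argv(commands)))
```

Per the report, the last "list members" (step 9) is the one expected to show 
only the locator.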



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
