francisoliverlee opened a new issue #4949:
URL: https://github.com/apache/incubator-doris/issues/4949
**Describe the bug**
On release 0.12.21, re-adding an FE follower to the cluster fails with an error.
**To Reproduce**
1. Use "kill -9" to kill one follower FE.
2. Delete that FE's meta dir and start it again; the start fails.
3. Use "alter system" to drop the FE, clean the meta dir, then use "alter system"
to add it back into the cluster. The error shown below is reported.
**Screenshots**
```
2020-11-23 13:13:59.722 UTC WARNING [11.11.27.153_9010_1606137216465] Exiting
inner Replica loop with exception com.sleepycat.je.EnvironmentFailureException:
(JE 7.3.7) 11.11.27.153_9010_1606137216465(-1):/data/doris-fe/doris-meta/bdb
Feeder: 11.11.27.152_9010_1603703245840(14).
com.sleepycat.je.rep.impl.RepGroupImpl$NodeConflictException: (JE 7.3.7) New or
moved node:11.11.27.153_9010_1606137216465, is configured with the socket
address: /11.11.27.153:9010. It conflicts with the socket already used by the
member: 11.11.27.153_9010_1603291427805 HANDSHAKE_ERROR: Error during the
handshake between two nodes. Some validity or compatibility check failed,
preventing further communication between the nodes. Environment is invalid and
must be closed.
com.sleepycat.je.EnvironmentFailureException: (JE 7.3.7)
11.11.27.153_9010_1606137216465(-1):/data/doris-fe/doris-meta/bdb Feeder:
11.11.27.152_9010_1603703245840(14).
com.sleepycat.je.rep.impl.RepGroupImpl$NodeConflictException: (JE 7.3.7) New or
moved node:11.11.27.153_9010_1606137216465, is configured with the socket
address: /11.11.27.153:9010. It conflicts with the socket already used by the
member: 11.11.27.153_9010_1603291427805 HANDSHAKE_ERROR: Error during the
handshake between two nodes. Some validity or compatibility check failed,
preventing further communication between the nodes. Environment is invalid and
must be closed.
at com.sleepycat.je.rep.stream.ReplicaFeederHandshake.verifyMembership(ReplicaFeederHandshake.java:334)
at com.sleepycat.je.rep.stream.ReplicaFeederHandshake.execute(ReplicaFeederHandshake.java:259)
at com.sleepycat.je.rep.impl.node.Replica.initReplicaLoop(Replica.java:691)
at com.sleepycat.je.rep.impl.node.Replica.runReplicaLoopInternal(Replica.java:474)
at com.sleepycat.je.rep.impl.node.Replica.runReplicaLoop(Replica.java:409)
at com.sleepycat.je.rep.impl.node.RepNode.run(RepNode.java:1873)
```
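The two node names in the log share the address 11.11.27.153:9010 and differ only in the trailing number. A minimal sketch for reading that number, assuming it is an epoch-millisecond timestamp (the log does not state this); `NodeNameSuffix` and `createdAt` are hypothetical names used only for illustration:

```java
import java.time.Instant;

/** Hypothetical helper: decodes the trailing number of an FE node name. */
final class NodeNameSuffix {

    /**
     * Assumes the name has the form host_port_suffix and that the suffix is
     * milliseconds since the Unix epoch (an assumption, not confirmed by the log).
     */
    static Instant createdAt(String nodeName) {
        String suffix = nodeName.substring(nodeName.lastIndexOf('_') + 1);
        return Instant.ofEpochMilli(Long.parseLong(suffix));
    }

    public static void main(String[] args) {
        // Node names copied from the log above.
        System.out.println(createdAt("11.11.27.153_9010_1603291427805")); // existing member
        System.out.println(createdAt("11.11.27.153_9010_1606137216465")); // newly started node
    }
}
```

Under that assumption, the newly started node's suffix decodes to 2020-11-23T13:13:36.465Z, a few seconds before the 13:13:59 log line. That would be consistent with the FE generating a fresh node name after its meta dir was cleaned, while the group still holds the older name for the same address.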
**Additional context**
I did some checking and found that the dropped node is still present in BDB JE's
in-memory state, in the following two fields of
/com/sleepycat/je/7.3.7/je-7.3.7.jar!/com/sleepycat/je/rep/impl/RepGroupImpl.class:
```
/* All the nodes that form the replication group, indexed by Id. */
private final Map<Integer, RepNodeImpl> nodesById =
    new HashMap<Integer, RepNodeImpl>();

/*
 * All the nodes that form the replication group, indexed by node name.
 * This map is used exclusively for efficient lookups by name. The map
 * nodesById does all the heavy lifting.
 */
private final Map<String, RepNodeImpl> nodesByName =
    new HashMap<String, RepNodeImpl>();
```
However, the dropped node is not found in the BDB image file.
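To illustrate why a leftover in-memory entry would produce this error, here is a toy model of a name-indexed membership map. It assumes the check rejects a joining node whose socket address is already held by a member with a different name, which is what the NodeConflictException message suggests. This is not BDB JE code; `NodeConflictSketch`, `socketOf`, and `findConflict` are hypothetical names.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Optional;

/** Toy model of a name-indexed membership map; this is not BDB JE code. */
final class NodeConflictSketch {

    /** Extracts "host:port" from a node name of the assumed form host_port_suffix. */
    static String socketOf(String nodeName) {
        String[] parts = nodeName.split("_");
        if (parts.length != 3) {
            throw new IllegalArgumentException("unexpected node name: " + nodeName);
        }
        return parts[0] + ":" + parts[1];
    }

    /**
     * Returns the name of an existing member that has a different name but the
     * same socket address as the joining node, if there is one.
     */
    static Optional<String> findConflict(Map<String, String> socketsByName, String joining) {
        String socket = socketOf(joining);
        for (Map.Entry<String, String> e : socketsByName.entrySet()) {
            if (!e.getKey().equals(joining) && e.getValue().equals(socket)) {
                return Optional.of(e.getKey());
            }
        }
        return Optional.empty();
    }

    public static void main(String[] args) {
        Map<String, String> socketsByName = new LinkedHashMap<>();
        for (String name : new String[] {
                "11.11.27.152_9010_1603703245840",    // feeder from the log
                "11.11.27.153_9010_1603291427805"}) { // leftover entry of the dropped FE
            socketsByName.put(name, socketOf(name));
        }
        String joining = "11.11.27.153_9010_1606137216465";

        // With the leftover entry present, the joining node is rejected.
        System.out.println(findConflict(socketsByName, joining));

        // Once the leftover entry is gone, the same node no longer conflicts.
        socketsByName.remove("11.11.27.153_9010_1603291427805");
        System.out.println(findConflict(socketsByName, joining));
    }
}
```

In this model the join succeeds only after the old name is removed from the map, which matches the symptom reported here: the drop did not clear the in-memory group, so the re-added FE collides with its own previous entry.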