刘珍 created IOTDB-4040:
-------------------------
Summary: [grpc-default-executor-0] WARN
o.a.ratis.util.LogUtils:124 - : Failed requestVote ..._50010#0
Key: IOTDB-4040
URL: https://issues.apache.org/jira/browse/IOTDB-4040
Project: Apache IoTDB
Issue Type: Bug
Components: mpp-cluster
Affects Versions: 0.14.0-SNAPSHOT
Reporter: 刘珍
Assignee: Song Ziyang
Attachments: image-2022-08-04-17-51-22-725.png,
image-2022-08-04-17-52-13-492.png
master_0804_f900a34
schema/data : RatisConsensus
3副本3C7D,干净环境,ip5注册到集群成功后,kill datanode进程。
注册元数据,
写入数据,ip5 上有2个schemaregion,启动ip5,同步schemaregion失败:
2022-08-04 17:23:30,895 [grpc-default-executor-0] WARN
o.a.ratis.util.LogUtils:124 - 172.20.70.5_50010: Failed requestVote
172.20.70.16_50010->172.20.70.5_50010#0
org.apache.ratis.protocol.exceptions.GroupMismatchException: 172.20.70.5_50010:
group-000200000001 not found.
at
org.apache.ratis.server.impl.RaftServerProxy$ImplMap.get(RaftServerProxy.java:148)
at
org.apache.ratis.server.impl.RaftServerProxy.getImplFuture(RaftServerProxy.java:347)
at
org.apache.ratis.server.impl.RaftServerProxy.getImpl(RaftServerProxy.java:356)
at
org.apache.ratis.server.impl.RaftServerProxy.getImpl(RaftServerProxy.java:351)
at
org.apache.ratis.server.impl.RaftServerProxy.requestVote(RaftServerProxy.java:603)
at
org.apache.ratis.grpc.server.GrpcServerProtocolService.requestVote(GrpcServerProtocolService.java:172)
at
org.apache.ratis.proto.grpc.RaftServerProtocolServiceGrpc$MethodHandlers.invoke(RaftServerProtocolServiceGrpc.java:382)
at
org.apache.ratis.thirdparty.io.grpc.stub.ServerCalls$UnaryServerCallHandler$UnaryServerCallListener.onHalfClose(ServerCalls.java:182)
at
org.apache.ratis.thirdparty.io.grpc.PartialForwardingServerCallListener.onHalfClose(PartialForwardingServerCallListener.java:35)
at
org.apache.ratis.thirdparty.io.grpc.ForwardingServerCallListener.onHalfClose(ForwardingServerCallListener.java:23)
at
org.apache.ratis.thirdparty.io.grpc.internal.ServerCallImpl$ServerStreamListenerImpl.halfClosed(ServerCallImpl.java:340)
at
org.apache.ratis.thirdparty.io.grpc.internal.ServerImpl$JumpToApplicationThreadServerStreamListener$1HalfClosed.runInContext(ServerImpl.java:866)
at
org.apache.ratis.thirdparty.io.grpc.internal.ContextRunnable.run(ContextRunnable.java:37)
at
org.apache.ratis.thirdparty.io.grpc.internal.SerializingExecutor.run(SerializingExecutor.java:133)
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)
复现流程
1.私有云172.20.70.2/3/4/5/13/14/16
benchmark 在 ip 15
2. ip5启动脚本监测datanode进程
while true
do
jps|grep -i datanode|awk '{print "kill -9 " $1}'|sh
sleep 2
done
3. 启动3个confignode
启动7个datanode
!image-2022-08-04-17-51-22-725.png!
4.启动3个bm
配置见附件
查看regions信息
!image-2022-08-04-17-52-13-492.png!
5. 启动ip5 的datanode
日志warn,schemaregion同步失败。
--
This message was sent by Atlassian Jira
(v8.20.10#820010)