刘珍 created IOTDB-4027:
-------------------------
Summary: ERROR o.a.i.d.e.s.SnapshotLoader:94 - Exception occurs
when creating links from snapshot directory to data directory
Key: IOTDB-4027
URL: https://issues.apache.org/jira/browse/IOTDB-4027
Project: Apache IoTDB
Issue Type: Bug
Components: mpp-cluster
Affects Versions: 0.14.0-SNAPSHOT
Reporter: 刘珍
Assignee: Liuxuxin
Attachments: image-2022-08-03-09-39-10-230.png,
image-2022-08-03-09-39-48-739.png, ip4_2000_config.properties
master_0801_55b5b17
问题描述
RatisConsensus,3副本3C9D,1个bm连1个datanode执行并发写入,停止1个follower节点,5分钟后启动;然后停止另1个follower节点10分钟后启动(没发生snapshot),此节点报错:
2022-08-02 18:04:17,376 [pool-4-thread-1] ERROR o.a.i.d.e.s.SnapshotLoader:94 -
Exception occurs when creating links from snapshot directory to data directory
java.io.IOException: Cannot find
/data/iotdb/master_0801_2de0dd8/datanode/./sbin/../data/consensus/data_region/47474747-4747-4747-4747-000100000001/sm/1_354536/sequence/root.ip4.g_0
or
/data/iotdb/master_0801_2de0dd8/datanode/./sbin/../data/consensus/data_region/47474747-4747-4747-4747-000100000001/sm/1_354536/unsequence/root.ip4.g_0
at
org.apache.iotdb.db.engine.snapshot.SnapshotLoader.createLinksFromSnapshotDirToDataDir(SnapshotLoader.java:163)
at
org.apache.iotdb.db.engine.snapshot.SnapshotLoader.loadSnapshotForStateMachine(SnapshotLoader.java:91)
at
org.apache.iotdb.db.consensus.statemachine.DataRegionStateMachine.loadSnapshot(DataRegionStateMachine.java:93)
at
org.apache.iotdb.consensus.ratis.ApplicationStateMachineProxy.loadSnapshot(ApplicationStateMachineProxy.java:188)
at
org.apache.iotdb.consensus.ratis.ApplicationStateMachineProxy.lambda$initialize$0(ApplicationStateMachineProxy.java:73)
at
org.apache.ratis.util.LifeCycle.startAndTransition(LifeCycle.java:270)
at
org.apache.iotdb.consensus.ratis.ApplicationStateMachineProxy.initialize(ApplicationStateMachineProxy.java:69)
at org.apache.ratis.server.impl.ServerState.<init>(ServerState.java:136)
at
org.apache.ratis.server.impl.RaftServerImpl.<init>(RaftServerImpl.java:201)
at
org.apache.ratis.server.impl.RaftServerProxy.lambda$newRaftServerImpl$5(RaftServerProxy.java:274)
at
java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1590)
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)
2022-08-02 18:04:17,376 [pool-4-thread-1] ERROR
o.a.i.d.c.s.DataRegionStateMachine:95 - Fail to load snapshot from
/data/iotdb/master_0801_2de0dd8/datanode/./sbin/../data/consensus/data_region/47474747-4747-4747-4747-000100000001/sm/1_354536
1. 复现流程
私有云172.20.70.2/3/4/5/13/14/16/18/19
benchmark 在ip15(连ip4)
停ip4/启动ip4 , 停ip18/启动ip18,ip18报错
!image-2022-08-03-09-39-10-230.png!
!image-2022-08-03-09-39-48-739.png!
2. 启动benchmark
2022-08-02 17:34:57 启动bm
3. 停止ip4的datanode
2022-08-02 17:45:42停止datanode
sleep 300
启动ip4
4. 停止ip18的datanode
2022-08-02 17:54:11 停止ip18的datanode
sleep 600
启动ip18
{color:#DE350B}*启动过程中,报错*{color}:
见问题描述
--
This message was sent by Atlassian Jira
(v8.20.10#820010)