刘珍 created IOTDB-4494:
-------------------------
Summary: [ MultiLeaderConsensus ]
org.apache.iotdb.db.mpp.execution.exchange.LocalSourceHandle cannot be cast to
org.apache.iotdb.db.mpp.execution.exchange.SourceHandle
Key: IOTDB-4494
URL: https://issues.apache.org/jira/browse/IOTDB-4494
Project: Apache IoTDB
Issue Type: Bug
Components: mpp-cluster
Affects Versions: 0.14.0-SNAPSHOT
Reporter: 刘珍
Assignee: Jinrui Zhang
master_0921_c9ee123
DataRegion: {color:#DE350B}MultiLeaderConsensus{color}
3副本3C3D
2022-09-21 22:34:36,539 [pool-14-IoTDB-MPPDataExchangeRPC-Processor-10] ERROR
o.a.t.ProcessFunction:47 - Internal error processing onNewDataBlockEvent
java.lang.ClassCastException:
org.apache.iotdb.db.mpp.execution.exchange.LocalSourceHandle cannot be cast to
org.apache.iotdb.db.mpp.execution.exchange.SourceHandle
at
org.apache.iotdb.db.mpp.execution.exchange.MPPDataExchangeManager$MPPDataExchangeServiceImpl.onNewDataBlockEvent(MPPDataExchangeManager.java:171)
at
org.apache.iotdb.mpp.rpc.thrift.MPPDataExchangeService$Processor$onNewDataBlockEvent.getResult(MPPDataExchangeService.java:376)
at
org.apache.iotdb.mpp.rpc.thrift.MPPDataExchangeService$Processor$onNewDataBlockEvent.getResult(MPPDataExchangeService.java:356)
at org.apache.thrift.ProcessFunction.process(ProcessFunction.java:38)
at org.apache.thrift.TBaseProcessor.process(TBaseProcessor.java:38)
at
org.apache.thrift.server.TThreadPoolServer$WorkerProcess.run(TThreadPoolServer.java:248)
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)
田原分析:retry导致的,因为retry没改变queryId,所以SourceHandle生成了一样的id,导致第一次的SinkHandle还在向重试后重新生成的LocalSourceHandle发送消息
测试环境
1. 192.168.10.62/66/68
bm在64
2. 集群配置参数
ConfigNode
MAX_HEAP_SIZE="8G"
MAX_DIRECT_MEMORY_SIZE="8G"
schema_region_consensus_protocol_class=org.apache.iotdb.consensus.ratis.RatisConsensus
data_region_consensus_protocol_class=org.apache.iotdb.consensus.multileader.MultiLeaderConsensus
schema_replication_factor=3
connection_timeout_ms=3600000
data_replication_factor=3
DataNode
MAX_HEAP_SIZE="192G"
MAX_DIRECT_MEMORY_SIZE="32G"
enable_timed_flush_seq_memtable=true
seq_memtable_flush_interval_in_ms=3600000
seq_memtable_flush_check_interval_in_ms=600000
enable_timed_flush_unseq_memtable=true
unseq_memtable_flush_interval_in_ms=3600000
unseq_memtable_flush_check_interval_in_ms=600000
max_waiting_time_when_insert_blocked=3600000
query_timeout_threshold=36000000
3. bm 配置
见附件
--
This message was sent by Atlassian Jira
(v8.20.10#820010)