modyuan commented on issue #17286:
URL: https://github.com/apache/doris/issues/17286#issuecomment-1562221724
> I have the same problem here, and `vm.max_map_count` is already set.
>
> FE log
>
> ```
> doris-fe | 2023-03-29 01:52:14,184 WARN (heartbeat mgr|21)
[HeartbeatMgr.runAfterCatalogReady():139] get bad heartbeat response: type:
BACKEND, status: BAD, msg: java.net.ConnectException: Connection refused
(Connection refused), beId: 10003, beHost: 172.20.80.3, bePort: 0, httpPort: 0,
brpcPort: 0
> doris-fe | 2023-03-29 01:52:19,227 WARN (heartbeat-mgr-pool-7|147)
[HeartbeatMgr$BackendHeartbeatHandler.call():268] backend heartbeat got
exception
> doris-fe | org.apache.thrift.transport.TTransportException:
java.net.ConnectException: Connection refused (Connection refused)
> doris-fe | at
org.apache.thrift.transport.TSocket.open(TSocket.java:226)
~[spark-dpp-1.0-SNAPSHOT.jar:1.0-SNAPSHOT]
> doris-fe | at
org.apache.doris.common.GenericPool$ThriftClientFactory.create(GenericPool.java:143)
~[doris-fe.jar:1.0-SNAPSHOT]
> doris-fe | at
org.apache.doris.common.GenericPool$ThriftClientFactory.create(GenericPool.java:126)
~[doris-fe.jar:1.0-SNAPSHOT]
> doris-fe | at
org.apache.commons.pool2.BaseKeyedPooledObjectFactory.makeObject(BaseKeyedPooledObjectFactory.java:62)
~[commons-pool2-2.2.jar:2.2]
> doris-fe | at
org.apache.commons.pool2.impl.GenericKeyedObjectPool.create(GenericKeyedObjectPool.java:1012)
~[commons-pool2-2.2.jar:2.2]
> doris-fe | at
org.apache.commons.pool2.impl.GenericKeyedObjectPool.borrowObject(GenericKeyedObjectPool.java:356)
~[commons-pool2-2.2.jar:2.2]
> doris-fe | at
org.apache.commons.pool2.impl.GenericKeyedObjectPool.borrowObject(GenericKeyedObjectPool.java:277)
~[commons-pool2-2.2.jar:2.2]
> doris-fe | at
org.apache.doris.common.GenericPool.borrowObject(GenericPool.java:95)
~[doris-fe.jar:1.0-SNAPSHOT]
> doris-fe | at
org.apache.doris.system.HeartbeatMgr$BackendHeartbeatHandler.call(HeartbeatMgr.java:225)
~[doris-fe.jar:1.0-SNAPSHOT]
> doris-fe | at
org.apache.doris.system.HeartbeatMgr$BackendHeartbeatHandler.call(HeartbeatMgr.java:203)
~[doris-fe.jar:1.0-SNAPSHOT]
> doris-fe | at
java.util.concurrent.FutureTask.run(FutureTask.java:266) ~[?:1.8.0_342]
> doris-fe | at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
~[?:1.8.0_342]
> doris-fe | at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
~[?:1.8.0_342]
> doris-fe | at java.lang.Thread.run(Thread.java:750) ~[?:1.8.0_342]
> doris-fe | Caused by: java.net.ConnectException: Connection refused
(Connection refused)
> doris-fe | at java.net.PlainSocketImpl.socketConnect(Native Method)
~[?:1.8.0_342]
> doris-fe | at
java.net.AbstractPlainSocketImpl.doConnect(AbstractPlainSocketImpl.java:350)
~[?:1.8.0_342]
> doris-fe | at
java.net.AbstractPlainSocketImpl.connectToAddress(AbstractPlainSocketImpl.java:206)
~[?:1.8.0_342]
> doris-fe | at
java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:188)
~[?:1.8.0_342]
> doris-fe | at
java.net.SocksSocketImpl.connect(SocksSocketImpl.java:392) ~[?:1.8.0_342]
> doris-fe | at java.net.Socket.connect(Socket.java:607) ~[?:1.8.0_342]
> doris-fe | at
org.apache.thrift.transport.TSocket.open(TSocket.java:221)
~[spark-dpp-1.0-SNAPSHOT.jar:1.0-SNAPSHOT]
> doris-fe | ... 13 more
> doris-fe | 2023-03-29 01:52:19,228 WARN (heartbeat mgr|21)
[HeartbeatMgr.runAfterCatalogReady():139] get bad heartbeat response: type:
BACKEND, status: BAD, msg: java.net.ConnectException: Connection refused
(Connection refused), beId: 10003, beHost: 172.20.80.3, bePort: 0, httpPort: 0,
brpcPort: 0
> doris-fe | 2023-03-29 01:52:21,546 INFO (tablet checker|28)
[TabletChecker.checkTablets():331] finished to check tablets.
unhealth/total/added/in_sched/not_ready: 0/0/0/0/0, cost: 0 ms
> doris-fe | 2023-03-29 01:52:21,550 INFO (leaderCheckpointer|86)
[BDBJEJournal.getFinalizedJournalId():405] database names: 1
> doris-fe | 2023-03-29 01:52:21,550 INFO (leaderCheckpointer|86)
[Checkpoint.doCheckpoint():94] last checkpoint journal id: 0, current finalized
journal id: 0
> doris-fe | 2023-03-29 01:52:21,668 WARN (ForkJoinPool-2-worker-2|159)
[TabletStatMgr.lambda$null$0():70] task exec error. backend[10003]
> doris-fe | org.apache.thrift.transport.TTransportException: Invalid port
-1
> doris-fe | at
org.apache.thrift.transport.TSocket.open(TSocket.java:213)
~[spark-dpp-1.0-SNAPSHOT.jar:1.0-SNAPSHOT]
> doris-fe | at
org.apache.doris.common.GenericPool$ThriftClientFactory.create(GenericPool.java:143)
~[doris-fe.jar:1.0-SNAPSHOT]
> doris-fe | at
org.apache.doris.common.GenericPool$ThriftClientFactory.create(GenericPool.java:126)
~[doris-fe.jar:1.0-SNAPSHOT]
> doris-fe | at
org.apache.commons.pool2.BaseKeyedPooledObjectFactory.makeObject(BaseKeyedPooledObjectFactory.java:62)
~[commons-pool2-2.2.jar:2.2]
> doris-fe | at
org.apache.commons.pool2.impl.GenericKeyedObjectPool.create(GenericKeyedObjectPool.java:1012)
~[commons-pool2-2.2.jar:2.2]
> doris-fe | at
org.apache.commons.pool2.impl.GenericKeyedObjectPool.borrowObject(GenericKeyedObjectPool.java:356)
~[commons-pool2-2.2.jar:2.2]
> doris-fe | at
org.apache.commons.pool2.impl.GenericKeyedObjectPool.borrowObject(GenericKeyedObjectPool.java:277)
~[commons-pool2-2.2.jar:2.2]
> doris-fe | at
org.apache.doris.common.GenericPool.borrowObject(GenericPool.java:95)
~[doris-fe.jar:1.0-SNAPSHOT]
> doris-fe | at
org.apache.doris.catalog.TabletStatMgr.lambda$null$0(TabletStatMgr.java:63)
~[doris-fe.jar:1.0-SNAPSHOT]
> doris-fe | at
java.util.stream.ForEachOps$ForEachOp$OfRef.accept(ForEachOps.java:183)
~[?:1.8.0_342]
> doris-fe | at java.util.Iterator.forEachRemaining(Iterator.java:116)
~[?:1.8.0_342]
> doris-fe | at
java.util.Spliterators$IteratorSpliterator.forEachRemaining(Spliterators.java:1801)
~[?:1.8.0_342]
> doris-fe | at
java.util.stream.AbstractPipeline.copyInto(AbstractPipeline.java:482)
~[?:1.8.0_342]
> doris-fe | at
java.util.stream.ForEachOps$ForEachTask.compute(ForEachOps.java:290)
~[?:1.8.0_342]
> doris-fe | at
java.util.concurrent.CountedCompleter.exec(CountedCompleter.java:731)
~[?:1.8.0_342]
> doris-fe | at
java.util.concurrent.ForkJoinTask.doExec(ForkJoinTask.java:289) ~[?:1.8.0_342]
> doris-fe | at
java.util.concurrent.ForkJoinTask.doInvoke(ForkJoinTask.java:401) ~[?:1.8.0_342]
> doris-fe | at
java.util.concurrent.ForkJoinTask.invoke(ForkJoinTask.java:734) ~[?:1.8.0_342]
> doris-fe | at
java.util.stream.ForEachOps$ForEachOp.evaluateParallel(ForEachOps.java:159)
~[?:1.8.0_342]
> doris-fe | at
java.util.stream.ForEachOps$ForEachOp$OfRef.evaluateParallel(ForEachOps.java:173)
~[?:1.8.0_342]
> doris-fe | at
java.util.stream.AbstractPipeline.evaluate(AbstractPipeline.java:233)
~[?:1.8.0_342]
> doris-fe | at
java.util.stream.ReferencePipeline.forEach(ReferencePipeline.java:485)
~[?:1.8.0_342]
> doris-fe | at
java.util.stream.ReferencePipeline$Head.forEach(ReferencePipeline.java:650)
~[?:1.8.0_342]
> doris-fe | at
org.apache.doris.catalog.TabletStatMgr.lambda$runAfterCatalogReady$1(TabletStatMgr.java:57)
~[doris-fe.jar:1.0-SNAPSHOT]
> doris-fe | at
java.util.concurrent.ForkJoinTask$AdaptedRunnableAction.exec(ForkJoinTask.java:1386)
~[?:1.8.0_342]
> doris-fe | at
java.util.concurrent.ForkJoinTask.doExec(ForkJoinTask.java:289) ~[?:1.8.0_342]
> doris-fe | at
java.util.concurrent.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1056)
~[?:1.8.0_342]
> doris-fe | at
java.util.concurrent.ForkJoinPool.runWorker(ForkJoinPool.java:1692)
~[?:1.8.0_342]
> doris-fe | at
java.util.concurrent.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:175)
~[?:1.8.0_342]
> doris-fe | 2023-03-29 01:52:21,669 INFO (tablet stat mgr|23)
[TabletStatMgr.runAfterCatalogReady():125] finished to update index row num of
all databases. cost: 0 ms
> ```
>
> BE log
>
> ```
> doris-be | 2023-03-29T01:48:23+00:00 [Note] [Entrypoint]: register is
failed, wait next~
> doris-be | 172.20.80.2_9010_1680054434346 172.20.80.2 9010
8030 9030 9020 FOLLOWER true 578058187 true true
25 2023-03-29 01:48:20 true doris-1.2.1-rc01-Unknown
Yes
> doris-be | 2023-03-29T01:48:23+00:00 [Note] [Entrypoint]: MASTER FE is
started!
> doris-be | 2023-03-29T01:48:23+00:00 [Note] [Entrypoint]: add
priority_networks 172.20.80.0/24 to /opt/apache-doris/be/conf/be.conf
> doris-be | 2023-03-29T01:48:23+00:00 [Note] [Entrypoint]: Register BE to
FE is failed. retry.
> doris-be | 2023-03-29T01:48:23+00:00 [Note] [Entrypoint]: BE successfully
registered to FE!
> doris-be | 2023-03-29T01:48:23+00:00 [Note] [Entrypoint]: BE is not
register. retry.
> doris-be | 10003 default_cluster 172.20.80.3 9050 -1 -1
-1 NULL NULL false false false 0 0.000 1.000 B
0.000 0.00 % 0.00 % 0.000 {"location" : "default"}
{"lastSuccessReportTabletsTime":"N/A","lastStreamLoadTime":-1,"isQueryDisabled":false,"isLoadDisabled":false}
0
> doris-be | 2023-03-29T01:48:23+00:00 [Note] [Entrypoint]: Init Check -
Verify that BE is registered to FE successfully
> doris-be | 2023-03-29T01:48:23+00:00 [Note] [Entrypoint]: Ready to start
BE!
> doris-be | 2023-03-29T01:48:44+00:00 [Warn] [Entrypoint]: start check be
status~
> doris-be | 2023-03-29T01:48:44+00:00 [Note] [Entrypoint]: register is
failed, wait next~
> doris-be | 2023-03-29T01:49:04+00:00 [Warn] [Entrypoint]: start check be
status~
> doris-be | 2023-03-29T01:49:04+00:00 [Note] [Entrypoint]: register is
failed, wait next~
> doris-be | 2023-03-29T01:49:25+00:00 [Warn] [Entrypoint]: start check be
status~
> doris-be | 2023-03-29T01:49:25+00:00 [Note] [Entrypoint]: register is
failed, wait next~
> doris-be | 2023-03-29T01:49:45+00:00 [Warn] [Entrypoint]: start check be
status~
> doris-be | 2023-03-29T01:49:45+00:00 [Note] [Entrypoint]: register is
failed, wait next~
> doris-be | 2023-03-29T01:50:14+00:00 [Warn] [Entrypoint]: start check be
status~
> doris-be | 2023-03-29T01:50:14+00:00 [Note] [Entrypoint]: register is
failed, wait next~
> doris-be | Instruction check fail. The CPU does not support AVX2
instruction set.
> doris-be | 2023-03-29T01:50:36+00:00 [Warn] [Entrypoint]: start check be
status~
> doris-be | For example, if your CPU does not support AVX2, you need to
rebuild the Doris BE with: USE_AVX2=0 sh build.sh --be
> doris-be | 2023-03-29T01:50:36+00:00 [Note] [Entrypoint]: register is
failed, wait next~
> doris-be | 2023-03-29T01:50:57+00:00 [Warn] [Entrypoint]: start check be
status~
> doris-be | 2023-03-29T01:50:57+00:00 [Note] [Entrypoint]: register is
failed, wait next~
> doris-be | 2023-03-29T01:51:17+00:00 [Warn] [Entrypoint]: start check be
status~
> doris-be | 2023-03-29T01:51:17+00:00 [Note] [Entrypoint]: register is
failed, wait next~
> doris-be | 2023-03-29T01:51:38+00:00 [Warn] [Entrypoint]: start check be
status~
> doris-be | 2023-03-29T01:51:38+00:00 [Note] [Entrypoint]: register is
failed, wait next~
> doris-be | 2023-03-29T01:51:58+00:00 [Warn] [Entrypoint]: start check be
status~
> doris-be | 2023-03-29T01:51:58+00:00 [Note] [Entrypoint]: register is
failed, wait next~
> doris-be | 2023-03-29T01:52:19+00:00 [Warn] [Entrypoint]: start check be
status~
> doris-be | 2023-03-29T01:52:19+00:00 [Note] [Entrypoint]: register is
failed, wait next~
> doris-be | 2023-03-29T01:52:39+00:00 [Warn] [Entrypoint]: start check be
status~
> doris-be | 2023-03-29T01:52:39+00:00 [Note] [Entrypoint]: register is
failed, wait next~
> doris-be | 2023-03-29T01:52:59+00:00 [Warn] [Entrypoint]: start check be
status~
> doris-be | 2023-03-29T01:53:00+00:00 [Note] [Entrypoint]: register is
failed, wait next~
> ```
I ran into the same problem; it appeared a few days after deployment.
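
The quoted BE log contains `Instruction check fail. The CPU does not support AVX2 instruction set.`, which may be why the BE never starts and the FE heartbeat gets `Connection refused`. A minimal sketch for checking this on the host, assuming a Linux x86 machine where `/proc/cpuinfo` is readable; `has_cpu_flag` is a hypothetical helper written for illustration, not part of Doris:

```python
def has_cpu_flag(cpuinfo_text: str, flag: str = "avx2") -> bool:
    """Return True if any 'flags' line in /proc/cpuinfo-style text lists the flag."""
    for line in cpuinfo_text.splitlines():
        key, sep, value = line.partition(":")
        # Only inspect the 'flags' entries; other fields may mention AVX2 by name.
        if sep and key.strip() == "flags" and flag in value.split():
            return True
    return False


if __name__ == "__main__":
    try:
        with open("/proc/cpuinfo", encoding="utf-8") as f:
            text = f.read()
    except OSError:
        # /proc/cpuinfo is unavailable (for example on a non-Linux host).
        text = ""
    print("avx2 supported:", has_cpu_flag(text))
```

If this reports that AVX2 is not supported, the rebuild hint in the BE log (`USE_AVX2=0 sh build.sh --be`) is probably the relevant next step. Inside a container the result reflects the host CPU.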