zbye opened a new issue, #16140:
URL: https://github.com/apache/pulsar/issues/16140
**Describe the bug**
**One Broker shutdown with the following logs :**
2022-06-17T13:27:16,824+0800 [main-SendThread(10.187.128.229:2181)] WARN
org.apache.pulsar.broker.service.persistent.PersistentTopic -
[persistent://gmtp_cffex/SF_test/bar] Failed to persist msg in store: Error
while using ZooKeeper
2022-06-17T13:27:16,824+0800 [main-SendThread(10.187.128.229:2181)] WARN
org.apache.pulsar.broker.service.persistent.PersistentTopic -
[persistent://gmtp_cffex/SF_test/bar] Failed to persist msg in store: Error
while using ZooKeeper
2022-06-17T13:27:16,824+0800 [main-SendThread(10.187.128.229:2181)] WARN
org.apache.pulsar.broker.service.persistent.PersistentTopic -
[persistent://gmtp_cffex/SF_test/bar] Failed to persist msg in store: Error
while using ZooKeeper
2022-06-17T13:27:16,824+0800 [main-SendThread(10.187.128.229:2181)] WARN
org.apache.pulsar.broker.service.persistent.PersistentTopic -
[persistent://gmtp_cffex/SF_test/bar] Failed to persist msg in store: Error
while using ZooKeeper
2022-06-17T13:27:16,824+0800 [main-SendThread(10.187.128.229:2181)] WARN
org.apache.pulsar.broker.service.persistent.PersistentTopic -
[persistent://gmtp_cffex/SF_test/bar] Failed to persist msg in store: Error
while using ZooKeeper
2022-06-17T13:27:16,824+0800 [main-SendThread(10.187.128.229:2181)] WARN
org.apache.pulsar.broker.service.persistent.PersistentTopic -
[persistent://gmtp_cffex/SF_test/bar] Failed to persist msg in store: Error
while using ZooKeeper
2022-06-17T13:27:16,824+0800 [main-SendThread(10.187.128.229:2181)] WARN
org.apache.pulsar.broker.service.persistent.PersistentTopic -
[persistent://gmtp_cffex/SF_test/bar] Failed to persist msg in store: Error
while using ZooKeeper
2022-06-17T13:27:16,824+0800 [main-SendThread(10.187.128.229:2181)] WARN
org.apache.pulsar.broker.service.persistent.PersistentTopic -
[persistent://gmtp_cffex/SF_test/bar] Failed to persist msg in store: Error
while using ZooKeeper
2022-06-17T13:27:16,824+0800 [main-SendThread(10.187.128.229:2181)] WARN
org.apache.pulsar.broker.service.persistent.PersistentTopic -
[persistent://gmtp_cffex/SF_test/bar] Failed to persist msg in store: Error
while using ZooKeeper
2022-06-17T13:27:16,824+0800 [main-SendThread(10.187.128.229:2181)] WARN
org.apache.pulsar.broker.service.persistent.PersistentTopic -
[persistent://gmtp_cffex/SF_test/bar] Failed to persist msg in store: Error
while using ZooKeeper
2022-06-17T13:27:16,824+0800 [main-SendThread(10.187.128.229:2181)] WARN
org.apache.pulsar.broker.service.persistent.PersistentTopic -
[persistent://gmtp_cffex/SF_test/bar] Failed to persist msg in store: Error
while using ZooKeeper
2022-06-17T13:27:16,824+0800 [main-SendThread(10.187.128.229:2181)] WARN
org.apache.pulsar.broker.service.persistent.PersistentTopic -
[persistent://gmtp_cffex/SF_test/bar] Failed to persist msg in store: Error
while using ZooKeeper
2022-06-17T13:27:16,824+0800 [main-SendThread(10.187.128.229:2181)] WARN
org.apache.pulsar.broker.service.persistent.PersistentTopic -
[persistent://gmtp_cffex/SF_test/bar] Failed to persist msg in store: Error
while using ZooKeeper
2022-06-17T13:27:16,824+0800 [main-SendThread(10.187.128.229:2181)] WARN
org.apache.pulsar.broker.service.persistent.PersistentTopic -
[persistent://gmtp_cffex/SF_test/bar] Failed to persist msg in store: Error
while using ZooKeeper
2022-06-17T13:27:16,824+0800 [main-SendThread(10.187.128.229:2181)] WARN
org.apache.pulsar.broker.service.persistent.PersistentTopic -
[persistent://gmtp_cffex/SF_test/bar] Failed to persist msg in store: Error
while using ZooKeeper
2022-06-17T13:27:16,824+0800 [main-SendThread(10.187.128.229:2181)] WARN
org.apache.pulsar.broker.service.persistent.PersistentTopic -
[persistent://gmtp_cffex/SF_test/bar] Failed to persist msg in store: Error
while using ZooKeeper
2022-06-17T13:27:16,824+0800 [main-SendThread(10.187.128.229:2181)] WARN
org.apache.pulsar.broker.service.persistent.PersistentTopic -
[persistent://gmtp_cffex/SF_test/bar] Failed to persist msg in store: Error
while using ZooKeeper
2022-06-17T13:27:16,824+0800 [main-SendThread(10.187.128.229:2181)] WARN
org.apache.pulsar.broker.service.persistent.PersistentTopic -
[persistent://gmtp_cffex/SF_test/bar] Failed to persist msg in store: Error
while using ZooKeeper
2022-06-17T13:27:16,824+0800 [main-SendThread(10.187.128.229:2181)] WARN
org.apache.pulsar.broker.service.persistent.PersistentTopic -
[persistent://gmtp_cffex/SF_test/bar] Failed to persist msg in store: Error
while using ZooKeeper
2022-06-17T13:27:16,824+0800 [main-SendThread(10.187.128.229:2181)] WARN
org.apache.pulsar.broker.service.persistent.PersistentTopic -
[persistent://gmtp_cffex/SF_test/bar] Failed to persist msg in store: Error
while using ZooKeeper
2022-06-17T13:27:16,824+0800 [main-SendThread(10.187.128.229:2181)] WARN
org.apache.pulsar.broker.service.persistent.PersistentTopic -
[persistent://gmtp_cffex/SF_test/bar] Failed to persist msg in store: Error
while using ZooKeeper
2022-06-17T13:27:16,824+0800 [main-SendThread(10.187.128.229:2181)] WARN
org.apache.pulsar.broker.service.persistent.PersistentTopic -
[persistent://gmtp_cffex/SF_test/bar] Failed to persist msg in store: Error
while using ZooKeeper
2022-06-17T13:27:16,824+0800 [main-SendThread(10.187.128.229:2181)] WARN
org.apache.pulsar.broker.service.persistent.PersistentTopic -
[persistent://gmtp_cffex/SF_test/bar] Failed to persist msg in store: Error
while using ZooKeeper
2022-06-17T13:27:16,824+0800 [main-SendThread(10.187.128.229:2181)] WARN
org.apache.pulsar.broker.service.persistent.PersistentTopic -
[persistent://gmtp_cffex/SF_test/bar] Failed to persist msg in store: Error
while using ZooKeeper
2022-06-17T13:27:16,824+0800 [main-SendThread(10.187.128.229:2181)] WARN
org.apache.pulsar.broker.service.persistent.PersistentTopic -
[persistent://gmtp_cffex/SF_test/bar] Failed to persist msg in store: Error
while using ZooKeeper
2022-06-17T13:27:16,824+0800 [main-SendThread(10.187.128.229:2181)] WARN
org.apache.pulsar.broker.service.persistent.PersistentTopic -
[persistent://gmtp_cffex/SF_test/bar] Failed to persist msg in store: Error
while using ZooKeeper
2022-06-17T13:27:16,824+0800 [main-SendThread(10.187.128.229:2181)] INFO
org.apache.pulsar.broker.service.persistent.PersistentTopic -
[persistent://gmtp_cffex/SF_test/bar] Un-fencing topic...
2022-06-17T13:27:16,824+0800 [main-SendThread(10.187.128.229:2181)] INFO
org.apache.bookkeeper.mledger.impl.ManagedLedgerImpl -
[gmtp_cffex/SF_test/persistent/bar] Managed ledger is now ready to accept
writes again
2022-06-17T13:27:16,824+0800 [main-SendThread(10.187.128.229:2181)] WARN
org.apache.pulsar.broker.service.persistent.PersistentTopic -
[persistent://gmtp_cffex/SF_test/bar] Failed to persist msg in store: Error
while using ZooKeeper
2022-06-17T13:27:16,825+0800 [main-SendThread(10.187.128.229:2181)] WARN
org.apache.bookkeeper.meta.AbstractZkLedgerManager - Conditional update ledger
metadata failed: SESSIONEXPIRED
2022-06-17T13:27:16,825+0800 [main-SendThread(10.187.128.229:2181)] ERROR
org.apache.bookkeeper.client.MetadataUpdateLoop -
UpdateLoop(ledgerId=135600,loopId=4f19d4e3) Error writing metadata to store
org.apache.bookkeeper.client.BKException$ZKException: Error while using
ZooKeeper
at
org.apache.bookkeeper.meta.AbstractZkLedgerManager$4.processResult(AbstractZkLedgerManager.java:509)
[org.apache.bookkeeper-bookkeeper-server-4.14.4.jar:4.14.4]
at
org.apache.bookkeeper.zookeeper.ZooKeeperClient$22$1.processResult(ZooKeeperClient.java:1094)
[org.apache.bookkeeper-bookkeeper-server-4.14.4.jar:4.14.4]
at
org.apache.zookeeper.ClientCnxn$EventThread.processEvent(ClientCnxn.java:638)
[org.apache.zookeeper-zookeeper-3.6.3.jar:3.6.3]
at
org.apache.zookeeper.ClientCnxn$EventThread.queuePacket(ClientCnxn.java:541)
[org.apache.zookeeper-zookeeper-3.6.3.jar:3.6.3]
at org.apache.zookeeper.ClientCnxn.finishPacket(ClientCnxn.java:781)
[org.apache.zookeeper-zookeeper-3.6.3.jar:3.6.3]
at org.apache.zookeeper.ClientCnxn.conLossPacket(ClientCnxn.java:818)
[org.apache.zookeeper-zookeeper-3.6.3.jar:3.6.3]
at org.apache.zookeeper.ClientCnxn.access$2600(ClientCnxn.java:106)
[org.apache.zookeeper-zookeeper-3.6.3.jar:3.6.3]
at
org.apache.zookeeper.ClientCnxn$SendThread.cleanup(ClientCnxn.java:1403)
[org.apache.zookeeper-zookeeper-3.6.3.jar:3.6.3]
at
org.apache.zookeeper.ClientCnxn$SendThread.cleanAndNotifyState(ClientCnxn.java:1331)
[org.apache.zookeeper-zookeeper-3.6.3.jar:3.6.3]
at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1309)
[org.apache.zookeeper-zookeeper-3.6.3.jar:3.6.3]
Caused by: org.apache.zookeeper.KeeperException$SessionExpiredException:
KeeperErrorCode = Session expired for /ledgers/00/0013/L5600
at
org.apache.zookeeper.KeeperException.create(KeeperException.java:134)
~[org.apache.zookeeper-zookeeper-3.6.3.jar:3.6.3]
at org.apache.zookeeper.KeeperException.create(KeeperException.java:54)
~[org.apache.zookeeper-zookeeper-3.6.3.jar:3.6.3]
... 10 more
2022-06-17T13:27:16,825+0800 [main-SendThread(10.187.128.229:2181)] WARN
org.apache.bookkeeper.client.LedgerHandle - Close failed: ZKException: Error
while using ZooKeeper
2022-06-17T13:27:16,988+0800 [ZKC-connect-executor-0-EventThread] INFO
org.apache.bookkeeper.discover.ZKRegistrationClient - Update BookieInfoCache
(writable bookie) 10.187.128.232:3181 -> BookieServiceInfo{properties={},
endpoints=[EndpointInfo{id=bookie, port=3181, host=10.187.128.232,
protocol=bookie-rpc, auth=[], extensions=[]}]}
2022-06-17T13:27:16,988+0800 [ZKC-connect-executor-0-EventThread] INFO
org.apache.bookkeeper.discover.ZKRegistrationClient - Update BookieInfoCache
(writable bookie) 10.187.128.231:3181 -> BookieServiceInfo{properties={},
endpoints=[EndpointInfo{id=bookie, port=3181, host=10.187.128.231,
protocol=bookie-rpc, auth=[], extensions=[]}]}
2022-06-17T13:27:16,989+0800 [ZKC-connect-executor-0-EventThread] INFO
org.apache.bookkeeper.discover.ZKRegistrationClient - Update BookieInfoCache
(writable bookie) 10.187.128.230:3181 -> BookieServiceInfo{properties={},
endpoints=[EndpointInfo{id=bookie, port=3181, host=10.187.128.230,
protocol=bookie-rpc, auth=[], extensions=[]}]}
2022-06-17T13:27:17,015+0800 [main-SendThread(10.187.128.227:2181)] INFO
org.apache.zookeeper.ClientCnxn - Opening socket connection to server
10.187.128.227/10.187.128.227:2181.
2022-06-17T13:27:17,015+0800 [main-SendThread(10.187.128.227:2181)] INFO
org.apache.zookeeper.ClientCnxn - SASL config status: Will not attempt to
authenticate using SASL (unknown error)
2022-06-17T13:27:17,015+0800 [main-SendThread(10.187.128.227:2181)] INFO
org.apache.zookeeper.ClientCnxn - Socket connection established, initiating
session, client: /10.187.128.226:49916, server:
10.187.128.227/10.187.128.227:2181
2022-06-17T13:27:17,020+0800 [main-SendThread(10.187.128.227:2181)] WARN
org.apache.zookeeper.ClientCnxn - Unable to reconnect to ZooKeeper service,
session 0x304454ce5fc006e has expired
2022-06-17T13:27:17,020+0800 [main-SendThread(10.187.128.227:2181)] WARN
org.apache.zookeeper.ClientCnxn - Session 0x304454ce5fc006e for sever
10.187.128.227/10.187.128.227:2181, Closing socket connection. Attempting
reconnect except it is a SessionExpiredException.
org.apache.zookeeper.ClientCnxn$SessionExpiredException: Unable to reconnect
to ZooKeeper service, session 0x304454ce5fc006e has expired
at
org.apache.zookeeper.ClientCnxn$SendThread.onConnected(ClientCnxn.java:1434)
~[org.apache.zookeeper-zookeeper-3.6.3.jar:3.6.3]
at
org.apache.zookeeper.ClientCnxnSocket.readConnectResult(ClientCnxnSocket.java:154)
~[org.apache.zookeeper-zookeeper-3.6.3.jar:3.6.3]
at
org.apache.zookeeper.ClientCnxnSocketNIO.doIO(ClientCnxnSocketNIO.java:86)
~[org.apache.zookeeper-zookeeper-3.6.3.jar:3.6.3]
at
org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350)
~[org.apache.zookeeper-zookeeper-3.6.3.jar:3.6.3]
at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1290)
[org.apache.zookeeper-zookeeper-3.6.3.jar:3.6.3]
2022-06-17T13:27:17,331+0800 [pulsar-timer-52-1] INFO
org.apache.pulsar.client.impl.ConnectionHandler -
[public/__kafka/__consumer_offsets-partition-48] [pulsar-cluster-test-148-18]
Reconnecting after connection was closed
2022-06-17T13:27:17,591+0800 [BookKeeperClientWorker-OrderedExecutor-2-0]
INFO org.apache.bookkeeper.mledger.impl.ManagedLedgerImpl -
[gmtp_cffex/SF_test/persistent/bar] Creating a new ledger
2022-06-17T13:27:17,592+0800 [BookKeeperClientWorker-OrderedExecutor-2-0]
INFO org.apache.bookkeeper.mledger.impl.ManagedLedgerImpl -
[gmtp_cffex/SF_test/persistent/bar] Creating ledger, metadata: {component=[109,
97, 110, 97, 103, 101, 100, 45, 108, 101, 100, 103, 101, 114],
pulsar/managed-ledger=[103, 109, 116, 112, 95, 99, 102, 102, 101, 120, 47, 83,
70, 95, 116, 101, 115, 116, 47, 112, 101, 114, 115, 105, 115, 116, 101, 110,
116, 47, 98, 97, 114], application=[112, 117, 108, 115, 97, 114]} - metadata
ops timeout : 60 seconds
2022-06-17T13:27:17,592+0800 [BookKeeperClientWorker-OrderedExecutor-2-0]
WARN org.apache.bookkeeper.client.RackawareEnsemblePlacementPolicyImpl -
Failed to find 2 bookies : excludeBookies [<Bookie:10.187.128.232:3181>,
<Bookie:10.187.128.231:3181>, <Bookie:10.187.128.230:3181>], allBookies
[<Bookie:10.187.128.232:3181>, <Bookie:10.187.128.230:3181>,
<Bookie:10.187.128.231:3181>].
2022-06-17T13:27:17,592+0800 [BookKeeperClientWorker-OrderedExecutor-2-0]
WARN org.apache.bookkeeper.client.BookieWatcherImpl - New ensemble:
[10.187.128.232:3181, 10.187.128.231:3181] is not adhering to Placement Policy
2022-06-17T13:27:17,599+0800 [ZKC-connect-executor-0-EventThread] INFO
org.apache.bookkeeper.client.LedgerCreateOp - Ensemble: [10.187.128.232:3181,
10.187.128.231:3181] for ledger: 135693
2022-06-17T13:27:17,600+0800 [ZKC-connect-executor-0-EventThread] INFO
org.apache.bookkeeper.mledger.impl.ManagedLedgerImpl -
[gmtp_cffex/SF_test/persistent/bar] Created new ledger 135693
2022-06-17T13:27:18,053+0800 [metadata-store-zk-session-watcher-7-1] WARN
org.apache.pulsar.metadata.impl.ZKSessionWatcher - ZooKeeper client is
disconnected. Waiting to reconnect, time remaining = 22.988 seconds
2022-06-17T13:27:19,189+0800 [pulsar-web-37-7] INFO
org.eclipse.jetty.server.RequestLog - 10.187.98.151 - - [17/Jun/2022:13:27:19
+0800] "GET /metrics?format=prometheus HTTP/1.1" 302 0 "-" "Prometheus/2.16.0" 0
2022-06-17T13:27:19,248+0800 [prometheus-stats-38-1] INFO
org.eclipse.jetty.server.RequestLog - 10.187.98.151 - - [17/Jun/2022:13:27:19
+0800] "GET /metrics/?format=prometheus HTTP/1.1" 200 625724
"http://10.187.128.226:8080/metrics?format=prometheus" "Prometheus/2.16.0" 58
2022-06-17T13:27:19,622+0800 [pulsar-timer-52-1] INFO
org.apache.pulsar.client.impl.ConnectionHandler -
[public/__kafka/__consumer_offsets-partition-40] [pulsar-cluster-test-148-8]
Reconnecting after connection was closed
2022-06-17T13:27:20,054+0800 [metadata-store-zk-session-watcher-7-1] WARN
org.apache.pulsar.metadata.impl.ZKSessionWatcher - ZooKeeper client is
disconnected. Waiting to reconnect, time remaining = 20.987 seconds
2022-06-17T13:27:21,966+0800 [pulsar-timer-52-1] INFO
org.apache.pulsar.client.impl.ConnectionHandler -
[public/__kafka/__consumer_offsets-partition-35] [pulsar-cluster-test-148-9]
Reconnecting after connection was closed
2022-06-17T13:27:22,054+0800 [metadata-store-zk-session-watcher-7-1] WARN
org.apache.pulsar.metadata.impl.ZKSessionWatcher - ZooKeeper client is
disconnected. Waiting to reconnect, time remaining = 18.987 seconds
2022-06-17T13:27:23,011+0800 [pulsar-timer-52-1] INFO
org.apache.pulsar.client.impl.ConnectionHandler -
[public/__kafka/__consumer_offsets-partition-25] [reader-fc21d8c429]
Reconnecting after connection was closed
2022-06-17T13:27:24,054+0800 [metadata-store-zk-session-watcher-7-1] WARN
org.apache.pulsar.metadata.impl.ZKSessionWatcher - ZooKeeper client is
disconnected. Waiting to reconnect, time remaining = 16.987 seconds
2022-06-17T13:27:24,189+0800 [pulsar-web-37-19] INFO
org.eclipse.jetty.server.RequestLog - 10.187.98.151 - - [17/Jun/2022:13:27:24
+0800] "GET /metrics?format=prometheus HTTP/1.1" 302 0 "-" "Prometheus/2.16.0" 0
2022-06-17T13:27:24,257+0800 [prometheus-stats-38-1] INFO
org.eclipse.jetty.server.RequestLog - 10.187.98.151 - - [17/Jun/2022:13:27:24
+0800] "GET /metrics/?format=prometheus HTTP/1.1" 200 625723
"http://10.187.128.226:8080/metrics?format=prometheus" "Prometheus/2.16.0" 68
2022-06-17T13:27:25,455+0800 [pulsar-timer-52-1] INFO
org.apache.pulsar.client.impl.ConnectionHandler -
[public/__kafka/__consumer_offsets-partition-15] [reader-cf54e565f4]
Reconnecting after connection was closed
2022-06-17T13:27:26,055+0800 [metadata-store-zk-session-watcher-7-1] WARN
org.apache.pulsar.metadata.impl.ZKSessionWatcher - ZooKeeper client is
disconnected. Waiting to reconnect, time remaining = 14.986 seconds
2022-06-17T13:27:28,055+0800 [metadata-store-zk-session-watcher-7-1] WARN
org.apache.pulsar.metadata.impl.ZKSessionWatcher - ZooKeeper client is
disconnected. Waiting to reconnect, time remaining = 12.986 seconds
2022-06-17T13:27:29,189+0800 [pulsar-web-37-7] INFO
org.eclipse.jetty.server.RequestLog - 10.187.98.151 - - [17/Jun/2022:13:27:29
+0800] "GET /metrics?format=prometheus HTTP/1.1" 302 0 "-" "Prometheus/2.16.0" 1
2022-06-17T13:27:29,239+0800 [pulsar-timer-52-1] INFO
org.apache.pulsar.client.impl.ConnectionHandler - [public/__kafka/__consume
**To Reproduce**
This happens occasionally.
**Desktop (please complete the following information):**
CentOS Linux release 7.4.1708
**Additional context**
Pulsar 2.9.2
KoP 2.9.2.20
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]