tisonkun commented on issue #13954:
URL: https://github.com/apache/pulsar/issues/13954#issuecomment-1157622153
Could you verify whether this exception occurs while the client is experiencing a ZK
disconnection? During a disconnection, it's possible that the op succeeds on the
server but the response never reaches the client. Since we retry on CONNECTIONLOSS,
the retried op can then fail against state that the first attempt already changed.
#13892 may be the same situation. cc @MichalKoziorowski-TomTom
```java
// ZKMetadataStore#batchOperation 8914b84115bacaf38fe892d66533c1b70431acbf
if (results == null) {
    Code code = Code.get(rc);
    if (code == Code.CONNECTIONLOSS) {
        // There is the chance that we caused a connection reset by sending or requesting a batch
        // that passed the max ZK limit. Retry with the individual operations
        executor.schedule(() -> {
            ops.forEach(o -> batchOperation(Collections.singletonList(o)));
        }, 100, TimeUnit.MILLISECONDS);
    } else {
        MetadataStoreException e = getException(code, path);
        ops.forEach(o -> o.getFuture().completeExceptionally(e));
    }
    return;
}
```
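To make the suspected sequence concrete, here is a minimal sketch that does not use the real ZooKeeper client. `FakeServer`, its `dropNextResponse` switch, the local `Code` enum, and `setDataWithRetry` are all hypothetical names made up for illustration, and BADVERSION is only an assumed example of how the retried op might fail; the exception seen in this issue may differ.

```java
import java.util.HashMap;
import java.util.Map;

public class ConnectionLossRetryDemo {

    /** Hypothetical result codes, loosely modeled on ZooKeeper's KeeperException.Code. */
    enum Code { OK, CONNECTIONLOSS, BADVERSION }

    /** Hypothetical in-memory stand-in for a ZK server holding versioned nodes. */
    static class FakeServer {
        private final Map<String, Integer> versions = new HashMap<>();
        private boolean dropNextResponse;

        void put(String path, int version) { versions.put(path, version); }

        int version(String path) { return versions.get(path); }

        /** Simulate a disconnection that loses the response of the next applied write. */
        void dropNextResponse() { dropNextResponse = true; }

        /** Conditional write: applies only if the caller's expected version matches. */
        Code setData(String path, int expectedVersion) {
            int current = versions.get(path);
            if (current != expectedVersion) {
                return Code.BADVERSION;
            }
            versions.put(path, current + 1); // the write is applied on the server
            if (dropNextResponse) {
                dropNextResponse = false;
                return Code.CONNECTIONLOSS;  // but the client never sees the success
            }
            return Code.OK;
        }
    }

    /** Retry once on CONNECTIONLOSS, the way a naive client-side retry would. */
    static Code setDataWithRetry(FakeServer server, String path, int expectedVersion) {
        Code code = server.setData(path, expectedVersion);
        if (code == Code.CONNECTIONLOSS) {
            code = server.setData(path, expectedVersion);
        }
        return code;
    }

    public static void main(String[] args) {
        FakeServer server = new FakeServer();
        server.put("/demo", 0);
        server.dropNextResponse();
        Code result = setDataWithRetry(server, "/demo", 0);
        // prints "result=BADVERSION, serverVersion=1"
        System.out.println("result=" + result + ", serverVersion=" + server.version("/demo"));
    }
}
```

The first attempt bumps the version on the server but reports CONNECTIONLOSS, so the retry runs against the already-updated node and fails even though the original write took effect.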