thetumbled opened a new issue, #21989:
URL: https://github.com/apache/pulsar/issues/21989

   ### Search before asking
   
   - [X] I searched in the [issues](https://github.com/apache/pulsar/issues) and found nothing similar.
   
   
   ### Version
   
   master
   
   ### Minimal reproduce step
   
   We run a health check against the cluster every minute through the REST API
   `org.apache.pulsar.broker.admin.impl.BrokersBase#healthCheck`. Recently we hit
   a case where the health check failed.
   ```
   bin/pulsar-admin --admin-url http://XXX brokers healthcheck
   ```
   <img width="417" alt="image" src="https://github.com/apache/pulsar/assets/52550727/d39e50ce-fff4-4ab7-98f0-e5da3bac6417">
   
   There is no error in the broker log. The random message was produced to the
   health-check topic, but the reader failed to read it back, so the logic got
   stuck at
   ```
   org.apache.pulsar.broker.admin.impl.BrokersBase#healthCheckRecursiveReadNext
   ```
   As a result, the producers and readers that were created cannot be closed,
   and the scheduled async tasks cannot complete either.
   For example:
   `bin/pulsar-admin topics stats persistent://pulsar/XXX/healthcheck`
   
![image](https://github.com/apache/pulsar/assets/52550727/6482b3e0-5078-4d56-840b-c8650a2a92fa)
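
   The stuck recursive read described above can be illustrated with a minimal
   sketch. This is not the code in `BrokersBase`; `AsyncReader`, `readUntil`, and
   `healthCheck` are hypothetical names made up for the example, and only
   `CompletableFuture` from the JDK (Java 9+ for `orTimeout`) is used. It shows
   one possible way to bound the wait so that cleanup always runs, assuming the
   read future may never complete.

   ```java
   import java.util.concurrent.CompletableFuture;
   import java.util.concurrent.TimeUnit;

   public class HealthCheckTimeoutSketch {

       // Hypothetical stand-in for a reader: each call yields the next message asynchronously.
       public interface AsyncReader {
           CompletableFuture<String> readNextAsync();
       }

       // Recursively reads messages until the expected payload is seen.
       // If readNextAsync() never completes, this future never completes either.
       public static CompletableFuture<Void> readUntil(AsyncReader reader, String expected) {
           return reader.readNextAsync().thenCompose(msg -> {
               if (expected.equals(msg)) {
                   return CompletableFuture.<Void>completedFuture(null);
               }
               return readUntil(reader, expected);
           });
       }

       // Bounds the recursive read with a timeout and runs cleanup on success,
       // failure, or timeout (for example, closing the producer and reader).
       public static CompletableFuture<Void> healthCheck(
               AsyncReader reader, String expected, long timeoutMs, Runnable cleanup) {
           return readUntil(reader, expected)
                   .orTimeout(timeoutMs, TimeUnit.MILLISECONDS)
                   .whenComplete((ignored, error) -> cleanup.run());
       }
   }
   ```

   With a reader whose future never completes, `healthCheck(...)` fails with a
   `TimeoutException` after `timeoutMs` and `cleanup` still runs, instead of
   leaving the producer and reader open indefinitely.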
   
   
   
   
   ### What did you expect to see?
   
   1
   
   ### What did you see instead?
   
   1
   
   ### Anything else?
   
   _No response_
   
   ### Are you willing to submit a PR?
   
   - [X] I'm willing to submit a PR!

