lordcheng10 opened a new pull request #2973:
URL: https://github.com/apache/bookkeeper/pull/2973


   ### Motivation
   When checkAllLedgers is scheduled periodically, because it will try to read 
almost all entry data, it may cause the bookkeeper to time out and cause the 
entry to be incorrectly marked with markLedgerUnderreplicatedAsync.
   In our cluster, the execution cycle of checkAllLedgers is 1 week. Then we 
found that a large number of ledger will be marked 
markLedgerUnderreplicatedAsync each time it is executed. Analyzing the log 
found that there are some reading bookkeeper timeouts:
   
![image](https://user-images.githubusercontent.com/19296967/148194737-142e38ab-119e-4466-a7f8-75f9ce9e1d2b.png)
   
   
![image](https://user-images.githubusercontent.com/19296967/148195160-3cab0783-5f4d-4c79-842d-4fa056ba6507.png)
   
   Due to too many read requests, the cluster pressure is too high, and the 
latency of pulsar's write time continues to soar until the recovery is 
completed.:
   
![image](https://user-images.githubusercontent.com/19296967/148195503-277cec00-0a08-45a1-9228-4c1a46dc2fd9.png)
   
   
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to