chihsuan commented on PR #10535:
URL: https://github.com/apache/ozone/pull/10535#issuecomment-4814149193
@adoroszlai I've opened #10620 for the residual flakiness.
**TL;DR:** 1s report interval alone wasn't enough. On datanode restart a
replica is briefly reported `CLOSING`, which makes SCM resurrect the container
out of `DELETING/DELETED` (`CLOSING` → `QUASI_CLOSED`), so replicas never get
deleted and it times out. The restart was only needed under the old 60-min
interval (#6967); at 1s the periodic report already drives deletion, so #10620
just drops the restart (also ~25% faster).
cc @amaliujia
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]