prabhat2718 opened a new issue, #17446: URL: https://github.com/apache/pinot/issues/17446
- The _SegmentStatusChecker_ periodic task occasionally emits an incorrect _percentsegmentsavailable_ metric by marking segments as OFFLINE when they have already transitioned to the CONSUMING state.
- This issue has only been observed for large tables (40k-100k segments).
- There are no signs of ZK replica lag or a memory bottleneck.
- Increasing _controller.statuschecker.waitForPushTimePeriod_ from 10 min (default) to 20 min has not resolved the issue, since the lag can exceed 30 min, which is unacceptable.

### Example Timeline

Timestamp | Component | Event
-- | -- | --
02:45:57 | Broker | Received new segment _table__28__2014__20251207T2115Z_ via EV update
02:46:12 | Server | Segment transitioned OFFLINE → CONSUMING
03:13:50 | Controller | SegmentStatusChecker reports segment has no ONLINE/CONSUMING replica
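
To make the symptom concrete, here is a minimal Python model of an availability percentage computed from a per-segment replica-state map. It is only an illustration and not Pinot's actual SegmentStatusChecker code: the function name `percent_segments_available`, the server name `Server_1`, the segment `seg_a`, and the map layout are all assumptions. It does not identify the root cause; it only shows how a check that reads an out-of-date view would count a segment that is already CONSUMING as unavailable.

```python
from typing import Dict

# Replica states that count as "available" in this simplified model.
AVAILABLE_STATES = {"ONLINE", "CONSUMING"}


def percent_segments_available(view: Dict[str, Dict[str, str]]) -> float:
    """Percentage of segments with at least one ONLINE or CONSUMING replica.

    `view` maps segment name -> {server instance -> replica state}.
    """
    if not view:
        # Assumption for this sketch: an empty table is reported as fully available.
        return 100.0
    available = sum(
        1
        for replicas in view.values()
        if any(state in AVAILABLE_STATES for state in replicas.values())
    )
    return 100.0 * available / len(view)


# Hypothetical snapshots: the checker reads `stale_view` even though the
# server has already moved the new segment to CONSUMING (`fresh_view`).
stale_view = {
    "seg_a": {"Server_1": "ONLINE"},
    "table__28__2014__20251207T2115Z": {"Server_1": "OFFLINE"},
}
fresh_view = {
    "seg_a": {"Server_1": "ONLINE"},
    "table__28__2014__20251207T2115Z": {"Server_1": "CONSUMING"},
}

print(percent_segments_available(stale_view))
print(percent_segments_available(fresh_view))
```

In this model the metric is only as accurate as the state map it reads, so comparing the timestamp of the map the checker used against the server-side transition time (02:46:12 in the timeline) may help narrow down where the delay is introduced.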
