jadami10 opened a new pull request, #11040: URL: https://github.com/apache/pinot/pull/11040
This is an issue we saw internally. A DNS lookup failure would cause `KafkaConsumer` creation to fail, which in turn put the segment into an error state. There's a long chain of calls all the way to `org.apache.pinot.core.data.manager.realtime.LLRealtimeSegmentDataManager#makeStreamConsumer` where we could add retries, but at that level it's not clear that you're necessarily using Kafka. So we added the retry at the boundary between Pinot and Kafka code. We've been running this internally for ~2 weeks, and it continues to catch and retry errors without producing error segments.
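For readers who want a rough picture of retrying at the consumer-creation boundary, here is a minimal sketch. It is not the code from this PR: the class `ConsumerCreationRetry`, the method `createWithRetry`, the attempt count, and the doubling delay are all assumptions made for illustration, and a plain `Callable` stands in for whatever call actually constructs the `KafkaConsumer`.

```java
import java.util.concurrent.Callable;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical helper for illustration only; not the implementation in this PR.
final class ConsumerCreationRetry {
  private ConsumerCreationRetry() {}

  /**
   * Calls factory up to maxAttempts times, sleeping between failed attempts
   * and doubling the delay each time. Rethrows the last failure if every
   * attempt fails.
   */
  static <T> T createWithRetry(Callable<T> factory, int maxAttempts, long initialDelayMs)
      throws Exception {
    if (maxAttempts < 1) {
      throw new IllegalArgumentException("maxAttempts must be >= 1");
    }
    long delayMs = initialDelayMs;
    Exception last = null;
    for (int attempt = 1; attempt <= maxAttempts; attempt++) {
      try {
        return factory.call();
      } catch (InterruptedException e) {
        // Do not retry on interruption; restore the flag and propagate.
        Thread.currentThread().interrupt();
        throw e;
      } catch (Exception e) {
        last = e;
        if (attempt < maxAttempts) {
          Thread.sleep(delayMs);
          delayMs *= 2;
        }
      }
    }
    throw last;
  }

  public static void main(String[] args) throws Exception {
    AtomicInteger calls = new AtomicInteger();
    // Stand-in for consumer construction: fails twice, as a transient DNS error might.
    String consumer = createWithRetry(() -> {
      if (calls.incrementAndGet() < 3) {
        throw new IllegalStateException("simulated DNS lookup failure");
      }
      return "consumer";
    }, 5, 10L);
    System.out.println(consumer + " created after " + calls.get() + " attempts");
  }
}
```

Keeping the wrapper around the construction call alone means the generic stream-consumer path above it does not need to know that Kafka is involved, which matches the reasoning given for where the retry was placed.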
