apoorvmittal10 commented on PR #17645: URL: https://github.com/apache/kafka/pull/17645#issuecomment-2450070393
> I remain unconvinced by this PR. There is certainly a problem somewhere in this area, but I was unable to reproduce it yesterday in spite of trying. I'll try again today. But I want to understand WHY the problem is occurring before approving a change to fix it.

Previously, we never removed the cached SharePartition from the manager, even when initialization failed with a NOT_COORDINATOR error. `testShareGroups` used to pass because that test validates group-related data, not consumption. The response from the read state persister RPC is flaky: sometimes a successful response appears, and sometimes it does not. The check in this PR adds retries on NOT_COORDINATOR.

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: jira-unsubscr...@kafka.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org