Gakhramanzode commented on PR #16767:
URL: https://github.com/apache/kafka/pull/16767#issuecomment-2470591629

   @C0urante @frankvicky 
   Hello,
   
   I'm encountering an issue that appears to be related to this pull request.
   
   **Context:**
   - After upgrading from Kafka 3.6.0 to 3.9.0, our MirrorCheckpointConnector 
consistently fails with the error:
   ```bash
    [2024-11-12 16:41:53,084] ERROR [Worker clientId=app-dev->telemetry-dev, 
groupId=app-dev-mm2] Failed to reconfigure connector's tasks 
(MirrorCheckpointConnector), retrying after backoff. 
(org.apache.kafka.connect.runtime.distributed.DistributedHerder:2195)
    org.apache.kafka.connect.errors.RetriableException: Timeout while loading 
consumer groups.
            at 
org.apache.kafka.connect.mirror.MirrorCheckpointConnector.taskConfigs(MirrorCheckpointConnector.java:138)
            at 
org.apache.kafka.connect.runtime.Worker.connectorTaskConfigs(Worker.java:398)
            at 
org.apache.kafka.connect.runtime.distributed.DistributedHerder.reconfigureConnector(DistributedHerder.java:2243)
            at 
org.apache.kafka.connect.runtime.distributed.DistributedHerder.reconfigureConnectorTasksWithExponentialBackoffRetries(DistributedHerder.java:2183)
            at 
org.apache.kafka.connect.runtime.distributed.DistributedHerder.lambda$null$47(DistributedHerder.java:2199)
            at 
org.apache.kafka.connect.runtime.distributed.DistributedHerder.runRequest(DistributedHerder.java:2402)
            at 
org.apache.kafka.connect.runtime.distributed.DistributedHerder.tick(DistributedHerder.java:498)
            at 
org.apache.kafka.connect.runtime.distributed.DistributedHerder.run(DistributedHerder.java:383)
            at 
java.base/java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:539)
            at 
java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
            at 
java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1136)
            at 
java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:635)
            at java.base/java.lang.Thread.run(Thread.java:840)
   ```
   - Increasing `admin.timeout.ms` and related timeout settings did not resolve 
the issue.
   - The problem seems to stem from `knownConsumerGroups` remaining `null`, 
causing `taskConfigs()` to throw a `RetriableException`.
   
   **Questions:**
   - Alternatively, are there any suggested workarounds to mitigate this issue 
in the meantime?
   
   Thank you for your efforts in resolving this problem. Any assistance would 
be greatly appreciated.
   
   Best regards


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to