[ 
https://issues.apache.org/jira/browse/KAFKA-19697?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Martin Andersson updated KAFKA-19697:
-------------------------------------
    Environment: Kafka connect cluster with 20 workers running in kubernetes, 
on homebrewed kafka images built from eclipse-temurin:21-jre-alpine-3.21 and 
the official kafka 4.0.0 distribution  (was: Kafka connect cluster with 20 
workers running in kubernetes, on homebrewed kafka images built from 
eclipse-temurin:21-jre-alpine-3.21)

> NPE Cannot invoke 
> org.apache.kafka.connect.runtime.ConnectMetrics$MetricGroup.close()
> -------------------------------------------------------------------------------------
>
>                 Key: KAFKA-19697
>                 URL: https://issues.apache.org/jira/browse/KAFKA-19697
>             Project: Kafka
>          Issue Type: Bug
>          Components: connect
>    Affects Versions: 4.0.0
>         Environment: Kafka connect cluster with 20 workers running in 
> kubernetes, on homebrewed kafka images built from 
> eclipse-temurin:21-jre-alpine-3.21 and the official kafka 4.0.0 distribution
>            Reporter: Martin Andersson
>            Priority: Major
>              Labels: connect, connect-worker
>
> Several tasks in multiple sink connectors in a long-running connect cluster 
> broke spontaneously (within a couple of hours) with the following stacktrace:
> {code:java}
> java.lang.NullPointerException: Cannot invoke 
> \"org.apache.kafka.connect.runtime.ConnectMetrics$MetricGroup.close()\" 
> because the return value of 
> \"java.util.concurrent.ConcurrentMap.get(Object)\" is null
>     at 
> org.apache.kafka.connect.runtime.Worker$ConnectorStatusMetricsGroup.recordTaskRemoved(Worker.java:2333)
>     at org.apache.kafka.connect.runtime.Worker.startTask(Worker.java:707)
>     at org.apache.kafka.connect.runtime.Worker.startSinkTask(Worker.java:568)
>     at 
> org.apache.kafka.connect.runtime.distributed.DistributedHerder.startTask(DistributedHerder.java:2009)
>     at 
> org.apache.kafka.connect.runtime.distributed.DistributedHerder.lambda$getTaskStartingCallable$39(DistributedHerder.java:2059)
>     at java.base/java.util.concurrent.FutureTask.run(Unknown Source)
>     at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown 
> Source)
>     at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown 
> Source)
>     at java.base/java.lang.Thread.run(Unknown Source) {code}
> Restarting the failed tasks with the REST API lead to another task failure 
> with the following stacktrace:
> {code:java}
> java.lang.NullPointerException: Cannot invoke \"java.util.Map.size()\" 
> because \"inputMap\" is null
>     at 
> org.apache.kafka.common.utils.Utils.castToStringObjectMap(Utils.java:1476)
>     at 
> org.apache.kafka.common.config.AbstractConfig.<init>(AbstractConfig.java:112)
>     at 
> org.apache.kafka.common.config.AbstractConfig.<init>(AbstractConfig.java:146)
>     at org.apache.kafka.connect.runtime.TaskConfig.<init>(TaskConfig.java:51)
>     at org.apache.kafka.connect.runtime.Worker.startTask(Worker.java:661)
>     at org.apache.kafka.connect.runtime.Worker.startSinkTask(Worker.java:568)
>     at 
> org.apache.kafka.connect.runtime.distributed.DistributedHerder.startTask(DistributedHerder.java:2009)
>     at 
> org.apache.kafka.connect.runtime.distributed.DistributedHerder.lambda$getTaskStartingCallable$39(DistributedHerder.java:2059)
>     at java.base/java.util.concurrent.FutureTask.run(Unknown Source)
>     at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown 
> Source)
>     at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown 
> Source)
>     at java.base/java.lang.Thread.run(Unknown Source)
>  {code}
> Likely related to KAFKA-17719 ?
> The failed tasks did not show up on the _connector-failed-task-count_ metric 
> (or in the _restarting/paused/failed_ task metrics), but the failing tasks 
> disappeared from the connector-running-task-count metric.
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to