Stéphane Derosiaux created KAFKA-9639:
-----------------------------------------
Summary: Task is not killed despite being dead
Key: KAFKA-9639
URL: https://issues.apache.org/jira/browse/KAFKA-9639
Project: Kafka
Issue Type: Bug
Components: KafkaConnect
Affects Versions: 2.3.1
Reporter: Stéphane Derosiaux
After restarting my PostgreSQL server, my Debezium connector failed and stopped
(no more metrics; see the logs below), but both the task and the connector were
still reported as "RUNNING" in Connect.
It seems the task's "stop" procedure was interrupted by the Kafka Connect
framework, which left the connector in a "zombie" state.
{code:java}
WorkerSourceTask{id=customer-order-cdc-0} Task threw an uncaught and
unrecoverable exception (org.apache.kafka.connect.runtime.WorkerTask:179)
org.apache.kafka.connect.errors.ConnectException: An exception occurred in the
change event producer. This connector will be stopped.
Caused by: org.postgresql.util.PSQLException: Database connection failed when
reading from copy
WorkerSourceTask{id=customer-order-cdc-0} Task is being killed and will not
recover until manually restarted
(org.apache.kafka.connect.runtime.WorkerTask:180)
Creating thread debezium-postgresconnector-customer-order-dev-error-handler
(io.debezium.util.Threads:268)
Connector has already been stopped
(io.debezium.connector.postgresql.PostgresConnectorTask:241)
Finished streaming (io.debezium.pipeline.ChangeEventSourceCoordinator:100)
Connected metrics set to 'false'
(io.debezium.pipeline.metrics.StreamingChangeEventSourceMetrics:59)
[2020-03-02 08:48:08,074] ERROR [customer-order-cdc|task-0] Interrupted while
stopping (io.debezium.connector.postgresql.PostgresConnectorTask:263)
java.lang.InterruptedException
[Producer clientId=connector-producer-customer-order-cdc-0] Closing the Kafka
producer with timeoutMillis = 30000 ms.
(org.apache.kafka.clients.producer.KafkaProducer:1153)
{code}
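For anyone trying to reproduce this, the reported state and the manual restart mentioned in the log can be exercised through the Kafka Connect REST API. The following is only a minimal sketch, not a fix for the issue: the worker address http://localhost:8083 is an assumption, the connector name and task id are inferred from the log line id=customer-order-cdc-0, and the class and method names are made up for illustration. It only builds the two requests (status lookup and task restart) with the JDK 11+ HTTP client; nothing is sent.

```java
import java.net.URI;
import java.net.http.HttpRequest;

// Hypothetical helper, not part of Kafka Connect or Debezium.
final class ConnectTaskRequests {

    // GET /connectors/{name}/status returns the connector and task states
    // (in this report, both still showed RUNNING).
    public static HttpRequest statusRequest(String baseUrl, String connector) {
        URI uri = URI.create(baseUrl + "/connectors/" + connector + "/status");
        return HttpRequest.newBuilder(uri).GET().build();
    }

    // POST /connectors/{name}/tasks/{id}/restart asks the worker to restart one task.
    public static HttpRequest restartTaskRequest(String baseUrl, String connector, int taskId) {
        URI uri = URI.create(baseUrl + "/connectors/" + connector + "/tasks/" + taskId + "/restart");
        return HttpRequest.newBuilder(uri)
                .POST(HttpRequest.BodyPublishers.noBody())
                .build();
    }

    public static void main(String[] args) {
        // Assumed worker address; adjust to the actual Connect REST endpoint.
        String baseUrl = "http://localhost:8083";
        HttpRequest status = statusRequest(baseUrl, "customer-order-cdc");
        HttpRequest restart = restartTaskRequest(baseUrl, "customer-order-cdc", 0);
        System.out.println(status.method() + " " + status.uri());
        System.out.println(restart.method() + " " + restart.uri());
    }
}
```

To actually send a request, pass it to HttpClient.newHttpClient().send(request, HttpResponse.BodyHandlers.ofString()). Given the behaviour described above, the status response may keep showing RUNNING even though the task is dead, so it should not be relied on as a health check here.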
--
This message was sent by Atlassian Jira
(v8.3.4#803005)