[ https://issues.apache.org/jira/browse/KAFKA-15408?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17759529#comment-17759529 ]
Sagar Rao commented on KAFKA-15408:
-----------------------------------

[~patrickpang], thanks for filing this! IMO, this is a feature that is long overdue in the Connect framework. Do you plan to pick this one up? I ask because, if the answer is yes, we would need a KIP for this, since it could change existing behaviour: the status endpoint responses might no longer reflect a task failure as soon as the task fails. Also, the configurable max-tries possibly means the addition of a new config.

> Restart failed tasks in Kafka Connect up to a configurable max-tries
> --------------------------------------------------------------------
>
>                 Key: KAFKA-15408
>                 URL: https://issues.apache.org/jira/browse/KAFKA-15408
>             Project: Kafka
>          Issue Type: New Feature
>          Components: KafkaConnect
>            Reporter: Patrick Pang
>            Priority: Major
>
> h2. Issue
> Currently, Kafka Connect just reports failed tasks, with the error, on the
> REST API. Users are expected to monitor the status and restart individual
> connectors if there are transient errors. Unfortunately these are common for
> database connectors, e.g. transient connection errors, DNS flips, database
> downtime, etc. Kafka Connect silently failing in these scenarios would
> lead to stale data downstream.
> h2. Proposal
> Kafka Connect should be able to restart failed tasks automatically, up to a
> configurable max-tries.
> h2. Prior art
> * [https://github.com/strimzi/proposals/blob/main/007-restarting-kafka-connect-connectors-and-tasks.md]
> * [https://docs.aiven.io/docs/products/kafka/kafka-connect/howto/enable-automatic-restart]

--
This message was sent by Atlassian Jira
(v8.20.10#820010)
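For illustration only, the behaviour asked for in the quoted proposal (restart failed tasks, but give up after a configurable number of tries) can be sketched as an external watcher, which is roughly what users have to run today. This is not a design for the Connect framework itself. `TaskRestarter`, `max_tries`, `handle_status` and the `restart` callback are hypothetical names. The sketch assumes the status body has the shape returned by Connect's existing `GET /connectors/{name}/status` endpoint, and that `restart` wraps `POST /connectors/{name}/tasks/{id}/restart`.

```python
from collections import defaultdict
from typing import Callable, Dict, List, Tuple

TaskKey = Tuple[str, int]  # (connector name, task id)


class TaskRestarter:
    """Hypothetical helper: restart FAILED tasks, at most max_tries times each."""

    def __init__(self, max_tries: int, restart: Callable[[str, int], None]):
        self.max_tries = max_tries
        # Assumed to wrap POST /connectors/{name}/tasks/{id}/restart.
        self.restart = restart
        self.attempts: Dict[TaskKey, int] = defaultdict(int)

    def handle_status(self, status: dict) -> List[int]:
        """Process one connector status body; return the task ids restarted."""
        name = status["name"]
        restarted: List[int] = []
        for task in status.get("tasks", []):
            key = (name, task["id"])
            if task["state"] == "RUNNING":
                # Task is healthy again: give it a fresh retry budget.
                self.attempts.pop(key, None)
                continue
            if task["state"] != "FAILED":
                continue  # e.g. PAUSED or UNASSIGNED: nothing to do
            if self.attempts[key] >= self.max_tries:
                continue  # budget exhausted: leave FAILED for an operator
            self.attempts[key] += 1
            self.restart(name, task["id"])
            restarted.append(task["id"])
        return restarted
```

A caller would poll the status endpoint on a timer and pass each response body to `handle_status`. Resetting the counter as soon as a task is seen RUNNING is a simplification: a task that briefly recovers and then fails again gets a fresh budget, so a real implementation would likely want a time window or backoff as well.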