dirrao commented on code in PR #36882:
URL: https://github.com/apache/airflow/pull/36882#discussion_r1476927807
##########
airflow/providers/cncf/kubernetes/provider.yaml:
##########
@@ -350,6 +350,15 @@ config:
type: string
example: ~
default: ""
+ task_publish_max_retries:
+ description: |
+ The Maximum number of retries for queuing the task to the kubernetes
scheduler when
+ failing due to Kube API transient errors before giving up and
marking task as failed.
Review Comment:
The existing code put the task in the queue indefinitely for all the error
codes except (400, 404, 422). I have retained the old behavior with a -1 flag.
Do you want me to requeue the task when the quota is exceeded only, then fail
the rest of the error codes?
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]