jedcunningham commented on code in PR #66412:
URL: https://github.com/apache/airflow/pull/66412#discussion_r3191433560
##########
airflow-core/src/airflow/config_templates/config.yml:
##########
@@ -2689,6 +2689,17 @@ triggerer:
type: float
example: ~
default: "30"
+ runner_health_check_threshold:
+ description: |
+ If the TriggerRunner subprocess's async event loop sends no communication to the parent
+ process for more than this many seconds, the parent stops updating the triggerer's
+ heartbeat in the database. The triggerer then appears unhealthy to the scheduler, which
+ will reassign its triggers. This detects a deadlocked or hung event loop that the normal
+ process-alive check cannot catch. Set to 0 to disable the watchdog.
+ version_added: 3.2.2
+ type: float
+ example: ~
+ default: "30"
Review Comment:
30s is generous enough - 5 minutes is _way_ too long IMO.
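
For context, the behaviour the description promises can be sketched roughly as below. This is a minimal illustration under stated assumptions, not the PR's implementation: `HeartbeatWatchdog`, `record_message`, and `should_heartbeat` are hypothetical names, and the only things taken from the diff are the threshold semantics ("more than this many seconds") and the rule that 0 disables the check.

```python
import time


class HeartbeatWatchdog:
    """Hypothetical helper: decides whether the parent should keep heartbeating."""

    def __init__(self, threshold: float, clock=time.monotonic):
        # threshold mirrors runner_health_check_threshold (seconds); 0 disables the check.
        self.threshold = threshold
        self._clock = clock
        self._last_message = clock()

    def record_message(self) -> None:
        # Call whenever the runner subprocess sends anything to the parent.
        self._last_message = self._clock()

    def should_heartbeat(self) -> bool:
        # Disabled watchdog: always keep updating the heartbeat.
        if self.threshold <= 0:
            return True
        # Stop heartbeating only once the silence exceeds the threshold.
        return (self._clock() - self._last_message) <= self.threshold
```

With the default of 30, a runner that stays silent for more than 30 seconds would stop the heartbeat updates until its next message arrives.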