1fanwang opened a new issue, #56273:
URL: https://github.com/apache/airflow/issues/56273

   ### Description
   
   Currently, when a DAG run times out and is marked as failed, the 
notify_dagrun_state_changed() call doesn't include any context about the 
timeout reason. This makes it difficult for listeners and observability systems 
to distinguish between timeout failures and other types of DAG failures.
   
   ### Use case/motivation
   
   Problem: DAG timeout failures are indistinguishable from other failure types 
in listener, limiting observability and error classification capabilities.
   Solution: Pass a "timed_out" message to notify_dagrun_state_changed() when a 
DAG run fails due to timeout, providing context to listeners about the specific 
failure reason.
   Benefits:
   Enables better error categorization in observability systems
   Allows listeners to handle timeout failures differently from other failure 
types
   Improves debugging and monitoring capabilities
   Maintains backward compatibility (msg parameter is optional)
   
   ### Related issues
   
   _No response_
   
   ### Are you willing to submit a PR?
   
   - [x] Yes I am willing to submit a PR!
   
   ### Code of Conduct
   
   - [x] I agree to follow this project's [Code of 
Conduct](https://github.com/apache/airflow/blob/main/CODE_OF_CONDUCT.md)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to