ShubhamGondane opened a new pull request, #63569:
URL: https://github.com/apache/airflow/pull/63569
When a Kubernetes Job completes but the pod is deleted before Airflow
fetches logs (e.g., by cluster autoscaler), `execute_complete` fails with an
unhandled `ApiException(404)`. Task retries also fail repeatedly since they
re-enter `execute_complete` and hit the same error.
This fix catches the 404 and skips log retrieval for deleted pods, resolving
both the initial failure and the retry loop.
Note: whether retries of deferred tasks should re-run `execute` instead of
`execute_complete` is a broader SDK-level concern and out of scope here.
closes: #56693
---
##### Was generative AI tooling used to co-author this PR?
- [X] Yes — Claude Code
Generated-by: Claude Code following [the
guidelines](https://github.com/apache/airflow/blob/main/contributing-docs/05_pull_requests.rst#gen-ai-assisted-contributions)
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]