Amey2400 opened a new issue #21945:
URL: https://github.com/apache/airflow/issues/21945


   ### Apache Airflow version
   
   2.1.3
   
   ### What happened
   
   The DAG run fails because some task instances fail on their first try, 
with either a "pod name must be provided" or a "pod not found" error.
   ![Screenshot from 2022-03-02 
21-31-46](https://user-images.githubusercontent.com/65292207/156399611-2da1b9a3-a1fc-4997-af69-e56f5d7b363f.png)
   ![Screenshot from 2022-03-02 
21-30-28](https://user-images.githubusercontent.com/65292207/156399711-a655367e-412d-48a7-a004-b0e6b782b8a4.png)
   
   On the second try the task completes and is marked as a success. By then, 
however, its downstream tasks have already been marked as "upstream failed"; 
they are not rerun, and the whole DAG run is marked as failed.
   ![Screenshot from 2022-03-02 
21-35-59](https://user-images.githubusercontent.com/65292207/156400249-1e027e99-6749-4207-ba0e-847e44423f5a.png)
   
   In some instances the first try of the task reports no error while fetching 
logs, but no logs are shown except the first statement about reading from the 
log file.
   ![Screenshot from 2022-03-02 
21-41-27](https://user-images.githubusercontent.com/65292207/156401383-caf3dc7f-1488-4ec5-96c1-bf3ab671d3a0.png)
   On the second try the logs are shown properly and the task completes 
successfully. The same problem still occurs, though: the downstream tasks were 
marked as "upstream failed" before the second try finished, and the DAG run was 
marked as failed.
   
   
   
   ### What you expected to happen
   
   On the first try of a task instance, Airflow should be able to fetch the 
worker pod logs: the "pod name must be provided" error should not occur, and 
the logs should not be empty.
   If the first try of a task fails while retries are still configured for it, 
the downstream tasks should not be marked as "upstream failed", even if they 
have the same number of retries set. If the task then succeeds on the second 
try, the downstream tasks should run and the DAG run should not be marked as 
failed.
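
To make the expected behaviour concrete, here is a minimal, self-contained sketch of the state propagation I would expect. It is not Airflow's scheduler code: the function names and the `WAITING` and `RUNNABLE` states are hypothetical, introduced only for illustration, and the rule assumes a downstream task that needs all of its upstream tasks to succeed.

```python
from enum import Enum
from typing import Sequence


class State(Enum):
    SUCCESS = "success"
    FAILED = "failed"
    UP_FOR_RETRY = "up_for_retry"
    UPSTREAM_FAILED = "upstream_failed"
    WAITING = "waiting"    # hypothetical: upstream not finished yet
    RUNNABLE = "runnable"  # hypothetical: ready to be scheduled


def after_attempt(succeeded: bool, try_number: int, retries: int) -> State:
    """State of a task after attempt `try_number` (1-based).

    `retries` is the number of extra attempts allowed after the first one.
    """
    if succeeded:
        return State.SUCCESS
    # A failed attempt is only terminal once the retries are used up.
    return State.UP_FOR_RETRY if try_number <= retries else State.FAILED


def downstream_state(upstream_states: Sequence[State]) -> State:
    """Expected state of a task that needs every upstream task to succeed."""
    if any(s in (State.FAILED, State.UPSTREAM_FAILED) for s in upstream_states):
        return State.UPSTREAM_FAILED
    if all(s is State.SUCCESS for s in upstream_states):
        return State.RUNNABLE
    # Some upstream task is still running or waiting for a retry.
    return State.WAITING


# First try fails with one retry configured: downstream should keep waiting.
first_try = after_attempt(succeeded=False, try_number=1, retries=1)
assert downstream_state([first_try]) is State.WAITING

# Second try succeeds: downstream should become runnable.
second_try = after_attempt(succeeded=True, try_number=2, retries=1)
assert downstream_state([second_try]) is State.RUNNABLE
```

In this sketch a downstream task only becomes "upstream failed" once an upstream task has used up its retries, which is the behaviour described above.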
   
   ### How to reproduce
   
   _No response_
   
   ### Operating System
   
   Debian GNU/Linux
   
   ### Versions of Apache Airflow Providers
   
   _No response_
   
   ### Deployment
   
   Other 3rd-party Helm chart
   
   ### Deployment details
   
   _No response_
   
   ### Anything else
   
   _No response_
   
   ### Are you willing to submit PR?
   
   - [ ] Yes I am willing to submit a PR!
   
   ### Code of Conduct
   
   - [X] I agree to follow this project's [Code of 
Conduct](https://github.com/apache/airflow/blob/main/CODE_OF_CONDUCT.md)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
