GitHub user KarthikeyanDevendhiran added a comment to the discussion: Report of 
increased incidence of tasks getting stuck in queued in Airflow 2.10.5

Hi @karenbraganz, I’ve run the diagnostics as requested and observed consistent 
logs across the worker pods that failed due to this issue.

**When executing the following command:**

``` kubectl logs airflow-scheduler-65cfcf89c9-67jlx -n airflow | grep 
<worker-pod-name> ```

**I received logs indicating:**

```
Defaulted container "scheduler" out of: scheduler, git-sync, 
scheduler-log-groomer, wait-for-airflow-migrations (init), git-sync-init (init)

```

**Additionally, when running:**


`kubectl get events -n airflow --field-selector 
involvedObject.name=<worker-pod-name>`

**I found that:**

```
LAST SEEN   TYPE      REASON        OBJECT                                      
     MESSAGE
12m         Warning   FailedMount   pod/worker-pod-3j33j446   
MountVolume.MountDevice failed for volume 
"pvc-0de664bb-ff12-411c-aa2b-5ca54f2fd4ac" : rpc error: code = Internal desc = 
Mount failed with error: rpc error: code = Unknown desc = exit status 1 *** 
blobfuse2: A new version [2.5.0~preview.1] is available. Consider upgrading to 
latest version for bug-fixes & new features. ***...
```

- Kubernetes events report a FailedMount error for volume 
pvc-0de664bb-ff12-411c-aa2b-5ca54f2fd4ac, caused by a Blobfuse2 mount failure.
- The scheduler logs confirm that the pod is registered, but the task likely 
fails before execution due to the volume issue.

I’m currently reviewing the PVC configuration to identify the root cause. 
Please let me know if there are specific areas you'd like me to focus on or if 
additional diagnostics would be helpful.

Thanks!

GitHub link: 
https://github.com/apache/airflow/discussions/51597#discussioncomment-13453174

----
This is an automatically sent email for [email protected].
To unsubscribe, please send an email to: [email protected]

Reply via email to