MatrixManAtYrService commented on issue #13542:
URL: https://github.com/apache/airflow/issues/13542#issuecomment-849031715


   While trying to recreate this, I wrote a [stress 
test](https://github.com/MatrixManAtYrService/airflow-git-sync/blob/master/scheduler_stress.py)
 which I ran overnight on my local microk8s cluster (airflow v2.0.1, 
release:2.0.1+beb8af5ac6c438c29e2c186145115fb1334a3735 configured like 
[this](https://github.com/MatrixManAtYrService/airflow-git-sync/blob/master/zsh.stdin)).
   
   I was hoping that it would get fully stuck by the time I woke.  Instead 
there were only two stuck tasks: 
    
   
![stucktasks](https://user-images.githubusercontent.com/5834582/119712787-34db0f80-be1e-11eb-9a41-875dd88c0566.gif)
   
   Deleting the scheduler pod and letting kubernetes recreate it did cause the 
stuck tasks to complete.  My current guess is that it won't get stuck in a 
restarting-scheduler-doesnt-help kind of way until the number of stuck-queued 
tasks matches `dag_concurrency`.  I'll update here if I'm able to confirm that 
guess.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to