First and foremost I am assuming that getting “stuck” is only happening when using a CeleryExecutor.
We have seen repeated instanced of the scheduler "dying" - i.e. no more scheduler threads in a ps output - with LocalExecutor too. If you feel this fits the description of "getting stuck", happy to provide more detail to try to get to a reproducible situation.
Regards ap
