epitomizelu opened a new issue, #15633: URL: https://github.com/apache/dolphinscheduler/issues/15633
### Search before asking

- [X] I had searched in the [issues](https://github.com/apache/dolphinscheduler/issues?q=is%3Aissue) and found no similar issues.

### What happened

This looks the same as bug 7441. The version is 3.1.8, and the cluster has 2 masters and 4 workers.

I found a workflow instance that had been running for more than 12 hours, which is abnormal. I then found that a task in the sub-workflow had been waiting to be executed for more than 10 hours. When I stop the workflow and restart it, the problem usually does not reproduce.

### What you expected to happen

If a task waits for more than 5 minutes and cannot be executed, the task should be marked as failed.

### How to reproduce

It is hard to reproduce; the workflow works normally most of the time. When I stop the workflow and restart it, the problem usually does not reproduce.

### Anything else

_No response_

### Version

3.1.x

### Are you willing to submit PR?

- [X] Yes I am willing to submit a PR!

### Code of Conduct

- [X] I agree to follow this project's [Code of Conduct](https://www.apache.org/foundation/policies/conduct)
