cyber-free-bot opened a new issue, #17584:
URL: https://github.com/apache/dolphinscheduler/issues/17584

   ### Search before asking
   
   - [x] I had searched in the 
[issues](https://github.com/apache/dolphinscheduler/issues?q=is%3Aissue) and 
found no similar issues.
   
   
   ### What happened
   
   workflow instance keep stuck in Ready_Stop Status.
   <img width="3218" height="382" alt="Image" 
src="https://github.com/user-attachments/assets/4daf86dd-20b0-40ed-8b19-3ccafa81f319";
 />
   
   
   ### What you expected to happen
   
   workflow instance with STOP status in the end.
   
   ### How to reproduce
   
   how to reproduce it: 
   1. create some parallel execution task nodes, all of which have downstream 
nodes. The faster these task nodes end, the higher the probability of 
replication.
   2. in my env, all the shell node only execute echo 1.
   3. start workflow instance, and stop it immediately.
   <img width="3184" height="1248" alt="Image" 
src="https://github.com/user-attachments/assets/6ba3d276-8a2d-48c2-9d82-5e7cd3e3aef8";
 />
   
   ### Anything else
   
   1. when process instance in Ready_Stop state, dolphinscheduler will send 
workflow state change event to this process instance, this event call 
KillAllTasks
   2. at this moment, TaskStateEventHandler received task finished event, will 
call workflowExecuteRunnable.taskFinished, and submitPostNode
   3. submitPostNode will submit standby tasks, if a standby task dependResult 
is success, ds need save it to db, and there's bug here
   
   <img width="2342" height="944" alt="Image" 
src="https://github.com/user-attachments/assets/e0d99e71-7fcb-4b27-8f49-0960253c59aa";
 />
   4. submitTaskExec failed means task id is null, Map put (null, xx) will 
throw npe exception.
   
   <img width="3770" height="854" alt="Image" 
src="https://github.com/user-attachments/assets/c64bb440-be72-4fd3-9365-67883e97623b";
 />
   5. this exception without any handle method, so 
workflowExecuteRunnable.stateEvents never remove the finished event, means 
workflowExecuteRunnable need keep dealing with this event all the time, it's 
endless loop
   
   ### Version
   
   3.1.x
   
   ### Are you willing to submit PR?
   
   - [ ] Yes I am willing to submit a PR!
   
   ### Code of Conduct
   
   - [x] I agree to follow this project's [Code of 
Conduct](https://www.apache.org/foundation/policies/conduct)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: 
[email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to