ChPi opened a new issue, #12938:
URL: https://github.com/apache/dolphinscheduler/issues/12938

   ### Search before asking
   
   - [X] I had searched in the 
[issues](https://github.com/apache/dolphinscheduler/issues?q=is%3Aissue) and 
found no similar issues.
   
   
   ### What happened
   
   version 3.1.0.
   
   i run a flow on dolphinscheduler k8s cluster, then delete the worker which 
task running,the task cannot be resubmitted. 
   the task state is always `Need fault tolerance`.
   
   when a worker is down, the master will set `TASK_STATE_CHANGE` and 
`NEED_FAULT_TOLERANCE` for the task, then call `action(run)` at 
`TaskStateEventHandler`, should it call `action(resubmit)` for 
`NEED_FAULT_TOLERANCE` ?
   
   ### What you expected to happen
   
   resubmit the task. 
   
   ### How to reproduce
   
   1. The task is running in worker1
   2. delete worker1
   3. The task need fault tolerance, but it cannot be resubmitted
   
   ### Anything else
   
   _No response_
   
   ### Version
   
   3.1.x
   
   ### Are you willing to submit PR?
   
   - [X] Yes I am willing to submit a PR!
   
   ### Code of Conduct
   
   - [X] I agree to follow this project's [Code of 
Conduct](https://www.apache.org/foundation/policies/conduct)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: 
[email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to