ydgpaysit opened a new issue, #16928: URL: https://github.com/apache/dolphinscheduler/issues/16928
### Search before asking - [X] I had searched in the [issues](https://github.com/apache/dolphinscheduler/issues?q=is%3Aissue) and found no similar issues. ### What happened We occasionally encounter an issue where, after initiating a workflow, the taskInstance is not assigned to any worker by master, resulting in the host field being null and FAILURE. Below is the relevant log for reference: `2024-12-26T13:47:20.950725657Z [INFO] 2024-12-26 16:47:20.950 +0300 org.apache.dolphinscheduler.server.master.runner.MasterSchedulerBootstrap:[185] - [WorkflowInstance-0][TaskInstance-0] - Master schedule bootstrap loop command success, fetch command size: 1, cost: 2ms, current slot: 1, total slot size: 2 2024-12-26T13:47:20.958173290Z [ERROR] 2024-12-26 16:47:20.957 +0300 org.apache.dolphinscheduler.server.master.runner.MasterSchedulerBootstrap:[154] - [WorkflowInstance-0][TaskInstance-0] - Master handle command 147 error 2024-12-26T13:47:20.958191432Z org.apache.dolphinscheduler.server.master.exception.WorkflowCreateException: Create workflow execute runnable failed 2024-12-26T13:47:20.958194941Z at org.apache.dolphinscheduler.server.master.runner.WorkflowExecuteRunnableFactory.createWorkflowExecuteRunnable(WorkflowExecuteRunnableFactory.java:93) 2024-12-26T13:47:20.958197529Z at org.apache.dolphinscheduler.server.master.runner.MasterSchedulerBootstrap.lambda$run$0(MasterSchedulerBootstrap.java:137) 2024-12-26T13:47:20.958200630Z at java.util.stream.ForEachOps$ForEachOp$OfRef.accept(ForEachOps.java:183) 2024-12-26T13:47:20.958203498Z at java.util.ArrayList$ArrayListSpliterator.forEachRemaining(ArrayList.java:1384) 2024-12-26T13:47:20.958205987Z at java.util.stream.AbstractPipeline.copyInto(AbstractPipeline.java:482) 2024-12-26T13:47:20.958208505Z at java.util.stream.ForEachOps$ForEachTask.compute(ForEachOps.java:290) 2024-12-26T13:47:20.958210929Z at java.util.concurrent.CountedCompleter.exec(CountedCompleter.java:731) 2024-12-26T13:47:20.958230215Z at java.util.concurrent.ForkJoinTask.doExec(ForkJoinTask.java:289) 2024-12-26T13:47:20.958232832Z at java.util.concurrent.ForkJoinTask.doInvoke(ForkJoinTask.java:401) 2024-12-26T13:47:20.958235240Z at java.util.concurrent.ForkJoinTask.invoke(ForkJoinTask.java:734) 2024-12-26T13:47:20.958237681Z at java.util.stream.ForEachOps$ForEachOp.evaluateParallel(ForEachOps.java:159) 2024-12-26T13:47:20.958240365Z at java.util.stream.ForEachOps$ForEachOp$OfRef.evaluateParallel(ForEachOps.java:173) 2024-12-26T13:47:20.958242834Z at java.util.stream.AbstractPipeline.evaluate(AbstractPipeline.java:233) 2024-12-26T13:47:20.958245245Z at java.util.stream.ReferencePipeline.forEach(ReferencePipeline.java:485) 2024-12-26T13:47:20.958247619Z at java.util.stream.ReferencePipeline$Head.forEach(ReferencePipeline.java:650) 2024-12-26T13:47:20.958250457Z at org.apache.dolphinscheduler.server.master.runner.MasterSchedulerBootstrap.run(MasterSchedulerBootstrap.java:134) 2024-12-26T13:47:20.958253917Z Caused by: org.apache.dolphinscheduler.service.exceptions.ServiceException: delete command fail, id:147 2024-12-26T13:47:20.958257023Z at org.apache.dolphinscheduler.service.process.ProcessServiceImpl.deleteCommandWithCheck(ProcessServiceImpl.java:2556) 2024-12-26T13:47:20.958259627Z at org.apache.dolphinscheduler.service.process.ProcessServiceImpl.handleCommand(ProcessServiceImpl.java:344) 2024-12-26T13:47:20.958262012Z at org.apache.dolphinscheduler.service.process.ProcessServiceImpl$$FastClassBySpringCGLIB$$9d3e18f9.invoke(<generated>) 2024-12-26T13:47:20.958264501Z at org.springframework.cglib.proxy.MethodProxy.invoke(MethodProxy.java:218) 2024-12-26T13:47:20.958266880Z at org.springframework.aop.framework.CglibAopProxy$CglibMethodInvocation.invokeJoinpoint(CglibAopProxy.java:793) 2024-12-26T13:47:20.958269252Z at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:163) 2024-12-26T13:47:20.958271621Z at org.springframework.aop.framework.CglibAopProxy$CglibMethodInvocation.proceed(CglibAopProxy.java:763) 2024-12-26T13:47:20.958274308Z at org.springframework.transaction.interceptor.TransactionInterceptor$1.proceedWithInvocation(TransactionInterceptor.java:123) 2024-12-26T13:47:20.958277292Z at org.springframework.transaction.interceptor.TransactionAspectSupport.invokeWithinTransaction(TransactionAspectSupport.java:388) 2024-12-26T13:47:20.958279702Z at org.springframework.transaction.interceptor.TransactionInterceptor.invoke(TransactionInterceptor.java:119) 2024-12-26T13:47:20.958282230Z at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:186) 2024-12-26T13:47:20.958284559Z at org.springframework.aop.framework.CglibAopProxy$CglibMethodInvocation.proceed(CglibAopProxy.java:763) 2024-12-26T13:47:20.958286866Z at org.springframework.aop.framework.CglibAopProxy$DynamicAdvisedInterceptor.intercept(CglibAopProxy.java:708) 2024-12-26T13:47:20.958289313Z at org.apache.dolphinscheduler.service.process.ProcessServiceImpl$$EnhancerBySpringCGLIB$$8dc970f9.handleCommand(<generated>) 2024-12-26T13:47:20.958291942Z at org.apache.dolphinscheduler.server.master.runner.WorkflowExecuteContextFactory.createWorkflowInstance(WorkflowExecuteContextFactory.java:81) 2024-12-26T13:47:20.958294408Z at org.apache.dolphinscheduler.server.master.runner.WorkflowExecuteContextFactory.createWorkflowExecuteRunnableContext(WorkflowExecuteContextFactory.java:56) 2024-12-26T13:47:20.958297088Z at org.apache.dolphinscheduler.server.master.runner.WorkflowExecuteRunnableFactory.createWorkflowExecuteRunnable(WorkflowExecuteRunnableFactory.java:79) 2024-12-26T13:47:20.958300003Z ... 15 common frames omitted` ### What you expected to happen Execute successfully ### How to reproduce Trigger the workflow manually or allow the scheduler to execute it automatically. ### Anything else _No response_ ### Version 3.2.x ### Are you willing to submit PR? - [ ] Yes I am willing to submit a PR! ### Code of Conduct - [X] I agree to follow this project's [Code of Conduct](https://www.apache.org/foundation/policies/conduct) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
