wangxj3 edited a comment on issue #4084: URL: https://github.com/apache/incubator-dolphinscheduler/issues/4084#issuecomment-731884164
> > > sorry, may be it's not the same problem, please describe the detail of your problem. > eg: the task's content / system info of master and worker ... master向worker发送task: 1.worker接收到task后,会回复给master一个“ack”,然后去执行task,执行结束之后给master发送result。 2.master接受到ack后将taskInstance的状态更新成1,接收到result后将状态更新成7(假设task的状态是成功) 正常: ack和result根据worker执行的顺序,ack会先发送到master,task状态从0->1,result 后发送到master ,task状态从1->7。 异常: 在ack的response处理的时候出现阻塞,或者是在netty消息传输或消费的,反正resul更新数据库的时间早于ack,导致状态变化0->7->1.而实际worker已经执行过task了,所以master就会阻塞住,一直循环查看当前节点的状态。 ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
