njnu-seafish commented on issue #17817:
URL: 
https://github.com/apache/dolphinscheduler/issues/17817#issuecomment-3686259543

   Both task-level and workflow-level timeout alerts are failing with 
NullPointerException — first observed this morning.
   
   > 
   [WI-0][TI-0] - 2025-12-23 16:17:50.839 INFO  
[MasterRpcServer-methodInvoker-10] o.a.d.s.m.e.WorkflowEventBus:[41] - Publish 
event: TaskRunningLifecycleEvent{task=s1, runtimeContext=null}
   [WI-174][TI-0] - 2025-12-23 16:17:50.944 INFO  
[ds-workflow-eventbus-worker-3] 
o.a.d.s.m.e.t.l.h.AbstractTaskLifecycleEventHandler:[47] - Fired task s1 
TaskRunningLifecycleEvent{task=s1, runtimeContext=null} with state 
RUNNING_EXECUTION
   [WI-174][TI-0] - 2025-12-23 16:18:50.631 INFO  
[ds-workflow-eventbus-worker-8] 
o.a.d.s.m.e.t.l.h.TaskTimeoutLifecycleEventHandler:[75] - The task s1 
TimeoutStrategy is WARNFAILED, try to publish a kill event and send a timeout 
alert.
   [WI-174][TI-0] - 2025-12-23 16:18:50.632 INFO  
[ds-workflow-eventbus-worker-8] o.a.d.s.m.e.WorkflowEventBus:[41] - Publish 
event: TaskKillLifecycleEvent{task=s1, delayTime=0}
   [WI-174][TI-0] - 2025-12-23 16:18:50.632 ERROR 
[ds-workflow-eventbus-worker-8] o.a.d.s.m.e.WorkflowEventBusFireWorker:[88] - 
Fire event failed for WorkflowExecuteRunnable: s1-20251223161749852
   
org.apache.dolphinscheduler.server.master.engine.exceptions.WorkflowEventFireException:
 Failed to fire event: TaskRetryEvent{task=s1, timeout=60004}
        at 
org.apache.dolphinscheduler.server.master.engine.WorkflowEventBusFireWorker.doFireSingleWorkflowEventBus(WorkflowEventBusFireWorker.java:133)
        at 
org.apache.dolphinscheduler.server.master.engine.WorkflowEventBusFireWorker.fireAllRegisteredEvent(WorkflowEventBusFireWorker.java:86)
        at 
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
        at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:308)
        at 
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:180)
        at 
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:294)
        at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
        at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
        at java.lang.Thread.run(Thread.java:750)
   Caused by: java.lang.NullPointerException: null
        at 
org.apache.dolphinscheduler.dao.AlertDao.sendTaskTimeoutAlert(AlertDao.java:250)
        at 
org.apache.dolphinscheduler.service.alert.WorkflowAlertManager.sendTaskTimeoutAlert(WorkflowAlertManager.java:265)
        at 
org.apache.dolphinscheduler.server.master.engine.task.lifecycle.handler.TaskTimeoutLifecycleEventHandler.doTaskTimeoutAlert(TaskTimeoutLifecycleEventHandler.java:94)
        at 
org.apache.dolphinscheduler.server.master.engine.task.lifecycle.handler.TaskTimeoutLifecycleEventHandler.handle(TaskTimeoutLifecycleEventHandler.java:79)
        at 
org.apache.dolphinscheduler.server.master.engine.task.lifecycle.handler.TaskTimeoutLifecycleEventHandler.handle(TaskTimeoutLifecycleEventHandler.java:38)
        at 
org.apache.dolphinscheduler.server.master.engine.task.lifecycle.handler.AbstractTaskLifecycleEventHandler.handle(AbstractTaskLifecycleEventHandler.java:46)
        at 
org.apache.dolphinscheduler.server.master.engine.task.lifecycle.handler.AbstractTaskLifecycleEventHandler.handle(AbstractTaskLifecycleEventHandler.java:32)
        at 
org.apache.dolphinscheduler.server.master.engine.WorkflowEventBusFireWorker.doFireSingleEvent(WorkflowEventBusFireWorker.java:144)
        at 
org.apache.dolphinscheduler.server.master.engine.WorkflowEventBusFireWorker.doFireSingleWorkflowEventBus(WorkflowEventBusFireWorker.java:122)
        ... 8 common frames omitted
   [WI-174][TI-0] - 2025-12-23 16:18:50.754 INFO  
[ds-workflow-eventbus-worker-8] 
o.a.d.s.m.e.t.c.PhysicalTaskExecutorClientDelegator:[170] - Kill task s1 on 
executor 192.168.4.3:1234 successfully
   [WI-174][TI-0] - 2025-12-23 16:18:50.755 INFO  
[ds-workflow-eventbus-worker-8] 
o.a.d.s.m.e.t.l.h.AbstractTaskLifecycleEventHandler:[47] - Fired task s1 
TaskKillLifecycleEvent{task=s1, delayTime=0} with state RUNNING_EXECUTION
   [WI-0][TI-0] - 2025-12-23 16:19:01.864 INFO  
[MasterRpcServer-methodInvoker-12] o.a.d.s.m.e.WorkflowEventBus:[41] - Publish 
event: TaskFailedLifecycleEvent{task=s1, endTime=Tue Dec 23 16:19:01 GMT+08:00 
2025}
   [WI-174][TI-0] - 2025-12-23 16:19:01.880 INFO  
[ds-workflow-eventbus-worker-12] o.a.d.s.m.e.WorkflowEventBus:[41] - Publish 
event: 
WorkflowTopologyLogicalTransitionWithTaskFinishLifecycleEvent{task=s1taskState=FAILURE}
   [WI-174][TI-0] - 2025-12-23 16:19:01.883 INFO  
[ds-workflow-eventbus-worker-12] 
o.a.d.s.m.e.t.l.h.AbstractTaskLifecycleEventHandler:[47] - Fired task s1 
TaskFailedLifecycleEvent{task=s1, endTime=Tue Dec 23 16:19:01 GMT+08:00 2025} 
with state RUNNING_EXECUTION
   [WI-174][TI-0] - 2025-12-23 16:19:01.883 INFO  
[ds-workflow-eventbus-worker-12] 
o.a.d.s.m.e.w.l.h.AbstractWorkflowLifecycleEventHandler:[47] - Begin fire 
workflow s1-20251223161749852 
LifecycleEvent[WorkflowTopologyLogicalTransitionWithTaskFinishLifecycleEvent{task=s1taskState=FAILURE}]
 with state: RUNNING_EXECUTION
   [WI-174][TI-0] - 2025-12-23 16:19:01.883 INFO  
[ds-workflow-eventbus-worker-12] o.a.d.s.m.e.WorkflowEventBus:[41] - Publish 
event: WorkflowFailedLifecycleEvent{workflow=s1-20251223161749852}
   [WI-174][TI-0] - 2025-12-23 16:19:01.883 INFO  
[ds-workflow-eventbus-worker-12] 
o.a.d.s.m.e.w.l.h.AbstractWorkflowLifecycleEventHandler:[52] - Fired workflow 
s1-20251223161749852 
LifecycleEvent[WorkflowTopologyLogicalTransitionWithTaskFinishLifecycleEvent{task=s1taskState=FAILURE}]
 with state: RUNNING_EXECUTION
   [WI-174][TI-0] - 2025-12-23 16:19:01.883 INFO  
[ds-workflow-eventbus-worker-12] 
o.a.d.s.m.e.w.l.h.AbstractWorkflowLifecycleEventHandler:[47] - Begin fire 
workflow s1-20251223161749852 
LifecycleEvent[WorkflowFailedLifecycleEvent{workflow=s1-20251223161749852}] 
with state: RUNNING_EXECUTION
   [WI-174][TI-0] - 2025-12-23 16:19:01.888 INFO  
[ds-workflow-eventbus-worker-12] 
o.a.d.s.m.e.w.s.AbstractWorkflowStateAction:[161] - Success set 
WorkflowExecuteRunnable: s1-20251223161749852 state from: RUNNING_EXECUTION to 
FAILURE
   [WI-174][TI-0] - 2025-12-23 16:19:01.888 INFO  
[ds-workflow-eventbus-worker-12] o.a.d.s.m.e.WorkflowEventBus:[41] - Publish 
event: WorkflowFinalizeLifecycleEvent{workflow=s1-20251223161749852}
   [WI-174][TI-0] - 2025-12-23 16:19:01.888 INFO  
[ds-workflow-eventbus-worker-12] 
o.a.d.s.m.e.w.l.h.AbstractWorkflowLifecycleEventHandler:[52] - Fired workflow 
s1-20251223161749852 
LifecycleEvent[WorkflowFailedLifecycleEvent{workflow=s1-20251223161749852}] 
with state: FAILURE
   [WI-174][TI-0] - 2025-12-23 16:19:01.888 INFO  
[ds-workflow-eventbus-worker-12] 
o.a.d.s.m.e.w.l.h.AbstractWorkflowLifecycleEventHandler:[47] - Begin fire 
workflow s1-20251223161749852 
LifecycleEvent[WorkflowFinalizeLifecycleEvent{workflow=s1-20251223161749852}] 
with state: FAILURE
   [WI-174][TI-0] - 2025-12-23 16:19:01.888 INFO  
[ds-workflow-eventbus-worker-12] 
o.a.d.s.m.e.w.s.AbstractWorkflowStateAction:[204] - 
   
********************************************************************************
                       Workflow Instance Detail
   
********************************************************************************
   Workflow Instance Name:  s1-20251223161749852
   Command Type:            START_PROCESS
   State:                   FAILURE
   StartNodes:              [s1]
   TotalTasks:              1
   Host:                    192.168.4.3:5678
   Is SubWorkflow:          NO
   Run Times:               1
   Tenant:                  default
   Work Group:              default
   EventBusSummary:         WorkflowEventBusSummary{eventCount=12, 
fireSuccessEventCount=11, fireFailedEventCount=1}
   Schedule Time:           null
   Start Time:              Tue Dec 23 16:17:50 GMT+08:00 2025
   Restart Time:            Tue Dec 23 16:17:50 GMT+08:00 2025
   End Time:                Tue Dec 23 16:19:01 GMT+08:00 2025
   [WI-174][TI-0] - 2025-12-23 16:19:01.895 INFO  
[ds-workflow-eventbus-worker-12] o.a.d.d.AlertDao:[100] - add alert to db , 
alert: Alert(id=2, sign=6f782eb08d17c1b4e52e757f2171994be8bb1c11, title=start 
workflow failed, 
content=[{"projectCode":159047419031776,"projectName":"bug-dp","workflowInstanceId":174,"workflowDefinitionCode":160813687452064,"workflowInstanceName":"s1-20251223161749852","commandType":"START_PROCESS","workflowExecutionStatus":"FAILURE","modifyBy":"dolphinscheduler","recovery":"NO","runTimes":1,"workflowStartTime":"2025-12-23
 16:17:50","workflowEndTime":"2025-12-23 
16:19:01","workflowHost":"192.168.4.3:5678"}], alertStatus=null, 
warningType=FAILURE, log=null, alertGroupId=2, createTime=Tue Dec 23 16:19:01 
GMT+08:00 2025, updateTime=null, projectCode=159047419031776, 
workflowDefinitionCode=160813687452064, workflowInstanceId=174, 
alertType=WORKFLOW_INSTANCE_FAILURE, info={})
   [WI-174][TI-0] - 2025-12-23 16:19:01.895 INFO  
[ds-workflow-eventbus-worker-12] 
o.a.d.s.m.e.w.s.AbstractWorkflowStateAction:[210] - Successfully finalize 
WorkflowExecuteRunnable: s1-20251223161749852
   [WI-174][TI-0] - 2025-12-23 16:19:01.895 INFO  
[ds-workflow-eventbus-worker-12] 
o.a.d.s.m.e.w.l.h.AbstractWorkflowLifecycleEventHandler:[52] - Fired workflow 
s1-20251223161749852 
LifecycleEvent[WorkflowFinalizeLifecycleEvent{workflow=s1-20251223161749852}] 
with state: FAILURE


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to