Gallardot commented on PR #14379:
URL: https://github.com/apache/dolphinscheduler/pull/14379#issuecomment-1628813282
Hi @Radeity, good job!
I tested this PR. Sometimes it works, but sometimes an error shows up in the log:
```
[INFO] 2023-07-10 19:42:57.004 +0800 - [K8sJobExecutor-echo-77] start to submit job
[INFO] 2023-07-10 19:42:57.073 +0800 - [K8sJobExecutor-echo-77] submitted job successfully
[INFO] 2023-07-10 19:42:57.095 +0800 - event received : job:echo-77 action:ADDED
[INFO] 2023-07-10 19:42:57.421 +0800 - event received : job:echo-77 action:MODIFIED
[INFO] 2023-07-10 19:42:57.421 +0800 - job echo-77 status 1
[INFO] 2023-07-10 19:43:02.459 +0800 - [K8S-pod-log] {"kind":"Status","apiVersion":"v1","metadata":{},"status":"Failure","message":"container \"echo-77\" in pod \"echo-77-pngf7\" is waiting to start: ContainerCreating","reason":"BadRequest","code":400}
[INFO] 2023-07-10 19:43:25.109 +0800 - event received : job:echo-77 action:MODIFIED
[INFO] 2023-07-10 19:43:25.109 +0800 - job echo-77 status 0
[INFO] 2023-07-10 19:43:25.110 +0800 - [K8sJobExecutor-echo-77] succeed in k8s
```
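Maybe some fault tolerance is needed here. The log request at 19:43:02 was sent
while the container was still in ContainerCreating, so the API server returned
a 400 and that response was written out as if it were pod log output. For
example, could we check the status of the current task pod before getting its
logs?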
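
To make the suggestion concrete, here is a rough sketch of the kind of guard I have in mind. It is not code from this PR: `PodLogGuard`, `waitUntilLogReady`, and the phase supplier are hypothetical names, and the supplier stands in for whatever Kubernetes client call returns the pod's `status.phase`. It also assumes that a pod in phase `Running`, `Succeeded`, or `Failed` normally has its container created, which may not hold in every edge case.

```java
import java.util.Set;
import java.util.function.Supplier;

// Hypothetical helper, not part of DolphinScheduler: polls the pod phase
// before the caller starts reading pod logs.
class PodLogGuard {

    // Assumption: in these phases the container has normally been created,
    // so a log request should no longer fail with "waiting to start".
    private static final Set<String> LOG_READY_PHASES =
            Set.of("Running", "Succeeded", "Failed");

    /**
     * Polls the pod phase until logs should be readable.
     *
     * @param phaseSupplier returns the current pod phase, e.g. "Pending"
     *                      (stand-in for a real Kubernetes client call)
     * @param maxAttempts   how many times to poll before giving up
     * @param sleepMillis   pause between two polls
     * @return true if a log-ready phase was seen, false on timeout or interrupt
     */
    static boolean waitUntilLogReady(Supplier<String> phaseSupplier,
                                     int maxAttempts,
                                     long sleepMillis) {
        for (int attempt = 0; attempt < maxAttempts; attempt++) {
            String phase = phaseSupplier.get();
            if (phase != null && LOG_READY_PHASES.contains(phase)) {
                return true;
            }
            try {
                Thread.sleep(sleepMillis);
            } catch (InterruptedException e) {
                // Restore the interrupt flag and stop waiting.
                Thread.currentThread().interrupt();
                return false;
            }
        }
        return false;
    }
}
```

The log collector would call `waitUntilLogReady` first and only open the log stream when it returns `true`; on `false` it could skip log collection or retry later instead of logging the 400 response.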