Gallardot commented on PR #14379:
URL: 
https://github.com/apache/dolphinscheduler/pull/14379#issuecomment-1628813282

   HI @Radeity , good job!
   I tested this PR. Sometimes it's OK, but sometimes there's an error log. 
   ```
   [INFO] 2023-07-10 19:42:57.004 +0800 - [K8sJobExecutor-echo-77] start to 
submit job
   [INFO] 2023-07-10 19:42:57.073 +0800 - [K8sJobExecutor-echo-77] submitted 
job successfully
   [INFO] 2023-07-10 19:42:57.095 +0800 - event received : job:echo-77 
action:ADDED
   [INFO] 2023-07-10 19:42:57.421 +0800 - event received : job:echo-77 
action:MODIFIED
   [INFO] 2023-07-10 19:42:57.421 +0800 - job echo-77 status 1
   [INFO] 2023-07-10 19:43:02.459 +0800 - [K8S-pod-log] 
{"kind":"Status","apiVersion":"v1","metadata":{},"status":"Failure","message":"container
 \"echo-77\" in pod \"echo-77-pngf7\" is waiting to start: 
ContainerCreating","reason":"BadRequest","code":400}
   [INFO] 2023-07-10 19:43:25.109 +0800 - event received : job:echo-77 
action:MODIFIED
   [INFO] 2023-07-10 19:43:25.109 +0800 - job echo-77 status 0
   [INFO] 2023-07-10 19:43:25.110 +0800 - [K8sJobExecutor-echo-77] succeed in 
k8s
   ```
   Maybe some fault tolerance work is needed here. For example, before getting 
pod logs, check the status of the current task pod?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to