pan3793 commented on code in PR #7132:
URL: https://github.com/apache/kyuubi/pull/7132#discussion_r2196357153
##########
kyuubi-server/src/main/scala/org/apache/kyuubi/operation/BatchJobSubmission.scala:
##########
@@ -292,13 +292,23 @@ class BatchJobSubmission(
     try {
       info(s"Submitting $batchType batch[$batchId] job:\n$builder")
       val process = builder.start
-      while (process.isAlive && !applicationFailed(_applicationInfo, appOperation)) {
+
+      // continue polling if process is alive and application not started yet
+      // or application is not failed
+      while (process.isAlive && (
+        applicationId(_applicationInfo).isEmpty ||
+        !applicationFailed(_applicationInfo, appOperation))) {

Review Comment:
   What do you mean by "application not started yet"? That the Pod has not started, or that the Pod cannot be found (regardless of its status)? For the latter case, we previously introduced `kyuubi.engine.kubernetes.submit.timeout` to allow Kyuubi to wait longer.

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: notifications-unsubscr...@kyuubi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org