[ https://issues.apache.org/jira/browse/GOBBLIN-2193?focusedWorklogId=956037&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-956037 ]
ASF GitHub Bot logged work on GOBBLIN-2193: ------------------------------------------- Author: ASF GitHub Bot Created on: 07/Feb/25 10:59 Start Date: 07/Feb/25 10:59 Worklog Time Spent: 10m Work Description: iPalash commented on code in PR #4096: URL: https://github.com/apache/gobblin/pull/4096#discussion_r1946352560 ########## gobblin-yarn/src/main/java/org/apache/gobblin/yarn/GobblinYarnAppLauncher.java: ########## @@ -380,6 +382,19 @@ public void launch() throws IOException, YarnException, InterruptedException { }, 0, this.appReportIntervalMinutes, TimeUnit.MINUTES); addServices(); + + synchronized (this.applicationDone) { + while (!this.applicationCompleted) { + try { + this.applicationDone.wait(); + if (this.applicationFailed) { + throw new RuntimeException("Gobblin Yarn application failed"); + } + } catch (InterruptedException ie) { + LOGGER.error("Interrupted while waiting for the Gobblin Yarn application to finish", ie); + } + } + } Review Comment: We register a shutdown hook via `Runtime.getRuntime().addShutdownHook` From the documentation: `Uncaught exceptions are handled in shutdown hooks just as in any other thread` Issue Time Tracking ------------------- Worklog Id: (was: 956037) Time Spent: 1.5h (was: 1h 20m) > Fail Azkaban job on when temporal job fails > ------------------------------------------- > > Key: GOBBLIN-2193 > URL: https://issues.apache.org/jira/browse/GOBBLIN-2193 > Project: Apache Gobblin > Issue Type: Improvement > Reporter: Swapnil Palash > Assignee: Hung Tran > Priority: Major > Time Spent: 1.5h > Remaining Estimate: 0h > > Currently when the temporal job running on Yarn fails, we don't propagate the > error back to Azkaban job which launches the Yarn Application. > The change here bubbles the issues encountered when the job fails upto the > GobblinYarnAppLaucher run by the Azkaban job and fails with a > RuntimeException after logging the issues summary. -- This message was sent by Atlassian Jira (v8.20.10#820010)