Ahmed, one thing I do to detect missing clients is run a workflow with a shell action, such as https://github.com/dbist/oozie-examples/blob/master/apps/shell/scripts/script.sh
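
For illustration, here is a minimal sketch of that kind of check. It is not a copy of the linked script. The function name check_client and the key=value output keys are hypothetical, and the default path is simply the one from the error message in the ticket, so adjust it to whatever client you need to verify.

```shell
#!/bin/sh
# Hypothetical check script for an Oozie shell action: it reports which node
# ran it and whether the client executable is present on that node.

# Default path comes from the error message in OOZIE-3294 (an assumption
# about your layout); pass a different path as the first argument if needed.
CLIENT_BIN="${1:-/usr/hdp/current/spark2-client/bin/spark-submit}"

check_client() {
    # $1: absolute path of the client executable to look for.
    # Prints "present" if the file exists and is executable, else "missing".
    if [ -x "$1" ]; then
        echo "present"
    else
        echo "missing"
    fi
}

# key=value lines, so the output can also be read back with <capture-output/>.
echo "host=$(uname -n)"
echo "client_path=$CLIENT_BIN"
echo "client_status=$(check_client "$CLIENT_BIN")"
```

Each run only tells you about the node YARN happened to schedule it on, so you would likely need to run it several times, or look at the stdout of a failed run, to catch the node that is missing the client.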
Obviously it gets tricky when you have a lot of nodes, but it's such a small check that you can include it in each of your workflows.

On Thu, Jun 21, 2018, 3:35 AM Ahmed (JIRA) <[email protected]> wrote:

>
> [
> https://issues.apache.org/jira/browse/OOZIE-3294?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16519209#comment-16519209
> ]
>
> Ahmed commented on OOZIE-3294:
> ------------------------------
>
> [~andras.piros] Thanks for the quick response,
>
> Yes we are doing oozie shell actions. post your comment i evaluated all
> the servers with the spark-submit available, i see 1 node is not installed
> with the client(suspecting that node is causing intermittent issue
> sometimes if the oozie calls its action and YARN runs on that node & the
> failure Error is noticed.)
>
> How do i exactly identifiy node without the programme spark-submit is
> causing failure as the error logs says no such file. is there a way to find
> the exact failure cause on that specific node ?
>
> How do we fix this going forward? does installing the spark-client
> resolves this or we stop the nodemanager on this host and enable on other
> node?
>
> Regards
> -Ahmed
>
> > Launcher exception: java.io.IOException
> > ---------------------------------------
> >
> >                 Key: OOZIE-3294
> >                 URL: https://issues.apache.org/jira/browse/OOZIE-3294
> >             Project: Oozie
> >          Issue Type: Bug
> >          Components: action
> >    Affects Versions: 4.2.0
> >            Reporter: Ahmed
> >            Priority: Major
> >
> > Hi ,
> > There is an intermittent issue in oozie workflow, where the oozie action
> > fails sometimes with file not found error for some actions of spark-submit
> > or sqoop. (The issue is intermittent.)
> >
> > I am using hadoop - 2.7.3
> > oozie version-4.2.0
> >
> > "Launcher exception: java.io.IOException Cannot run program
> > "./spark-submit" (in directory "/usr/hdp/current/spark2-client/bin"):
> > error=2, No such file or directory"
