[ 
https://issues.apache.org/jira/browse/OOZIE-2787?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15851652#comment-15851652
 ] 

Satish Subhashrao Saley commented on OOZIE-2787:
------------------------------------------------

[SPARK-18099|https://issues.apache.org/jira/browse/SPARK-18099] made Spark 
throw an exception if the same file is added to the distributed cache more 
than once. 
If the application jar is in the workflow/lib directory and the user gives a 
relative path to it in the <jar> tag, the application jar ends up being 
distributed more than once: once through --files and once as the application 
jar itself. Before Spark 2.1 this was not a problem, because Spark only 
logged a WARN message.

The solution is to use the complete HDFS path when specifying the application 
jar and to exclude it from the --files option.
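
For illustration, a minimal sketch of a Spark action whose <jar> tag uses the 
complete HDFS path, reusing the values from the issue description. The action 
name, the transition targets, the schema version, and the ${jobTracker} and 
${nameNode} parameters are assumptions and are not taken from the reported 
workflow:

{code:xml}
<!-- Hypothetical action name and schema version; adjust to the workflow in use -->
<action name="spark-wordcount">
    <spark xmlns="uri:oozie:spark-action:0.1">
        <job-tracker>${jobTracker}</job-tracker>
        <name-node>${nameNode}</name-node>
        <master>yarn</master>
        <mode>cluster</mode>
        <name>oozieSparkStarter</name>
        <class>ScalaWordCount</class>
        <!-- Complete HDFS path instead of the relative "spark-example.jar" -->
        <jar>hdfs://mycluster.com/user/saley/oozie/apps/sparkapp/lib/spark-example.jar</jar>
        <arg>samplefile.txt</arg>
        <arg>output</arg>
    </spark>
    <!-- Hypothetical transition targets -->
    <ok to="end"/>
    <error to="fail"/>
</action>
{code}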

> Oozie distributes application jar twice making the spark job fail
> -----------------------------------------------------------------
>
>                 Key: OOZIE-2787
>                 URL: https://issues.apache.org/jira/browse/OOZIE-2787
>             Project: Oozie
>          Issue Type: Bug
>            Reporter: Satish Subhashrao Saley
>            Assignee: Satish Subhashrao Saley
>         Attachments: OOZIE-2787-1.patch
>
>
> Oozie adds the application jar to the list of files to be uploaded to the 
> distributed cache. Since the jar is then added twice, the job fails. This is 
> observed starting with Spark 2.1.0, which introduces a check for duplicate 
> files and fails the job.
> {code}
> --master
> yarn
> --deploy-mode
> cluster
> --name
> oozieSparkStarter
> --class
> ScalaWordCount
> --queue 
> default
> --conf
> spark.executor.extraClassPath=$PWD/*
> --conf
> spark.driver.extraClassPath=$PWD/*
> --conf
> spark.executor.extraJavaOptions=-Dlog4j.configuration=spark-log4j.properties
> --conf
> spark.driver.extraJavaOptions=-Dlog4j.configuration=spark-log4j.properties
> --conf
> spark.yarn.security.tokens.hive.enabled=false
> --conf
> spark.yarn.security.tokens.hbase.enabled=false
> --files
> hdfs://mycluster.com/user/saley/oozie/apps/sparkapp/lib/spark-example.jar
> --properties-file
> spark-defaults.conf
> --verbose
> spark-example.jar
> samplefile.txt
> output
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
