[jira] [Commented] (OOZIE-2547) Add mapreduce.job.cache.files to spark action

Robert Kanter (JIRA) Thu, 07 Sep 2017 11:39:46 -0700

    [ 
https://issues.apache.org/jira/browse/OOZIE-2547?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16157411#comment-16157411
 ]


Robert Kanter commented on OOZIE-2547:
--------------------------------------

That should be working correctly.  Even though we removed 
{{determineSparkJarsAndClasspath}}, additional code was changed/added to 
compensate.  [~gezapeti], any idea why it's not finding the Hadoop 
{{Configuration}} class?

> Add mapreduce.job.cache.files to spark action
> ---------------------------------------------
>
>                 Key: OOZIE-2547
>                 URL: https://issues.apache.org/jira/browse/OOZIE-2547
>             Project: Oozie
>          Issue Type: Bug
>            Reporter: Satish Subhashrao Saley
>            Assignee: Satish Subhashrao Saley
>            Priority: Minor
>             Fix For: 4.3.0
>
>         Attachments: OOZIE-2547-1.patch, OOZIE-2547-4.patch, 
> OOZIE-2547-5.patch, yarn-cluster_launcher.txt
>
>
> Currently, we pass jars using --jars option while submitting spark job. Also, 
> we add spark.yarn.dist.files option in case of yarn-client mode. 
> Instead of that, we can have only --files option and pass on the files which 
> are present in mapreduce.job.cache.files. While doing so, we make sure that 
> spark won't make another copy of the files if files exist on the hdfs. We saw 
> the issues when files are getting copied multiple times and causing 
> exceptions such as :
> {code}
> Diagnostics: Resource 
> hdfs://localhost/user/saley/.sparkStaging/application_1234_123/oozie-examples.jar
>  changed on src filesystem
> {code}



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

[jira] [Commented] (OOZIE-2547) Add mapreduce.job.cache.files to spark action

Reply via email to