You may also try distributing your JARS along with your Spark app; see
options below. You put on the client node whatever that is necessary and
submit them all in each run. There is also a --files option which you can
remove below, but may be helpful for some configs.
You do not need to specify all the arguments; the default values are
picked up when not explicitly given.
spark-submit \
--master yarn \
--deploy-mode cluster \
--num-executors 2 \
--driver-memory 4g \
--executor-memory 8g \
--files /usr/hdp/current/spark-client/conf/hive-site.xml \
--jars
/usr/hdp/current/spark-client/lib/datanucleus-api-jdo-3.2.6.jar,/usr/hdp/current/spark-client/lib/datanucleus-rdbms-3.2.9.jar,/usr/hdp/current/spark-client/lib/datanucleus-core-3.2.10.jar
\
--class "SparkApp" \
/pathToAppOnTheClientNode/SparkApp.jar (if any, arguments passed to the
Spark App here)
Masood
--
Masood Krohy, Ph.D.
Data Scientist, Intact Lab-R
Intact Financial Corporation
http://ca.linkedin.com/in/masoodkh
De :Silvio Fiorito <silvio.fior...@granturing.com>
A : "saif.a.ell...@wellsfargo.com" <saif.a.ell...@wellsfargo.com>,
"user@spark.apache.org" <user@spark.apache.org>
Date : 2016-11-22 08:02
Objet : Re: Cluster deploy mode driver location
Hi Saif!
Unfortunately, I don't think this is possible for YARN driver-cluster
mode. Regarding the JARs you're referring to, can you place them on HDFS
so you can then have them in a central location and refer to them that way
for dependencies?
http://spark.apache.org/docs/latest/submitting-applications.html#advanced-dependency-management
Thanks,
Silvio
From: saif.a.ell...@wellsfargo.com <saif.a.ell...@wellsfargo.com>
Sent: Monday, November 21, 2016 2:04:06 PM
To: user@spark.apache.org
Subject: Cluster deploy mode driver location
Hello there,
I have a Spark program in 1.6.1, however, when I submit it to cluster, it
randomly picks the driver.
I know there is a driver specification option, but along with it it is
mandatory to define many other options I am not familiar with. The trouble
is, the .jars I am launching need to be available at the driver host, and
I would like to have this jars in just a specific host, which I like it to
be the driver.
Any help?
Thanks!
Saif