Badger created TOREE-404:
----------------------------

             Summary: Issues with SPARK_OPTS
                 Key: TOREE-404
                 URL: https://issues.apache.org/jira/browse/TOREE-404
             Project: TOREE
          Issue Type: Bug
    Affects Versions: 0.2.0
            Reporter: Badger


Hi
I’m using Toree 0.2.0.dev1 and I’ve spent the last day or so trying to 
configure the kernel to talk to a cluster in yarn-client mode. I’m putting 
together a docker image for our analysts so that they can talk directly to our 
cluster. This is largely based around the jupyter docker-stacks docker images.

When configuring toree with :

jupyter toree install --spark_opts='--master yarn --deploy-mode client 
--driver-memory 512m  --executor-memory 512m  --executor-cores 4 
--driver-java-options -Dhdp.version=2.5.3.0-37 --conf 
spark.hadoop.yarn.timeline-service.enabled=false'

I see the following in the kernel start up logs 

17/04/13 12:34:37 [INFO] o.a.t.b.l.StandardComponentInitialization$$anon$1 - 
Connecting to spark.master local[*]

I also tried the following:
jupyter toree install --spark_opts='--master yarn --deploy-mode client 
--driver-memory 512m  --executor-memory 512m  --executor-cores 4 
--driver-java-options -Dhdp.version=2.5.3.0-37 --conf 
spark.hadoop.yarn.timeline-service.enabled=false spark.master=yarn-client'

After lots of digging I added the following to my spark-defaults.conf 
spark.master=yarn-client
spark.hadoop.yarn.timeline-service.enabled=false

It seems that the options set by “conf” in the “spark_opts” are not used. 

After setting these options in the spark-defaults.conf, I can see an 
application registered as running in the hadoop node manager. 




--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to