Hi Spark folks,

So on our production Spark cluster, it lives in the data center and I need
to attach to a VPN from my laptop, so that I can then submit a Spark
application job to the Spark Master (behind the VPN).

However, the problem arises that I have a local IP address on the laptop
which is on a separate network segment than the VPN. I figured out that I
can set the SparkConf configuration property called "spark.driver.host" to
be my VPN IP address - this got me further, because I could initiate a
connection to the master.

However, it was still not working. I was hoping to set the SparkConf
variables "spark.fileserver.uri" and "spark.httpBroadcast.uri" to NOT be my
local IP addresses, but the VPN IP addresses, because I think these
configuration variables actually pass my application files to the master
(in this case, it is a python script). So my spark.fileserver.uri and the
Spark Master need to be on the same network segment (the VPN subnetwork ).

Am I on the right track? How can I set "spark.fileserver.uri" and
"spark.httpBroadcast.uri" ? I see that these are actually run by Jetty
server...any thoughts?

Thank you so much!

-- 
Άρης Βλασακάκης
Aris Vlasakakis

Reply via email to