Hi Ruslan,

I believe you can just set PYSPARK_PYTHON in the Spark interpreter settings to switch between Python 2 and Python 3.
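
To confirm which Python a given paragraph actually picked up, something along these lines might help. It is only a rough sketch: `python_env_report` is a helper name made up for illustration, and the list of variables is simply the set mentioned in this thread. It uses nothing beyond the standard library, so it should run under both Python 2.7 and Python 3.

```python
import os
import sys


def python_env_report(environ=None):
    """Return the running interpreter's path, version, and related env vars.

    `environ` defaults to os.environ; passing a dict makes the helper easy to test.
    """
    environ = os.environ if environ is None else environ
    # Variables discussed in this thread (an assumption, extend as needed).
    keys = ("PYSPARK_PYTHON", "PYSPARK_DRIVER_PYTHON", "PYTHONHOME",
            "PATH", "LD_LIBRARY_PATH")
    return {
        "executable": sys.executable,
        "version": "%d.%d.%d" % sys.version_info[:3],
        "env": dict((k, environ.get(k)) for k in keys),
    }


report = python_env_report()
print("driver python: %s (%s)" % (report["executable"], report["version"]))
for name, value in sorted(report["env"].items()):
    print("%s=%s" % (name, value))

# In a %pyspark paragraph, where Zeppelin provides `sc`, the executors can be
# checked too (left commented out so the snippet runs without Spark):
# print(sc.range(2).map(lambda _: sys.executable).distinct().collect())
```

Running this in both the default and the Python 3 paragraph and comparing the output should show whether the interpreter-level values or the ones from zeppelin-env.sh are taking effect.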
On Sat, Oct 27, 2018 at 2:26 AM, Ruslan Dautkhanov <dautkha...@gmail.com> wrote:

> I'd like to give users ability to switch between Python2 and Python3 for
> their PySpark jobs.
> Was somebody able to set up something like this, so they can switch
> between python2 and python3 pyspark interpreters?
>
> For this experiment, created a new %py3spark interpreter, assigned to
> spark interpreter group.
>
> Added following options there for %py3spark: [1]
> /opt/cloudera/parcels/Anaconda3 is our Anaconda python3 home that's
> available on all worker nodes and on zeppelin server too.
>
> For default %pyspark interpreter it's very similar to [1], except all
> paths have "/opt/cloudera/parcels/Anaconda" instead of
> "/opt/cloudera/parcels/Anaconda3".
>
> Nevertheless, zeppelin_ipythonxxx/ipython_server.py
> seems catching environment variable from zeppelin-env.sh and not from
> interpreter settings.
>
> Zeppelin documentation reads that all uppercase variables will be
> treated as environment variables, so I assume it should overwrite what's
> in zeppelin-env.sh, no?
>
> It seems environment variables at interpreter level are broken - notice
> "pyspark" paragraph has "Anaconda3" and not "Anaconda" in PATH
> (highlighted).
>
> [image: image.png]
>
> [1]
>
> LD_LIBRARY_PATH /opt/cloudera/parcels/Anaconda3/lib
> PATH /usr/java/latest/bin:/opt/cloudera/parcels/Anaconda3/bin:/usr/local/bin:/usr/bin:/usr/local/sbin:/usr/sbin:/home/rdautkha/bin
> PYSPARK_DRIVER_PYTHON /opt/cloudera/parcels/Anaconda3/bin/python
> PYSPARK_PYTHON /opt/cloudera/parcels/Anaconda3/bin/python
> PYTHONHOME /opt/cloudera/parcels/Anaconda3
>
> spark.executorEnv.LD_LIBRARY_PATH /opt/cloudera/parcels/Anaconda3/lib
> spark.executorEnv.PYSPARK_PYTHON /opt/cloudera/parcels/Anaconda3/bin/python
> spark.pyspark.driver.python /opt/cloudera/parcels/Anaconda3/bin/python
> spark.pyspark.python /opt/cloudera/parcels/Anaconda3/bin/python
> spark.yarn.appMasterEnv.PYSPARK_PYTHON /opt/cloudera/parcels/Anaconda3/bin/python
>
> --
> Ruslan Dautkhanov