The provided Spark EC2
scripts<https://spark.incubator.apache.org/docs/0.9.0/ec2-scripts.html>and
default AMI ship with Python 2.6.8.

I would like to use Python 2.7.5 or later. I believe that among the 2.x
versions, 2.7 is the most popular.

What's the easiest way to get my Spark cluster on Python 2.7 and working
correctly?

For starters, I naively upgraded Python on the master node using the
instructions 
here<http://www.lecloud.net/post/61401763496/install-update-to-python-2-7-and-latest-pip-on-ec2>.
The pyspark shell started up fine but a basic map on an RDD that used to
work fine failed. Perhaps I just need to upgrade all the nodes in my
cluster, and perhaps it's not that simple. I thought I'd check here first.

Nick




--
View this message in context: 
http://apache-spark-user-list.1001560.n3.nabble.com/Running-Spark-with-Python-2-7-5-tp2131.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.

Reply via email to