The provided Spark EC2 scripts<https://spark.incubator.apache.org/docs/0.9.0/ec2-scripts.html>and default AMI ship with Python 2.6.8.
I would like to use Python 2.7.5 or later. I believe that among the 2.x versions, 2.7 is the most popular. What's the easiest way to get my Spark cluster on Python 2.7 and working correctly? For starters, I naively upgraded Python on the master node using the instructions here<http://www.lecloud.net/post/61401763496/install-update-to-python-2-7-and-latest-pip-on-ec2>. The pyspark shell started up fine but a basic map on an RDD that used to work fine failed. Perhaps I just need to upgrade all the nodes in my cluster, and perhaps it's not that simple. I thought I'd check here first. Nick -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Running-Spark-with-Python-2-7-5-tp2131.html Sent from the Apache Spark User List mailing list archive at Nabble.com.
