I found this solution: https://stackoverflow.com/questions/3390484/python-hashlib-md5-differs-between-linux-windows
Does anybody see a reason why I shouldn't put in a PR to make this change? FROM with open(tgz_file_path) as tar: TO with open(tgz_file_path, "rb") as tar: On Mon, Aug 24, 2015 at 11:58 AM, Justin Pihony <justin.pih...@gmail.com> wrote: > Additional info...If I use an online md5sum check then it matches...So, > it's either windows or python (using 2.7.10) > > On Mon, Aug 24, 2015 at 11:54 AM, Justin Pihony <justin.pih...@gmail.com> > wrote: > >> When running the spark_ec2.py script, I'm getting a wrong md5sum. I've now >> seen this on two different machines. I am running on windows, but I would >> imagine that shouldn't affect the md5. Is this a boto problem, python >> problem, spark problem? >> >> >> >> -- >> View this message in context: >> http://apache-spark-user-list.1001560.n3.nabble.com/Got-wrong-md5sum-for-boto-tp24420.html >> Sent from the Apache Spark User List mailing list archive at Nabble.com. >> >> --------------------------------------------------------------------- >> To unsubscribe, e-mail: user-unsubscr...@spark.apache.org >> For additional commands, e-mail: user-h...@spark.apache.org >> >> >