I have the option of running PySpark with Python 2.7 or Python 3.5. I am
fairly expert with Python and know the Python-side history of the
differences. All else being the same, I have a preference for Python
3.5. I'm using CDH 5.8 and I'm wondering if that biases whether I
should proceed with PySpark on top of Python 2.7 or 3.5. Opinions? Does
Cloudera have an official (or unofficial) position on this?
Thanks,
Ian
_______________________________
Ian Stokes-Rees
Computational Scientist
Continuum Analytics <http://continuum.io>
@ijstokes Twitter <http://twitter.com/ijstokes> LinkedIn
<http://linkedin.com/in/ijstokes> Github
<http://github.com/ijstokes>617.942.0218