GitHub user holdenk commented on the issue:
https://github.com/apache/spark/pull/14963
@Stibbons the Jenkins environment for Spark does not run OS X, but since a lot
of the developers work on OS X I figured it would be good to test there too. I
think we should figure out why @sethah is getting a different result than you
and I are.
As for Conda, I'm not sure I 100% agree. Looking at
https://www.continuum.io/blog/developer-blog/using-anaconda-pyspark-distributed-language-processing-hadoop-cluster
and
http://blog.cloudera.com/blog/2016/02/making-python-on-apache-hadoop-easier-with-anaconda-and-cdh/
it seems like many people are looking for ways to use Conda together with
PySpark, so I think we should consider the possibility of running on a machine
with Conda installed (but we can investigate other workarounds besides the one
I proposed in the PR to your branch) :)