Github user holdenk commented on a diff in the pull request:

    https://github.com/apache/spark/pull/15659#discussion_r85372583
  
    --- Diff: python/README.md ---
    @@ -0,0 +1,32 @@
    +# Apache Spark
    +
    +Spark is a fast and general cluster computing system for Big Data. It 
provides
    +high-level APIs in Scala, Java, Python, and R, and an optimized engine that
    +supports general computation graphs for data analysis. It also supports a
    +rich set of higher-level tools including Spark SQL for SQL and DataFrames,
    +MLlib for machine learning, GraphX for graph processing,
    +and Spark Streaming for stream processing.
    +
    +<http://spark.apache.org/>
    +
    +## Online Documentation
    +
    +You can find the latest Spark documentation, including a programming
    +guide, on the [project web 
page](http://spark.apache.org/documentation.html)
    +
    +
    +## Python Packaging
    +
    +This README file only contains basic information related to pip installed 
PySpark.
    +This packaging is currently experimental and may change in future versions 
(although we will do our best to keep compatibility).
    +Using PySpark requires the Spark JARs, and if you are building this from 
source please see the builder instructions at
    +["Building 
Spark"](http://spark.apache.org/docs/latest/building-spark.html).
    +
    +The Python packaging for Spark is not intended to replace all of the other 
use cases. This Python packaged version of Spark is suitable for interacting 
with an existing cluster (be it Spark standalone, YARN, or Mesos) - but does 
not contain the tools required to setup your own standalone Spark cluster. You 
can download the full version of Spark from the [Apache Spark downloads 
page](http://spark.apache.org/downloads.html).
    --- End diff --
    
    Indeed! That is the goal, and then we can have all of the nice things of 
having virtualenvs, just importing PySpark rather than importing a package to 
find spark and then importing spark, and rainbows and kittens*.
    
    (*rainbows and kittens are not a gaurantee - see vendor for details :p)


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to