Github user mnazbro commented on a diff in the pull request:

    https://github.com/apache/spark/pull/8318#discussion_r41315226
  
    --- Diff: python/setup.py ---
    @@ -0,0 +1,18 @@
    +#!/usr/bin/env python
    +
    +from setuptools import setup
    +
    +exec(compile(open("pyspark/pyspark_version.py").read(), 
    +   "pyspark/pyspark_version.py", 'exec'))
    +VERSION = __version__
    +
    +setup(name='pyspark',
    +    version=VERSION,
    +    description='Apache Spark Python API',
    +    author='Spark Developers',
    +    author_email='[email protected]',
    +    url='https://github.com/apache/spark/tree/master/python',
    +    packages=['pyspark', 'pyspark.mllib', 'pyspark.ml', 'pyspark.sql', 
'pyspark.streaming'],
    +    install_requires=['numpy>=1.7', 'py4j==0.8.2.1', 'pandas'],
    --- End diff --
    
    An alternative which will work for both of you would be to have an 
extra_requires field which has extra requirements just like ipython does for 
its notebook.
    
    So there could be something like:
    
        extra_requires={
            "sql": ["numpy>=1.7", "pandas"]
        }
    
    That way you can install pyspark with `pip install pyspark` and pyspark for 
spark sql with `pip install pyspark[sql]`.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to