Github user HyukjinKwon commented on a diff in the pull request:
https://github.com/apache/spark/pull/20089#discussion_r158774077
--- Diff: python/README.md ---
@@ -29,4 +29,4 @@ The Python packaging for Spark is not intended to replace
all of the other use c
## Python Requirements
-At its core PySpark depends on Py4J (currently version 0.10.6), but
additional sub-packages have their own requirements (including numpy and
pandas).
+At its core PySpark depends on Py4J (currently version 0.10.6), but
additional sub-packages have their own requirements (including numpy, pandas,
and pyarrow).
--- End diff --
Yea, Pandas and PyArrow are optional. Maybe, it's nicer if we have some
more details here too.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]