Github user ifilonenko commented on a diff in the pull request:
https://github.com/apache/spark/pull/13599#discussion_r195920245
--- Diff: python/pyspark/context.py ---
@@ -1035,6 +1044,46 @@ def getConf(self):
conf.setAll(self._conf.getAll())
return conf
+ @since(2.4)
+ def install_packages(self, packages):
+ """
+ Install python packages on all executors and driver through pip. pip will be installed
+ by default no matter using native virtualenv or conda. So it is guaranteed that pip is
+ available if virtualenv is enabled.
--- End diff --
On Kubernetes, this will only be the case if you specify the `spark-py`
image, so this guarantee will need to be expanded per cluster manager.
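
To illustrate the concern, here is a minimal sketch of a guard that checks for pip before attempting any install, rather than assuming it is present. The helper names `pip_available` and `check_pip_or_fail` and the `cluster_manager` argument are hypothetical and not part of this PR; the only real call used is the standard library's `importlib.util.find_spec`.

```python
import importlib.util


def pip_available():
    """Return True if the pip module can be located by this interpreter."""
    return importlib.util.find_spec("pip") is not None


def check_pip_or_fail(cluster_manager):
    # `cluster_manager` is a hypothetical label, used only in the error message.
    if not pip_available():
        raise RuntimeError(
            "pip is not available in this Python environment "
            "(cluster manager: %s)" % cluster_manager
        )
```

Something along these lines would fail fast on an image that does not ship pip instead of relying on the docstring's guarantee.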
---