Hello, I am new to Spark + Jupyter and am setting them up for our data analysis team. I have one question for which I cannot really find an answer anywhere - hope someone can help here.
I have set up a multi-host Spark cluster and have also successfully installed Jupyter with JupyterHub. This setup will be shared among several data analysis teams. The Spark cluster is provisioned with some common Python libraries, but each user may require additional libraries for their experimentation from time to time. Is it possible for a Jupyter user to install Python dependencies for his/her notebook, so that the dependencies are available on all Spark cluster nodes before the user runs the notebook through Jupyter? I read about line magics (%AddDeps) in Apache Toree, but I did not find any information on adding Python dependencies. Thanks, Mana
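To make the question concrete, here is a rough sketch of the kind of workflow I am imagining (the package name `my_lib` is just a placeholder, and this is only my guess at how it might work): the user packages their extra library as a zip and ships it to the executors with `SparkContext.addPyFile`, which accepts `.py`, `.zip`, and `.egg` files.

```python
import os
import tempfile
import zipfile

# Stand-in for a library the user installed locally; my_lib is hypothetical.
tmp = tempfile.mkdtemp()
pkg_dir = os.path.join(tmp, "my_lib")
os.makedirs(pkg_dir)
with open(os.path.join(pkg_dir, "__init__.py"), "w") as f:
    f.write("def greet():\n    return 'hello from my_lib'\n")

# Zip the package so Spark can distribute it as a single file.
zip_path = os.path.join(tmp, "my_lib.zip")
with zipfile.ZipFile(zip_path, "w") as zf:
    zf.write(os.path.join(pkg_dir, "__init__.py"), "my_lib/__init__.py")

# In a notebook with a live SparkContext `sc`, the user would then run:
#   sc.addPyFile(zip_path)
# after which `import my_lib` should work inside tasks on every executor.
```

I am not sure whether this per-notebook approach is the recommended one in a shared JupyterHub setup, or whether dependencies are expected to be installed on all nodes by an administrator instead - that is exactly what I am trying to find out.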
