Hi,

I’m experiencing a strange issue when running a toree kernel with jupyterhub. 
The python version used for spark in the kernel.json is 2.7, I verified that in 
the notebook itself, but in the jupyterhub logs, I see errors from two python 
files, created by ambari:

/usr/bin/hdp-select
/etc/hadoop/conf/topology_script.py

The errors come from any code which works in python 2, but not 3, since ambari 
needs python 2 to run. Unfortunately, jupyterhub needs python 3. I’m not sure 
why Toree is using python 3 with these files, instead of the python 2, 
specified in the kernel.json. I tested with a pyspark notebook and did not have 
the same issue, so it seems to be something related to toree’s integration with 
jupyterhub.

I’ve updated the files to handle both python 2 & 3, but they seem to be 
recreated when ambari and the cluster restart. I’m looking for a more stable 
long term solution.

Any Ideas?

Ian

Reply via email to