Re: [jupyter] Re: need rewrite.xml and service.xml for Jupyter exposed via knox

2018-12-19 Thread Lian Jiang
No. It is more complicated than I expected. I did not use knox for jupyter in the end. On Wed, Dec 19, 2018 at 6:03 PM Z wrote: > Hey Lian, >> > > Did you manage to get those files? I am also looking for those files. > > -- > You received this message because you are subscribed to the Google

[jupyter] Re: jupyter cannot find oracle jdbc driver

2018-11-19 Thread Lian Jiang
Never mind. Problem solved. On Mon, Nov 19, 2018 at 12:09 PM Lian Jiang wrote: > Hi, > > I am trying to use oracle jdbc to load the table from oracle db but got > error. > > The code: > ### > import findspark > findspark.init() > >

[jupyter] jupyter cannot find oracle jdbc driver

2018-11-19 Thread Lian Jiang
Hi, I am trying to use oracle jdbc to load the table from oracle db but got error. The code: ### import findspark findspark.init() from pyspark import SparkContext, SQLContext import os os.environ['PYSPARK_SUBMIT_ARGS'] = '--master yarn --deploy-mode client --driver-cores 4

Re: [jupyter] spark in jupyter cannot find a class in a jar

2018-11-14 Thread Lian Jiang
Could anybody help? Thanks a lot. On Fri, Nov 9, 2018 at 3:50 PM Lian Jiang wrote: > I am using spark in Jupyter as below: > > import findspark > findspark.init() > > from pyspark import SQLContext, SparkContext > sqlCtx = SQLContext(sc) > df = sqlCtx.read.parquet(

[jupyter] spark in jupyter cannot find a class in a jar

2018-11-09 Thread Lian Jiang
I am using spark in Jupyter as below: import findspark findspark.init() from pyspark import SQLContext, SparkContext sqlCtx = SQLContext(sc) df = sqlCtx.read.parquet("oci://mybucket@mytenant/myfile.parquet") The error is: Py4JJavaError: An error occurred while calling o198.parquet. :

[jupyter] need rewrite.xml and service.xml for Jupyter exposed via knox

2018-09-12 Thread Lian Jiang
I want to expose Jupyter via KNOX on a HDP3 cluster. Does anybody have the routing files (aka service.xml and rewrite.xml)? Knox community does not have and it is not easy to compose from scratch. Expose Jupyter via Knox on a hadoop cluster should be a common requirement. Appreciate any hints.