Dharmesh Kakadia created SPARK-27287:
----------------------------------------

             Summary: PCAModel.load() does not honor spark configs
                 Key: SPARK-27287
                 URL: https://issues.apache.org/jira/browse/SPARK-27287
             Project: Spark
          Issue Type: Bug
          Components: ML
    Affects Versions: 2.4.0
            Reporter: Dharmesh Kakadia


PCAModel.load() does not seem to be using the configurations set on the current 
spark session. 

Repro:

 

The following will fail to read the data because the storage account 
credentials config used/propagated. 

conf.set("fs.azure.account.key.test.blob.core.windows.net","Xosad==")

spark = 
SparkSession.builder.appName("dharmesh").config(conf=conf).master('spark://spark-master:7077').getOrCreate()

model = PCAModel.load('wasb://[email protected]/model')

 

The following however works:

conf.set("fs.azure.account.key.test.blob.core.windows.net","Xosad==")

spark = 
SparkSession.builder.appName("dharmesh").config(conf=conf).master('spark://spark-master:7077').getOrCreate()

blah = spark.read.json('wasb://[email protected]/somethingelse/')

blah.show()

model = PCAModel.load('wasb://[email protected]/model')

 

It looks like spark.read...() does force the use of the config once and then 
PCAModel.load() will work correctly. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to