[jira] [Updated] (SPARK-9280) New HiveContext object unexpectedly loads configuration settings from history

Tien-Dung LE (JIRA) Thu, 23 Jul 2015 08:01:00 -0700

     [ 
https://issues.apache.org/jira/browse/SPARK-9280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Tien-Dung LE updated SPARK-9280:
--------------------------------
    Description: 
In a spark-shell session, stopping a spark context and create a new spark 
context and hive context does not clean the spark sql configuration. More 
precisely, the new hive context still keeps the previous configuration 
settings. It would be great if someone can let us know how to avoid this 
situation.

{code:title=New hive context should not load the configurations from history}
case class Foo ( x: Int = (math.random * 1e3).toInt)
val foo = (1 to 100).map(i => Foo()).toDF
foo.saveAsParquetFile( "foo" )
sqlContext.setConf( "spark.sql.shuffle.partitions", "10")

sc.stop

val sparkConf2 = new org.apache.spark.SparkConf()
val sc2 = new org.apache.spark.SparkContext( sparkConf2 ) 
val sqlContext2 = new org.apache.spark.sql.hive.HiveContext( sc2 )

sqlContext2.getConf( "spark.sql.shuffle.partitions", "20") 
// got 20 as expected
val foo2 = sqlContext2.parquetFile( "foo" )
sqlContext2.getConf( "spark.sql.shuffle.partitions", "30")
// expected 30 but got 10
{code}

  was:
In a spark-shell session, stopping a spark context and create a new spark 
context and hive context does not clean the spark sql configuration. More 
precisely, the new hive context still keeps the previous configuration 
settings. It would be great if someone can let us know how to avoid this 
situation.

{code:title=New hive context should not load the configurations from history}
case class Foo ( x: Int = (math.random * 1e3).toInt)
val foo = (1 to 100).map(i => Foo()).toDF
foo.saveAsParquetFile( "foo" )
sqlContext.setConf( "spark.sql.shuffle.partitions", "10")

sc.stop

val sparkConf2 = new org.apache.spark.SparkConf()
val sc2 = new org.apache.spark.SparkContext( sparkConf2 ) 
val sqlContext2 = new org.apache.spark.sql.hive.HiveContext( sc2 )

sqlContext2.getConf( "spark.sql.shuffle.partitions", "20") 
val foo2 = sqlContext2.parquetFile( "foo" )
sqlContext2.getConf( "spark.sql.shuffle.partitions", "30")
// expected 30 but got 10
{code}


> New HiveContext object unexpectedly loads configuration settings from history 
> ------------------------------------------------------------------------------
>
>                 Key: SPARK-9280
>                 URL: https://issues.apache.org/jira/browse/SPARK-9280
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>    Affects Versions: 1.3.1
>            Reporter: Tien-Dung LE
>
> In a spark-shell session, stopping a spark context and create a new spark 
> context and hive context does not clean the spark sql configuration. More 
> precisely, the new hive context still keeps the previous configuration 
> settings. It would be great if someone can let us know how to avoid this 
> situation.
> {code:title=New hive context should not load the configurations from history}
> case class Foo ( x: Int = (math.random * 1e3).toInt)
> val foo = (1 to 100).map(i => Foo()).toDF
> foo.saveAsParquetFile( "foo" )
> sqlContext.setConf( "spark.sql.shuffle.partitions", "10")
> sc.stop
> val sparkConf2 = new org.apache.spark.SparkConf()
> val sc2 = new org.apache.spark.SparkContext( sparkConf2 ) 
> val sqlContext2 = new org.apache.spark.sql.hive.HiveContext( sc2 )
> sqlContext2.getConf( "spark.sql.shuffle.partitions", "20") 
> // got 20 as expected
> val foo2 = sqlContext2.parquetFile( "foo" )
> sqlContext2.getConf( "spark.sql.shuffle.partitions", "30")
> // expected 30 but got 10
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Updated] (SPARK-9280) New HiveContext object unexpectedly loads configuration settings from history

Reply via email to