[
https://issues.apache.org/jira/browse/SPARK-9280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tien-Dung LE updated SPARK-9280:
--------------------------------
Description:
In a spark-shell session, stopping a spark context and create a new spark
context and hive context does not clean the spark sql configuration. More
precisely, the new hive context still keeps the previous configuration
settings. It would be great if someone can let us know how to avoid this
situation.
{code:title=New hive context should not load the configurations from history}
case class Foo ( x: Int = (math.random * 1e3).toInt)
val foo = (1 to 100).map(i => Foo()).toDF
foo.saveAsParquetFile( "foo" )
sqlContext.setConf( "spark.sql.shuffle.partitions", "10")
sc.stop
val sparkConf2 = new org.apache.spark.SparkConf()
val sc2 = new org.apache.spark.SparkContext( sparkConf2 )
val sqlContext2 = new org.apache.spark.sql.hive.HiveContext( sc2 )
sqlContext2.getConf( "spark.sql.shuffle.partitions", "20")
// got 20 as expected
val foo2 = sqlContext2.parquetFile( "foo" )
sqlContext2.getConf( "spark.sql.shuffle.partitions", "30")
// expected 30 but got 10
{code}
was:
In a spark-shell session, stopping a spark context and create a new spark
context and hive context does not clean the spark sql configuration. More
precisely, the new hive context still keeps the previous configuration
settings. It would be great if someone can let us know how to avoid this
situation.
{code:title=New hive context should not load the configurations from history}
case class Foo ( x: Int = (math.random * 1e3).toInt)
val foo = (1 to 100).map(i => Foo()).toDF
foo.saveAsParquetFile( "foo" )
sqlContext.setConf( "spark.sql.shuffle.partitions", "10")
sc.stop
val sparkConf2 = new org.apache.spark.SparkConf()
val sc2 = new org.apache.spark.SparkContext( sparkConf2 )
val sqlContext2 = new org.apache.spark.sql.hive.HiveContext( sc2 )
sqlContext2.getConf( "spark.sql.shuffle.partitions", "20")
val foo2 = sqlContext2.parquetFile( "foo" )
sqlContext2.getConf( "spark.sql.shuffle.partitions", "30")
// expected 30 but got 10
{code}
> New HiveContext object unexpectedly loads configuration settings from history
> ------------------------------------------------------------------------------
>
> Key: SPARK-9280
> URL: https://issues.apache.org/jira/browse/SPARK-9280
> Project: Spark
> Issue Type: Bug
> Components: SQL
> Affects Versions: 1.3.1
> Reporter: Tien-Dung LE
>
> In a spark-shell session, stopping a spark context and create a new spark
> context and hive context does not clean the spark sql configuration. More
> precisely, the new hive context still keeps the previous configuration
> settings. It would be great if someone can let us know how to avoid this
> situation.
> {code:title=New hive context should not load the configurations from history}
> case class Foo ( x: Int = (math.random * 1e3).toInt)
> val foo = (1 to 100).map(i => Foo()).toDF
> foo.saveAsParquetFile( "foo" )
> sqlContext.setConf( "spark.sql.shuffle.partitions", "10")
> sc.stop
> val sparkConf2 = new org.apache.spark.SparkConf()
> val sc2 = new org.apache.spark.SparkContext( sparkConf2 )
> val sqlContext2 = new org.apache.spark.sql.hive.HiveContext( sc2 )
> sqlContext2.getConf( "spark.sql.shuffle.partitions", "20")
> // got 20 as expected
> val foo2 = sqlContext2.parquetFile( "foo" )
> sqlContext2.getConf( "spark.sql.shuffle.partitions", "30")
> // expected 30 but got 10
> {code}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]