Hm, this is a common confusion... Although the variable name is
`sqlContext` in Spark shell, it's actually a `HiveContext`, which
extends `SQLContext` and can communicate with the Hive metastore.
So your program needs to instantiate an
`org.apache.spark.sql.hive.HiveContext` instead.
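A minimal sketch of the change, assuming your Spark 1.3 Java setup
(same app name and query as in your snippet):

import org.apache.spark.SparkConf;
import org.apache.spark.api.java.JavaSparkContext;
import org.apache.spark.sql.DataFrame;
import org.apache.spark.sql.hive.HiveContext;

SparkConf conf = new SparkConf().setAppName("Simple SQL Client");
JavaSparkContext sc = new JavaSparkContext(conf);
// HiveContext picks up hive-site.xml from the classpath and talks to
// the metastore, so the Hive tables become visible to the application.
HiveContext sqlContext = new HiveContext(sc.sc());
DataFrame res = sqlContext.sql("show tables");
res.show();

Note that this assumes your Spark build includes Hive support (the
spark-hive module on the classpath); a plain SQLContext never consults
the Hive metastore, which is why your application cannot see the tables.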
Cheng
On 6/10/15 10:19 AM, James Pirz wrote:
I am using Spark (standalone) to run queries (from a remote client)
against data in tables that are already defined/loaded in Hive.
I have started the metastore service in Hive successfully, and by
putting hive-site.xml, with the proper hive.metastore.uris setting, in
the $SPARK_HOME/conf directory, I tried to share its config with Spark.
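For reference, the relevant entry in my hive-site.xml looks roughly
like this (the host below is a placeholder for my actual metastore
address; 9083 is the default metastore port):

<property>
  <name>hive.metastore.uris</name>
  <!-- placeholder host; 9083 is the default metastore Thrift port -->
  <value>thrift://my-metastore-host:9083</value>
</property>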
When I start spark-shell, it gives me a default sqlContext, and I can
use that to access my Hive tables with no problem.
But once I submit a similar query from a Spark application through
'spark-submit', it does not see the tables, and it seems it does not
pick up the hive-site.xml under Spark's conf directory. I tried using
the '--files' argument with spark-submit to pass 'hive-site.xml' to
the workers, but it did not change anything.
Here is how I try to run the application:
$SPARK_HOME/bin/spark-submit --class "SimpleClient" --master
spark://my-spark-master:7077 --files=$SPARK_HOME/conf/hive-site.xml
simple-sql-client-1.0.jar
Here is the simple example code that I am trying to run (in Java):
import org.apache.spark.SparkConf;
import org.apache.spark.api.java.JavaSparkContext;
import org.apache.spark.sql.DataFrame;
import org.apache.spark.sql.SQLContext;

SparkConf conf = new SparkConf().setAppName("Simple SQL Client");
JavaSparkContext sc = new JavaSparkContext(conf);
SQLContext sqlContext = new org.apache.spark.sql.SQLContext(sc);
DataFrame res = sqlContext.sql("show tables");
res.show();
Here are the software versions:
Spark: 1.3
Hive: 1.2
Hadoop: 2.6
Thanks in advance for any suggestion.