[
https://issues.apache.org/jira/browse/HIVE-12616?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15052100#comment-15052100
]
Xuefu Zhang commented on HIVE-12616:
------------------------------------
Thanks for the explanation. I guess the problem is that user didn't set
spark.master explicitly, Hive's default, yarn-cluster, is set only for the
HiveConf of the first operation.
I think we should set "spark.master" in session level HiveConf. It seems we
just need to add one line doing that in the if block below:
{code}
// load properties from hive configurations, including both spark.*
properties,
// properties for remote driver RPC, and yarn properties for Spark on YARN
mode.
String sparkMaster = hiveConf.get("spark.master");
if (sparkMaster == null) {
sparkMaster = sparkConf.get("spark.master");
hiveConf.set("spark.master", sparkMaster);
}
{code}
> NullPointerException when spark session is reused to run a mapjoin
> ------------------------------------------------------------------
>
> Key: HIVE-12616
> URL: https://issues.apache.org/jira/browse/HIVE-12616
> Project: Hive
> Issue Type: Bug
> Components: Spark
> Affects Versions: 1.3.0
> Reporter: Nemon Lou
> Assignee: Nemon Lou
> Attachments: HIVE-12616.patch
>
>
> The way to reproduce:
> {noformat}
> set hive.execution.engine=spark;
> create table if not exists test(id int);
> create table if not exists test1(id int);
> insert into test values(1);
> insert into test1 values(1);
> select max(a.id) from test a ,test1 b
> where a.id = b.id;
> {noformat}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)