Chao Sun created HIVE-16328:
-------------------------------
Summary: HoS: more aggressive mapjoin optimization when
hive.spark.use.file.size.for.mapjoin is true
Key: HIVE-16328
URL: https://issues.apache.org/jira/browse/HIVE-16328
Project: Hive
Issue Type: Bug
Components: Logical Optimizer
Reporter: Chao Sun
Assignee: Chao Sun
In HIVE-15489, when {{hive.spark.use.file.size.for.mapjoin}} is set to true,
and if the JOIN op has any upstream RS operator, then we will stop converting
the JOIN op to MAPJOIN op.
However, this is overly conservative. A better solution is to treat the branch
that has upstream RS as the big table and check if all other branches are
map-only AND can fit in hash table size.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)