Jesus Camacho Rodriguez created HIVE-14096:
----------------------------------------------
Summary: Extend RS dedup logic to merge GBy operators
Key: HIVE-14096
URL: https://issues.apache.org/jira/browse/HIVE-14096
Project: Hive
Issue Type: Bug
Components: Physical Optimizer
Affects Versions: 2.2.0
Reporter: Jesus Camacho Rodriguez
Since we always generate map-side GBy at plan generation time, there are
occasion when we could collapse GBy after RS dedup optimization. This means
that GBy would be executed in a single stage with {{mode = complete}}.
Example in {{reduce_deduplicate_extended2.q.out}}:
{noformat}
SELECT f.key, g.value
FROM src f
JOIN src g ON (f.key = g.key AND f.value = g.value)
GROUP BY g.value, f.key
{noformat}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)