Jesus Camacho Rodriguez created HIVE-14096:
----------------------------------------------

             Summary: Extend RS dedup logic to merge GBy operators
                 Key: HIVE-14096
                 URL: https://issues.apache.org/jira/browse/HIVE-14096
             Project: Hive
          Issue Type: Bug
          Components: Physical Optimizer
    Affects Versions: 2.2.0
            Reporter: Jesus Camacho Rodriguez


Since we always generate map-side GBy at plan generation time, there are 
occasion when we could collapse GBy after RS dedup optimization. This means 
that GBy would be executed in a single stage with {{mode = complete}}.

Example in {{reduce_deduplicate_extended2.q.out}}:

{noformat}
SELECT f.key, g.value
FROM src f
JOIN src g ON (f.key = g.key AND f.value = g.value)
GROUP BY g.value, f.key
{noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to