Seonggon Namgung created HIVE-28833:
---------------------------------------

             Summary: Do not perform SharedWorkOptimization if the merge 
results in an excessively heavy MapJoin workload.
                 Key: HIVE-28833
                 URL: https://issues.apache.org/jira/browse/HIVE-28833
             Project: Hive
          Issue Type: Improvement
            Reporter: Seonggon Namgung


SharedWorkOptimization merges two TableScan operators, which is equivalent to 
merging two Map vertices into one. Suppose we are merging two TableScan 
operators, and the corresponding Map vertices contain MapJoin operators. In 
this case, merging the two TableScan operators results in a large Map vertex 
that contains both MapJoin operators, which could lead to high memory pressure.

To avoid potential reliability issues, we should not perform SWO if the merge 
results in an excessively heavy MapJoin workload. This might be done by 
evaluating the number and/or the size of the MapJoin operators before merging.

This issue was first observed with an increase in OOM errors in the CI 
environment after applying HIVE-26986. As a temporary solution, HIVE-28832 
disabled SWO in the affected query file. This fix should be rolled back once 
the underlying issue is resolved.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to