Chao created HIVE-8920:
--------------------------

             Summary: SplitSparkWorkResolver doesn't work with UnionWork
                 Key: HIVE-8920
                 URL: https://issues.apache.org/jira/browse/HIVE-8920
             Project: Hive
          Issue Type: Sub-task
          Components: Spark
    Affects Versions: spark-branch
            Reporter: Chao


The following query will not work:
{code}
from (select * from table0 union all select * from table1) s
insert overwrite table table3 select s.x, count(1) group by s.x
insert overwrite table table4 select s.y, count(1) group by s.y;
{code}

Currently, the plan for this query, before SplitSparkWorkResolver, looks like 
below:

{noformat}
   M1    M2
     \  / \
      U3   R5
      |
      R4
{noformat}

In {{SplitSparkWorkResolver#splitBaseWork}}, it assumes that the {{childWork}} 
is a ReduceWork, but for this case, you can see that for M2 the childWork could 
be UnionWork U3. Thus, the code will fail.





--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to