[
https://issues.apache.org/jira/browse/HIVE-1047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Carl Steinbach updated HIVE-1047:
---------------------------------
Component/s: Query Processor
Issue Type: Improvement (was: Bug)
> Merge tasks in GenMRUnion1
> --------------------------
>
> Key: HIVE-1047
> URL: https://issues.apache.org/jira/browse/HIVE-1047
> Project: Hadoop Hive
> Issue Type: Improvement
> Components: Query Processor
> Affects Versions: 0.6.0
> Reporter: Ning Zhang
> Assignee: Ning Zhang
> Fix For: 0.6.0
>
> Attachments: HIVE-1047.patch
>
>
> In the following query:
> from (select * from src union all select * from src) s
> insert overwrite table src_multi1 select * where key < 10
> insert overwrite table src_multi2 select * where key > 10 and key < 20;
> There are two topOps (TableScaneOperator) for the same MapRed task. In
> genTableScan1, each TableScanOperator will create a new task as currTask. The
> genMRUnion1 should merge two tasks into one. Currently GenMRUnion1 does not
> merge currTask, this will cause down stream operators like genFileSink1 to
> do some hacks to effectively merge the two tasks. A cleaner way is to merge
> the tasks in GenMRUnion1 as done by join operators etc.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.