[jira] Commented: (HIVE-1695) MapJoin followed by ReduceSink should be done as single MapReduce Job

He Yongqiang (JIRA) Wed, 01 Dec 2010 14:36:47 -0800

    [ 
https://issues.apache.org/jira/browse/HIVE-1695?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12965892#action_12965892
 ]


He Yongqiang commented on HIVE-1695:
------------------------------------

To be accurate, MapJoin followed by ReduceSink followed by GroupBy should run 
as a single MapReduce job.

At some point during plan generation (for example, when processing the 
MapJoin%SEL rule), we know that this MapJoin is followed by a ReduceSink. 
If at that point we also know that the ReduceSink feeds a GroupBy, we can 
simply skip the step that splits the task.

Would that be easier?
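
As a rough illustration of the check described above, here is a toy sketch in 
Python. It is not Hive code: the Op class, the chain helper, the short kind 
tags ("MAPJOIN", "SEL", "RS", "GBY") and can_skip_split are all hypothetical 
names made up for this example. It only models the decision "SEL after a 
MapJoin, followed by a ReduceSink, followed by a GroupBy".

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Op:
    """Toy operator node (hypothetical, not a Hive class)."""
    kind: str                      # e.g. "MAPJOIN", "SEL", "RS", "GBY"
    child: Optional["Op"] = None   # single downstream operator, if any


def chain(*kinds: str) -> Optional[Op]:
    """Build a linear operator chain and return its first operator."""
    head = None
    for kind in reversed(kinds):
        head = Op(kind, head)
    return head


def can_skip_split(select_after_mapjoin: Op) -> bool:
    """True only when the SEL is followed by RS and then GBY.

    Any other shape returns False, meaning the existing behaviour
    (splitting the task) is left untouched in this sketch.
    """
    rs = select_after_mapjoin.child
    if rs is None or rs.kind != "RS":
        return False
    gby = rs.child
    return gby is not None and gby.kind == "GBY"
```

With this model, a chain MAPJOIN -> SEL -> RS -> GBY qualifies for skipping 
the split, while MAPJOIN -> SEL -> RS followed by anything else does not.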

> MapJoin followed by ReduceSink should be done as single MapReduce Job
> ---------------------------------------------------------------------
>
>                 Key: HIVE-1695
>                 URL: https://issues.apache.org/jira/browse/HIVE-1695
>             Project: Hive
>          Issue Type: Improvement
>          Components: Query Processor
>            Reporter: Amareshwari Sriramadasu
>
> Currently, MapJoin followed by ReduceSink runs as two MapReduce jobs: a 
> map-only job followed by a map-reduce job. These can be combined into a 
> single MapReduce job.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
