Optimize map-side scans for right-side of join
----------------------------------------------

                 Key: HIVE-2116
                 URL: https://issues.apache.org/jira/browse/HIVE-2116
             Project: Hive
          Issue Type: Improvement
            Reporter: Ron Bodkin


I had a large query of the form select * from a join b on a.key=b.key where ...;

Table b was too large, so I attempted to optimize by adding constraints on b to 
the where clause, e.g.,

where b.size>=mn and b.size<=mx and ...;

However, the Hive 0.8.0 optimizer evaluated the constraint on b in the reduce 
phase, after the join. That defeats its purpose, since the rows of b are still 
shuffled to the reducers before being filtered.

I was able to force Hive to apply the filter map-side with this workaround:
join (select * from b where size>=mn and size<=mx) b on a.key=b.key where ...;

But it would be nice for Hive to push filters on joined tables down into the 
map phase itself where possible.
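
For illustration, here are the two forms side by side as a minimal sketch. The 
bounds 10 and 100 are placeholders standing in for mn and mx, and the elided 
parts of the real where clause are left out. Prefixing each query with EXPLAIN 
should show which stage the Filter Operator for the constraint on b ends up in.

```sql
-- Sketch only: 10 and 100 are placeholder values for mn and mx.

-- Original form: the constraint on b sits in the outer WHERE clause.
EXPLAIN
SELECT *
FROM a JOIN b ON a.key = b.key
WHERE b.size >= 10 AND b.size <= 100;

-- Workaround: filter b in a subquery before it is joined.
EXPLAIN
SELECT *
FROM a
JOIN (SELECT * FROM b WHERE size >= 10 AND size <= 100) b
  ON a.key = b.key;
```

With the behavior described above, I would expect the first plan to show the 
filter in the reduce stage and the second to show it next to the scan of b.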
