[ 
https://issues.apache.org/jira/browse/HIVE-556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12718687#action_12718687
 ] 

Min Zhou commented on HIVE-556:
-------------------------------

This is very common for us, and it has blocked us badly. We often have one or 
more aux tables with about 10k records that the major table needs to theta join 
against. I don't think the current workaround, a Cartesian product, is a good 
approach; it brings terrible sorting and I/O overhead.
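
For reference, the Cartesian-product workaround mentioned above might look 
roughly like the sketch below. It reuses the table and column names from the 
example in the issue description and assumes a Hive version that accepts a JOIN 
without an ON clause; the exact form used in practice may differ.

{code:sql}
-- Sketch only: every row of tbl is paired with every row of aux_tbl,
-- and the pattern match is applied afterwards as a filter.
SELECT
  a.subid, a.id, t.url
FROM
  tbl t JOIN aux_tbl a
WHERE
  t.dt='20090609'
  AND a.dt='20090609'
  AND t.url rlike a.url_pattern;
{code}

With about 10k aux records, each row of the major table can be expanded into 
up to roughly 10k candidate pairs before the filter is applied, which would 
account for the overhead.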


> let hive support theta join
> ---------------------------
>
>                 Key: HIVE-556
>                 URL: https://issues.apache.org/jira/browse/HIVE-556
>             Project: Hadoop Hive
>          Issue Type: New Feature
>    Affects Versions: 0.4.0
>            Reporter: Min Zhou
>             Fix For: 0.4.0
>
>
> Right now, Hive only supports equality joins (equi-joins). Sometimes that is 
> not enough; we should consider implementing theta joins such as
> {code:sql}
> SELECT
>   a.subid, a.id, t.url
> FROM
>   tbl t JOIN aux_tbl a ON t.url rlike a.url_pattern
> WHERE
>   t.dt='20090609'
>   AND a.dt='20090609';
> {code}
> Any condition expression following 'ON' should be accepted.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
