That's a problem which is hadoop related and not really hive related. The solution is to use only equal (as you know it). For that, you should first extract your real identifier for a, which can be a.pid or a part of it. I assume that you can know it in advance which one will be used.
Bertrand On Thu, Jul 26, 2012 at 5:11 PM, 周彩钦 <caiqinz...@gmail.com> wrote: > Hi all, > > I have problem when using left join with hive 0.7.1. > I have a query below: > > select > a.pid, > b.pid > tab1 a > left join > tab2 b > on (a.pid=b.pid or substr(a.pid,1,27)=b.pid); > > But hive don't support "OR" in left join. > Table a is huge, and table b has 40000 rows now(will increase). > Is there any other solution to achieve this? > > Thanks very much. > > -- > > -- Bertrand Dechoux