Thanks Bertrand,
You said it's hadoop problem, is it means that if I change to use MapReduce
(java MR or streaming), it still can't  achieve the purpose?
PS: I'm not very familiar with java MR and streaming:)  but I have to find
a way to implement it.


On Thu, Jul 26, 2012 at 11:19 PM, Bertrand Dechoux <decho...@gmail.com>wrote:

> That's a problem which is hadoop related and not really hive related.
> The solution is to use only equal (as you know it). For that, you should
> first extract your real identifier for a, which can be a.pid or a part of
> it.
> I assume that you can know it in advance which one will be used.
>
> Bertrand
>
>
>
> On Thu, Jul 26, 2012 at 5:11 PM, 周彩钦 <caiqinz...@gmail.com> wrote:
>
>> Hi all,
>>
>> I have problem when using left join with hive 0.7.1.
>> I have a query below:
>>
>> select
>>   a.pid,
>>   b.pid
>> tab1 a
>>   left join
>> tab2 b
>>   on (a.pid=b.pid or substr(a.pid,1,27)=b.pid);
>>
>> But hive don't support "OR" in left join.
>> Table a is huge, and table b has 40000 rows now(will increase).
>> Is there any other solution to achieve this?
>>
>> Thanks very much.
>>
>> --
>>
>>
>
>
> --
> Bertrand Dechoux
>



-- 
/**********************************************************/
// 姓名:周彩钦
// 联系电话:15210364513
// E-mail:caiqinz...@gmail.com
/**********************************************************/

Reply via email to