I see only 1 reducer running forever. Is this a skew join?
r7raul1...@163.com
From: Eugene Koifman
Date: 2015-05-12 01:43
To: user
CC: r7raul1...@163.com
Subject: Re: hive sql on tez run forever
This isn’t a valid rewrite.
if a(x,y) has 1 row (1,2) and b(x,z) has 1 row (1,1) then the 1st query
Hi,
You’re correct - that is not a valid rewrite.
Both tables have to be shuffled across due to the OR clause with no
reductions.
Cheers,
Gopal
On 5/11/15, 10:43 AM, Eugene Koifman ekoif...@hortonworks.com wrote:
This isn’t a valid rewrite.
if a(x,y) has 1 row (1,2) and b(x,z) has 1 row (1,1)
Hi,
I changed the where condition to (where t.update_time = '2015-05-04'),
and the SQL returns a result after a while, because
t.update_time = '2015-05-04' filters out many rows during the table scan.
But why does changing the where condition to
(where t.update_time = '2015-05-04' or length(t8.end_user_id) > 0)
This isn’t a valid rewrite.
if a(x,y) has 1 row (1,2) and b(x,z) has 1 row (1,1) then the 1st query
will produce 1 row
but the 2nd query with subselects will not.
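
The two queries being compared are not reproduced in this part of the thread, so the following is only a sketch of the counterexample under assumed predicates. It uses SQLite through Python rather than Hive, and the filters `a.y = 2` and `b.z = 2` are hypothetical stand-ins chosen to fit the rows (1,2) and (1,1) above. The first query applies the OR after the join; the second pushes each half of the OR into a subselect on its own table, which effectively turns the OR into an AND.

```python
import sqlite3


def run_counterexample():
    """Compare an OR filter applied after a join with the same
    predicates pushed into per-table subselects.

    The predicates (a.y = 2, b.z = 2) are assumptions for illustration;
    they are not the predicates from the original Hive query.
    """
    conn = sqlite3.connect(":memory:")
    cur = conn.cursor()
    cur.execute("CREATE TABLE a (x INTEGER, y INTEGER)")
    cur.execute("CREATE TABLE b (x INTEGER, z INTEGER)")
    cur.execute("INSERT INTO a VALUES (1, 2)")  # a(x,y) has 1 row (1,2)
    cur.execute("INSERT INTO b VALUES (1, 1)")  # b(x,z) has 1 row (1,1)

    # 1st query: join first, then keep rows where EITHER side matches.
    original = cur.execute(
        "SELECT a.x, a.y, b.z "
        "FROM a JOIN b ON a.x = b.x "
        "WHERE a.y = 2 OR b.z = 2"
    ).fetchall()

    # 2nd query: each half of the OR is pushed into a subselect, so a row
    # survives only if BOTH sides match their own filter.
    rewritten = cur.execute(
        "SELECT sa.x, sa.y, sb.z "
        "FROM (SELECT * FROM a WHERE y = 2) sa "
        "JOIN (SELECT * FROM b WHERE z = 2) sb ON sa.x = sb.x"
    ).fetchall()

    conn.close()
    return original, rewritten


original, rewritten = run_counterexample()
print("join then OR filter:", original)
print("subselect rewrite:  ", rewritten)
```

With these assumed predicates the first query keeps the joined row because `a.y = 2` holds, while the rewrite drops it because the only row in b fails `z = 2` before the join ever sees it.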
On 5/11/15, 10:13 AM, Gopal Vijayaraghavan gop...@apache.org wrote:
Hi,
I changed the where condition to (where t.update_time = '2015-05-04'),
and the SQL returns a result after a while, because t.update_time = '2015-05-04'
filters out many rows during the table scan. But why does changing the where
condition to (where t.update_time = '2015-05-04' or length(t8.end_user_id) > 0), the