Re: Re: hive sql on tez run forever

2015-05-11 Thread r7raul1...@163.com
I see only 1 reducer running forever. Skew join? r7raul1...@163.com From: Eugene Koifman Date: 2015-05-12 01:43 To: user CC: r7raul1...@163.com Subject: Re: hive sql on tez run forever This isn’t a valid rewrite. If a(x,y) has 1 row (1,2) and b(x,z) has 1 row (1,1) then the 1st query

Re: hive sql on tez run forever

2015-05-11 Thread Gopal Vijayaraghavan
Hi, You’re correct - that is not a valid rewrite. Both tables have to be shuffled across due to the OR clause with no reductions. Cheers, Gopal On 5/11/15, 10:43 AM, Eugene Koifman ekoif...@hortonworks.com wrote: This isn’t a valid rewrite. If a(x,y) has 1 row (1,2) and b(x,z) has 1 row (1,1)

Re: hive sql on tez run forever

2015-05-11 Thread Gopal Vijayaraghavan
Hi, I changed the SQL WHERE condition to (where t.update_time = '2015-05-04'), and the query returns a result after a while, because t.update_time = '2015-05-04' filters out many rows during the table scan. But why does changing the WHERE condition to (where t.update_time = '2015-05-04' or length(t8.end_user_id)0)

Re: hive sql on tez run forever

2015-05-11 Thread Eugene Koifman
This isn’t a valid rewrite. If a(x,y) has 1 row (1,2) and b(x,z) has 1 row (1,1), then the 1st query will produce 1 row but the 2nd query with subselects will not. On 5/11/15, 10:13 AM, Gopal Vijayaraghavan gop...@apache.org wrote: Hi, I change the sql where condition to (where t.update_time =
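
The counterexample above can be reproduced with a small sketch. The thread's actual queries are not shown in these excerpts, so the predicates `a.y = 2` and `b.z = 2` below are assumptions chosen only to match the two sample rows, and SQLite (through Python's `sqlite3` module) stands in for Hive. The first query applies the OR condition after the join; the second pushes each half of the OR into its own subselect, which in effect turns the OR into an AND.

```python
import sqlite3

# In-memory database holding the two single-row tables from the counterexample.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE a (x INTEGER, y INTEGER);
    CREATE TABLE b (x INTEGER, z INTEGER);
    INSERT INTO a VALUES (1, 2);
    INSERT INTO b VALUES (1, 1);
""")

# Hypothetical "1st query": the OR predicate is evaluated on the joined row.
# a.y = 2 is true, so the row survives even though b.z = 2 is false.
after_join = conn.execute("""
    SELECT a.x, a.y, b.z
    FROM a JOIN b ON a.x = b.x
    WHERE a.y = 2 OR b.z = 2
""").fetchall()

# Hypothetical "2nd query with subselects": each side is filtered before the join.
# The b row is dropped by z = 2, so the join has nothing to match.
pushed_down = conn.execute("""
    SELECT sa.x, sa.y, sb.z
    FROM (SELECT * FROM a WHERE y = 2) AS sa
    JOIN (SELECT * FROM b WHERE z = 2) AS sb ON sa.x = sb.x
""").fetchall()

print("filter after join:", after_join)
print("filter pushed into subselects:", pushed_down)
```

Because an OR across columns of both tables can only be decided once a joined row exists, neither input can be reduced before the shuffle under these assumptions.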

RE: hive sql on tez run forever

2015-05-11 Thread Mich Talebzadeh
Subject: Re: hive sql on tez run forever Hi, I changed the SQL WHERE condition to (where t.update_time = '2015-05-04'), and the query returns a result after a while, because t.update_time = '2015-05-04' filters out many rows during the table scan. But why does changing the WHERE condition to (where