Re:Re: RE: Re:Re:RE: Re:RE: spark 1.5 SQL slows down dramatically by 50%+ compared with spark 1.4.1 SQL

2015-09-13 Thread Todd
> >> >> >> BTW: spark.shuffle.reduceLocality.enabled is the configuration of Spark, not >> Spark SQL. >> >> >> >> From: Todd [mailto:bit1...@163.com] >> Sent: Friday, September 11, 2015 3:39 PM >> To: Todd >> Cc: Cheng, Hao; Jesse F Ch

Re:RE: spark 1.5 SQL slows down dramatically by 50%+ compared with spark 1.4.1 SQL

2015-09-11 Thread Todd
Thanks Hao for the reply. I turned sort merge join off; the physical plan is below, but the performance is roughly the same as with it on... == Physical Plan == TungstenProject [ss_quantity#10,ss_list_price#12,ss_coupon_amt#19,ss_cdemo_sk#4,ss_item_sk#2,ss_promo_sk#8,ss_sold_date_sk#0]

RE: Re:RE: spark 1.5 SQL slows down dramatically by 50%+ compared with spark 1.4.1 SQL

2015-09-11 Thread Cheng, Hao
. From: Todd [mailto:bit1...@163.com] Sent: Friday, September 11, 2015 2:17 PM To: Cheng, Hao Cc: Jesse F Chen; Michael Armbrust; user@spark.apache.org Subject: Re:RE: spark 1.5 SQL slows down dramatically by 50%+ compared with spark 1.4.1 SQL Thanks Hao for the reply. I turn the merge sort join off

Re:RE: Re:RE: spark 1.5 SQL slows down dramatically by 50%+ compared with spark 1.4.1 SQL

2015-09-11 Thread Todd
5, and it’s true by default, but we found it probably reduces performance dramatically. From: Todd [mailto:bit1...@163.com] Sent: Friday, September 11, 2015 2:17 PM To: Cheng, Hao Cc: Jesse F Chen; Michael Armbrust; user@spark.apache.org Subject: Re:RE: spark 1.5 SQL slows down dr

Re: RE: Re:Re:RE: Re:RE: spark 1.5 SQL slows down dramatically by 50%+ compared with spark 1.4.1 SQL

2015-09-11 Thread Davies Liu
ark SQL. > > > > From: Todd [mailto:bit1...@163.com] > Sent: Friday, September 11, 2015 3:39 PM > To: Todd > Cc: Cheng, Hao; Jesse F Chen; Michael Armbrust; user@spark.apache.org > Subject: Re:Re:RE: Re:RE: spark 1.5 SQL slows down dramatically by 50

RE: Re:Re:RE: Re:RE: spark 1.5 SQL slows down dramatically by 50%+ compared with spark 1.4.1 SQL

2015-09-11 Thread Cheng, Hao
, September 11, 2015 3:39 PM To: Todd Cc: Cheng, Hao; Jesse F Chen; Michael Armbrust; user@spark.apache.org Subject: Re:Re:RE: Re:RE: spark 1.5 SQL slows down dramatically by 50%+ compared with spark 1.4.1 SQL I add the following two options: spark.sql.planner.sortMergeJoin=false

Re:Re:RE: Re:RE: spark 1.5 SQL slows down dramatically by 50%+ compared with spark 1.4.1 SQL

2015-09-11 Thread Todd
. From: Todd [mailto:bit1...@163.com] Sent: Friday, September 11, 2015 2:17 PM To: Cheng, Hao Cc: Jesse F Chen; Michael Armbrust; user@spark.apache.org Subject: Re:RE: spark 1.5 SQL slows down dramatically by 50%+ compared with spark 1.4.1 SQL Thanks Hao for the reply. I turn the merge

Re: Re:Re:RE: Re:RE: spark 1.5 SQL slows down dramatically by 50%+ compared with spark 1.4.1 SQL

2015-09-11 Thread Davies Liu
.@intel.com>, Todd <bit1...@163.com>, Michael > Armbrust <mich...@databricks.com>, "user@spark.apache.org" > <user@spark.apache.org> > Date: 09/11/2015 10:41 AM > Subject: Re: Re:Re:RE: Re:RE: spark 1.5 SQL slows down dramatically by 50%+ > compared w

Re: Re:Re:RE: Re:RE: spark 1.5 SQL slows down dramatically by 50%+ compared with spark 1.4.1 SQL

2015-09-11 Thread Jesse F Chen
e? Not the local mode. Can you print the c > > From: "Cheng, Hao" <hao.ch...@intel.com> > To: Todd <bit1...@163.com> > Cc: Jesse F Chen/San Francisco/IBM@IBMUS, Michael Armbrust <mich...@databricks.com>, "user@spark.apache.org" <user@spark.ap

Re: Re:Re:RE: Re:RE: spark 1.5 SQL slows down dramatically by 50%+ compared with spark 1.4.1 SQL

2015-09-11 Thread Davies Liu
@intel.com> > To: Todd <bit1...@163.com> > Cc: Jesse F Chen/San Francisco/IBM@IBMUS, Michael Armbrust > <mich...@databricks.com>, "user@spark.apache.org" <user@spark.apache.org> > Date: 09/11/2015 01:00 AM > Subject: RE: Re:Re:RE: Re:RE: spark 1.5

RE: Re:Re:RE: Re:RE: spark 1.5 SQL slows down dramatically by 50%+ compared with spark 1.4.1 SQL

2015-09-11 Thread Jesse F Chen
: Jesse F Chen/San Francisco/IBM@IBMUS, Michael Armbrust <mich...@databricks.com>, "user@spark.apache.org" <user@spark.apache.org> Date: 09/11/2015 01:00 AM Subject: RE: Re:Re:RE: Re:RE: spark 1.5 SQL slows down dramatically by 5

Re:Re: spark 1.5 SQL slows down dramatically by 50%+ compared with spark 1.4.1 SQL

2015-09-10 Thread Todd
Thanks Michael for the reply. Below are the SQL plans for 1.5 and 1.4.1. 1.5 is using SortMergeJoin, while 1.4.1 is using shuffled hash join. In this case, it seems that hash join performs better than sort merge join.
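
For readers who want to reproduce the comparison, here is a minimal sketch (not the poster's actual job) of how the two options named in this thread, spark.sql.planner.sortMergeJoin and spark.shuffle.reduceLocality.enabled, could be set in a Spark 1.5 application before printing a join's physical plan. The object name, the tiny `sales` and `items` tables, the local master, and the disabling of broadcast joins via spark.sql.autoBroadcastJoinThreshold are all assumptions made for illustration; they do not come from the thread.

```scala
import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.sql.SQLContext

// Hypothetical driver: names and data are illustrative only.
object JoinStrategyCheck {
  def main(args: Array[String]): Unit = {
    // spark.shuffle.reduceLocality.enabled is a Spark (core) setting,
    // so it goes on the SparkConf rather than on the SQLContext.
    val conf = new SparkConf()
      .setAppName("join-strategy-check")
      .setMaster("local[2]") // assumption: local master, only to keep the sketch self-contained
      .set("spark.shuffle.reduceLocality.enabled", "false")
    val sc = new SparkContext(conf)
    val sqlContext = new SQLContext(sc)
    import sqlContext.implicits._

    // Spark SQL setting: ask the 1.5 planner not to use sort merge join.
    sqlContext.setConf("spark.sql.planner.sortMergeJoin", "false")
    // Assumption: disable broadcast joins so that tiny demo tables
    // still go through a shuffle-based join.
    sqlContext.setConf("spark.sql.autoBroadcastJoinThreshold", "-1")

    // Hypothetical stand-ins for the real tables in the benchmark query.
    val sales = sc.parallelize(Seq((1, 10), (2, 20))).toDF("item_sk", "quantity")
    val items = sc.parallelize(Seq((1, "a"), (2, "b"))).toDF("item_sk", "name")

    // Print the physical plan so the chosen join operator can be inspected.
    sales.join(items, "item_sk").explain()

    sc.stop()
  }
}
```

Running the same program with spark.sql.planner.sortMergeJoin left at its default and comparing the two printed plans shows which join operator the planner picked in each case. It says nothing about timing on its own, so the query would still need to be timed on the real cluster and data.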