Re: How to use [SHUFFLE] by default for all JOINS

2018-02-23 Thread Alexander Behm
Btw, you should also know that the following improvements in the upcoming 2.12 release might make "compute stats" more palatable on your huge tables. We'd love your feedback on COMPUTE STATS with TABLESAMPLE, in particular. COMPUTE STATS with TABLESAMPLE

Re: How to use [SHUFFLE] by default for all JOINS

2018-02-23 Thread Alexander Behm
Maybe this improvement could help. It's available since Impala 2.9. https://issues.apache.org/jira/browse/IMPALA-5381 On Fri, Feb 23, 2018 at 6:40 PM, Arya Goudarzi wrote: > Thank you Mostafa. My bad on mentioning the wrong version. We are using > 2.7 and not 1.7. We have

Re: How to use [SHUFFLE] by default for all JOINS

2018-02-23 Thread Arya Goudarzi
Thank you Mostafa. My bad on mentioning the wrong version. We are using 2.7 and not 1.7. We have upgrade in our plans and actually waiting for Impala 2.12 as it has IMPALA-5058 fixes. On Fri, Feb 23, 2018 at 6:18 PM, Mostafa Mokhtar wrote: > AFAIK there is no such flag. >

Re: How to use [SHUFFLE] by default for all JOINS

2018-02-23 Thread Mostafa Mokhtar
AFAIK there is no such flag. You are more likely to get much higher gains if you upgrade to a more recent version of Impala. https://www.slideshare.net/cloudera/performance-of-apache-impala Thanks Mostafa > On Feb 23, 2018, at 6:12 PM, Arya Goudarzi wrote: > > Hi Team,