Re: About the Multiple Join Optimization on AsterixDB

2016-10-31 Thread mingda li
Yeah, actually, I have tested the multiple join on TPCDS dataset (1g) on UCLA‘s AsterixDB cluster (16 nodes). Two different join orders as following: SELECT COUNT(*) FROM (SELECT * FROM catalog_sales cs1 JOIN catalog_returns cr1 ON (cs1.cs_order_number = cr1.cr_order_number AND cs1.cs_item_sk = cr

Re: About the Multiple Join Optimization on AsterixDB

2016-10-31 Thread Ildar Absalyamov
As Yingyi pointed out we don't reorder joins because the framework for stats and cardinalities is not there yet. However what we can do in an meantime is to provide an interface for the statistical information, needed for join reordering, independent of the way the stats were collected (either samp

Re: About the Multiple Join Optimization on AsterixDB

2016-10-31 Thread mingda li
Hi Yingyi, I see. Thanks for your reply:-) Bests, Mingda On Mon, Oct 31, 2016 at 11:23 AM, Yingyi Bu wrote: > Mingda, > > I'm not sure how much re-ordering can be done at the Hyracks level, > i.e., the runtime level. > In the optimizer (the asterixdb/algebricks level), we don't have

Re: About the Multiple Join Optimization on AsterixDB

2016-10-31 Thread Yingyi Bu
Mingda, I'm not sure how much re-ordering can be done at the Hyracks level, i.e., the runtime level. In the optimizer (the asterixdb/algebricks level), we don't have re-ordering for joins, because: --- the cost model has not been added yet. I'm not sure about the timeline for this

About the Multiple Join Optimization on AsterixDB

2016-10-31 Thread mingda li
Dear all, Hi, I am working on multiple join on Hyracks level. I am not sure if I do the multiple join on AsterixDB, whether it will optimize the query by changing the join order or just execute according to how we write the query. I think this may not be done in Algebricks level based on rule but