[GitHub] spark pull request #16334: estimate filter cardinality

2016-12-18 Thread ron8hu
Github user ron8hu closed the pull request at:

https://github.com/apache/spark/pull/16334


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request #16334: estimate filter cardinality

2016-12-18 Thread ron8hu
GitHub user ron8hu opened a pull request:

https://github.com/apache/spark/pull/16334

estimate filter cardinality

## What changes were proposed in this pull request?

This is a WIP PR. In this version, we set up the framework to traverse 
predicate and evaluate the equality (=) expression.

## How was this patch tested?

We just have a simple test case for now. More tests need to be added.

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/ron8hu/spark filterEstimate

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/16334.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #16334


commit f0bbb43f7b27c59a3181ce428ed2bb0a7c1fc89d
Author: wangzhenhua 
Date:   2016-12-09T06:34:41Z

estimation for inner join

commit 64603b589bd1b971d2370d848f1dd19f11b52928
Author: wangzhenhua 
Date:   2016-12-09T07:30:33Z

wrong position of comments

commit 02bdf875babac7676a732a01fa41c8b8416a58d3
Author: Ron Hu 
Date:   2016-12-18T02:10:28Z

implemented first version of filter cardinality estimation

commit 6c9d06085f30dd19eb8b0a06f62742bda6ace736
Author: Ron Hu 
Date:   2016-12-18T02:15:57Z

maintained a mutable colStats in Filter class

commit 9e92b4b4c5033441f63e0e76544a1d609ac2a39e
Author: Ron Hu 
Date:   2016-12-19T00:57:00Z

Used mutableColStats to reflect changes for column stats

commit 8e77acc8630df3696dbb12453ee3f469c03b1193
Author: Ron Hu 
Date:   2016-12-19T02:15:41Z

use getOrElse to return filter stats

commit 62315985cc6ec20fb26c8ed109b33160e9f3f80b
Author: Ron Hu 
Date:   2016-12-19T03:27:40Z

added the test case




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org