[GitHub] spark pull request #16395: [SPARK-17075][SQL] implemented filter estimation

2017-01-11 Thread ron8hu
Github user ron8hu commented on a diff in the pull request: https://github.com/apache/spark/pull/16395#discussion_r95720653 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/statsEstimation/FilterEstimation.scala --- @@ -0,0 +1,555

[GitHub] spark pull request #16395: [SPARK-17075][SQL] implemented filter estimation

2017-01-11 Thread ron8hu
Github user ron8hu commented on a diff in the pull request: https://github.com/apache/spark/pull/16395#discussion_r95719350 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/statsEstimation/FilterEstimation.scala --- @@ -0,0 +1,555

[GitHub] spark pull request #16395: [SPARK-17075][SQL] implemented filter estimation

2017-01-11 Thread ron8hu
Github user ron8hu commented on a diff in the pull request: https://github.com/apache/spark/pull/16395#discussion_r95719141 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/statsEstimation/FilterEstimation.scala --- @@ -0,0 +1,555

[GitHub] spark pull request #16395: [SPARK-17075][SQL] implemented filter estimation

2017-01-11 Thread ron8hu
Github user ron8hu commented on a diff in the pull request: https://github.com/apache/spark/pull/16395#discussion_r95718997 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/statsEstimation/FilterEstimation.scala --- @@ -0,0 +1,555

[GitHub] spark pull request #16395: [SPARK-17075][SQL] implemented filter estimation

2017-01-11 Thread ron8hu
Github user ron8hu commented on a diff in the pull request: https://github.com/apache/spark/pull/16395#discussion_r95718952 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/statsEstimation/FilterEstimation.scala --- @@ -0,0 +1,555

[GitHub] spark pull request #16395: [SPARK-17075][SQL] implemented filter estimation

2017-01-11 Thread ron8hu
Github user ron8hu commented on a diff in the pull request: https://github.com/apache/spark/pull/16395#discussion_r95718502 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/statsEstimation/FilterEstimation.scala --- @@ -0,0 +1,555

[GitHub] spark pull request #16395: [SPARK-17075][SQL] implemented filter estimation

2017-01-11 Thread ron8hu
Github user ron8hu commented on a diff in the pull request: https://github.com/apache/spark/pull/16395#discussion_r95717900 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/statsEstimation/FilterEstimation.scala --- @@ -0,0 +1,555

[GitHub] spark pull request #16395: [SPARK-17075][SQL] implemented filter estimation

2017-01-11 Thread ron8hu
Github user ron8hu commented on a diff in the pull request: https://github.com/apache/spark/pull/16395#discussion_r95717298 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/statsEstimation/EstimationUtils.scala --- @@ -52,3 +56,12 @@ object

[GitHub] spark pull request #16395: [SPARK-17075][SQL] implemented filter estimation

2017-01-11 Thread ron8hu
Github user ron8hu commented on a diff in the pull request: https://github.com/apache/spark/pull/16395#discussion_r95716353 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/statsEstimation/FilterEstimation.scala --- @@ -0,0 +1,555

[GitHub] spark pull request #16395: [SPARK-17075][SQL] implemented filter estimation

2017-01-11 Thread ron8hu
Github user ron8hu commented on a diff in the pull request: https://github.com/apache/spark/pull/16395#discussion_r95710436 --- Diff: sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/statsEstimation/FilterEstimationSuite.scala --- @@ -0,0 +1,173

[GitHub] spark pull request #16395: [SPARK-17075][SQL] implemented filter estimation

2017-01-11 Thread ron8hu
Github user ron8hu commented on a diff in the pull request: https://github.com/apache/spark/pull/16395#discussion_r95709024 --- Diff: sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/statsEstimation/FilterEstimationSuite.scala --- @@ -0,0 +1,173

[GitHub] spark pull request #16395: [SPARK-17075][SQL] implemented filter estimation

2017-01-11 Thread ron8hu
Github user ron8hu commented on a diff in the pull request: https://github.com/apache/spark/pull/16395#discussion_r95707722 --- Diff: sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/statsEstimation/FilterEstimationSuite.scala --- @@ -0,0 +1,173

[GitHub] spark pull request #16395: [SPARK-17075][SQL] implemented filter estimation

2017-01-11 Thread ron8hu
Github user ron8hu commented on a diff in the pull request: https://github.com/apache/spark/pull/16395#discussion_r95707748 --- Diff: sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/statsEstimation/FilterEstimationSuite.scala --- @@ -0,0 +1,173

[GitHub] spark pull request #16395: [SPARK-17075][SQL] implemented filter estimation

2017-01-11 Thread ron8hu
Github user ron8hu commented on a diff in the pull request: https://github.com/apache/spark/pull/16395#discussion_r95707644 --- Diff: sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/statsEstimation/FilterEstimationSuite.scala --- @@ -0,0 +1,173

[GitHub] spark pull request #16395: [SPARK-17075][SQL] implemented filter estimation

2017-01-11 Thread ron8hu
Github user ron8hu commented on a diff in the pull request: https://github.com/apache/spark/pull/16395#discussion_r95695560 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/statsEstimation/FilterEstimation.scala --- @@ -0,0 +1,555

[GitHub] spark pull request #16395: [SPARK-17075][SQL] implemented filter estimation

2017-01-11 Thread ron8hu
Github user ron8hu commented on a diff in the pull request: https://github.com/apache/spark/pull/16395#discussion_r95691485 --- Diff: sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/statsEstimation/FilterEstimationSuite.scala --- @@ -0,0 +1,173

[GitHub] spark pull request #16395: [SPARK-17075][SQL] implemented filter estimation

2017-01-11 Thread ron8hu
Github user ron8hu commented on a diff in the pull request: https://github.com/apache/spark/pull/16395#discussion_r95690239 --- Diff: sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/statsEstimation/FilterEstimationSuite.scala --- @@ -0,0 +1,173

[GitHub] spark issue #16395: [SPARK-17075][SQL] implemented filter estimation

2017-01-10 Thread ron8hu
Github user ron8hu commented on the issue: https://github.com/apache/spark/pull/16395 cc @rxin @wzhfy Have updated code based on rxin's comments. Please review again. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well

[GitHub] spark pull request #16395: [SPARK-17075][SQL] implemented filter estimation

2017-01-10 Thread ron8hu
Github user ron8hu commented on a diff in the pull request: https://github.com/apache/spark/pull/16395#discussion_r95511085 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/estimation/FilterEstimationSuite.scala --- @@ -0,0 +1,226 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request #16395: [SPARK-17075][SQL] implemented filter estimation

2017-01-08 Thread ron8hu
Github user ron8hu commented on a diff in the pull request: https://github.com/apache/spark/pull/16395#discussion_r95096602 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/estimation/FilterEstimation.scala --- @@ -0,0 +1,479

[GitHub] spark pull request #16395: [SPARK-17075][SQL] implemented filter estimation

2017-01-08 Thread ron8hu
Github user ron8hu commented on a diff in the pull request: https://github.com/apache/spark/pull/16395#discussion_r95096610 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/estimation/FilterEstimation.scala --- @@ -0,0 +1,479

[GitHub] spark pull request #16395: [SPARK-17075][SQL] implemented filter estimation

2017-01-08 Thread ron8hu
Github user ron8hu commented on a diff in the pull request: https://github.com/apache/spark/pull/16395#discussion_r95096571 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/estimation/FilterEstimation.scala --- @@ -0,0 +1,479

[GitHub] spark pull request #16395: [SPARK-17075][SQL] implemented filter estimation

2017-01-08 Thread ron8hu
Github user ron8hu commented on a diff in the pull request: https://github.com/apache/spark/pull/16395#discussion_r95096457 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/estimation/FilterEstimation.scala --- @@ -0,0 +1,479

[GitHub] spark pull request #16395: [SPARK-17075][SQL] implemented filter estimation

2017-01-08 Thread ron8hu
Github user ron8hu commented on a diff in the pull request: https://github.com/apache/spark/pull/16395#discussion_r95096432 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/estimation/FilterEstimation.scala --- @@ -0,0 +1,479

[GitHub] spark pull request #16395: [SPARK-17075][SQL] implemented filter estimation

2017-01-08 Thread ron8hu
Github user ron8hu commented on a diff in the pull request: https://github.com/apache/spark/pull/16395#discussion_r95096451 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/estimation/FilterEstimation.scala --- @@ -0,0 +1,479

[GitHub] spark pull request #16395: [SPARK-17075][SQL] implemented filter estimation

2017-01-08 Thread ron8hu
Github user ron8hu commented on a diff in the pull request: https://github.com/apache/spark/pull/16395#discussion_r95096418 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/estimation/FilterEstimation.scala --- @@ -0,0 +1,479

[GitHub] spark issue #16395: [SPARK-17075][SQL] implemented filter estimation

2017-01-04 Thread ron8hu
Github user ron8hu commented on the issue: https://github.com/apache/spark/pull/16395 cc @wzhfy @rxin @srinathshankar @hvanhovell @cloud-fan Happy New Year! This PR is ready for code review. --- If your project is set up for it, you can reply to this email and have your reply

[GitHub] spark issue #16395: [SPARK-17075][SQL][WIP] implemented filter estimation

2016-12-23 Thread ron8hu
Github user ron8hu commented on the issue: https://github.com/apache/spark/pull/16395 cc @wzhfy @rxin @hvanhovell @cloud-fan --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark pull request #16395: implemented first version of filter estimation

2016-12-23 Thread ron8hu
GitHub user ron8hu opened a pull request: https://github.com/apache/spark/pull/16395 implemented first version of filter estimation ## What changes were proposed in this pull request? This is a WIP PR. In this version, we set up the framework to traverse predicate

[GitHub] spark issue #16334: estimate filter cardinality

2016-12-18 Thread ron8hu
Github user ron8hu commented on the issue: https://github.com/apache/spark/pull/16334 Sorry. This is a mistake. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark pull request #16334: estimate filter cardinality

2016-12-18 Thread ron8hu
Github user ron8hu closed the pull request at: https://github.com/apache/spark/pull/16334 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature

[GitHub] spark pull request #16334: estimate filter cardinality

2016-12-18 Thread ron8hu
GitHub user ron8hu opened a pull request: https://github.com/apache/spark/pull/16334 estimate filter cardinality ## What changes were proposed in this pull request? This is a WIP PR. In this version, we set up the framework to traverse predicate and evaluate the equality

[GitHub] spark issue #16333: Filter estimate

2016-12-18 Thread ron8hu
Github user ron8hu commented on the issue: https://github.com/apache/spark/pull/16333 This is a mistake. I point to the wrong repository. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request #16333: Filter estimate

2016-12-18 Thread ron8hu
Github user ron8hu closed the pull request at: https://github.com/apache/spark/pull/16333 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature

[GitHub] spark pull request #16333: Filter estimate

2016-12-18 Thread ron8hu
GitHub user ron8hu reopened a pull request: https://github.com/apache/spark/pull/16333 Filter estimate ## What changes were proposed in this pull request? This is a WIP PR. In this version, we set up the framework to traverse predicate and evaluate the equality

[GitHub] spark pull request #16333: Filter estimate

2016-12-18 Thread ron8hu
Github user ron8hu closed the pull request at: https://github.com/apache/spark/pull/16333 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature

[GitHub] spark issue #16333: Filter estimate

2016-12-18 Thread ron8hu
Github user ron8hu commented on the issue: https://github.com/apache/spark/pull/16333 cc @wzhfy Please preview it and make comments. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark issue #16333: Filter estimate

2016-12-18 Thread ron8hu
Github user ron8hu commented on the issue: https://github.com/apache/spark/pull/16333 cc @wzhfy --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] spark pull request #16333: Filter estimate

2016-12-18 Thread ron8hu
GitHub user ron8hu opened a pull request: https://github.com/apache/spark/pull/16333 Filter estimate ## What changes were proposed in this pull request? This is a WIP PR. In this version, we set up the framework to traverse predicate and evaluate the equality

[GitHub] spark pull request #15637: [SPARK-18000] [SQL] Aggregation function for comp...

2016-11-04 Thread ron8hu
Github user ron8hu commented on a diff in the pull request: https://github.com/apache/spark/pull/15637#discussion_r86647571 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/MapAggregate.scala --- @@ -0,0 +1,310 @@ +/* + * Licensed

[GitHub] spark pull request #15637: [SPARK-18000] [SQL] Aggregation function for comp...

2016-11-03 Thread ron8hu
Github user ron8hu commented on a diff in the pull request: https://github.com/apache/spark/pull/15637#discussion_r86415255 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/MapAggregate.scala --- @@ -0,0 +1,332 @@ +/* + * Licensed

[GitHub] spark pull request #15637: [SPARK-18000] [SQL] Aggregation function for comp...

2016-11-03 Thread ron8hu
Github user ron8hu commented on a diff in the pull request: https://github.com/apache/spark/pull/15637#discussion_r86413803 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/MapAggregate.scala --- @@ -0,0 +1,332 @@ +/* + * Licensed

[GitHub] spark pull request #15637: [SPARK-18000] [SQL] Aggregation function for comp...

2016-11-03 Thread ron8hu
Github user ron8hu commented on a diff in the pull request: https://github.com/apache/spark/pull/15637#discussion_r86411252 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/MapAggregate.scala --- @@ -0,0 +1,332 @@ +/* + * Licensed

[GitHub] spark pull request #15637: [SPARK-18000] [SQL] Aggregation function for comp...

2016-11-03 Thread ron8hu
Github user ron8hu commented on a diff in the pull request: https://github.com/apache/spark/pull/15637#discussion_r86411132 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/MapAggregate.scala --- @@ -0,0 +1,332 @@ +/* + * Licensed

[GitHub] spark issue #15637: [SPARK-18000] [SQL] Aggregation function for computing e...

2016-11-03 Thread ron8hu
Github user ron8hu commented on the issue: https://github.com/apache/spark/pull/15637 test --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] spark pull request #15637: [SPARK-18000] [SQL] Aggregation function for comp...

2016-11-03 Thread ron8hu
Github user ron8hu commented on a diff in the pull request: https://github.com/apache/spark/pull/15637#discussion_r86410847 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/MapAggregate.scala --- @@ -0,0 +1,332 @@ +/* + * Licensed

[GitHub] spark pull request #15637: [SPARK-18000] [SQL] Aggregation function for comp...

2016-11-03 Thread ron8hu
Github user ron8hu commented on a diff in the pull request: https://github.com/apache/spark/pull/15637#discussion_r86408951 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/MapAggregate.scala --- @@ -0,0 +1,332 @@ +/* + * Licensed

[GitHub] spark pull request #15637: [SPARK-18000] [SQL] Aggregation function for comp...

2016-11-03 Thread ron8hu
Github user ron8hu commented on a diff in the pull request: https://github.com/apache/spark/pull/15637#discussion_r86409288 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/MapAggregate.scala --- @@ -0,0 +1,332 @@ +/* + * Licensed

[GitHub] spark pull request #15637: [SPARK-18000] [SQL] Aggregation function for comp...

2016-11-03 Thread ron8hu
Github user ron8hu commented on a diff in the pull request: https://github.com/apache/spark/pull/15637#discussion_r86409090 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/MapAggregate.scala --- @@ -0,0 +1,332 @@ +/* + * Licensed

[GitHub] spark issue #15297: [WIP][SPARK-9862]Handling data skew

2016-10-13 Thread ron8hu
Github user ron8hu commented on the issue: https://github.com/apache/spark/pull/15297 The design note of this PR has been posted at jira page. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark issue #15297: [WIP][SPARK-9862]Handling data skew

2016-10-13 Thread ron8hu
Github user ron8hu commented on the issue: https://github.com/apache/spark/pull/15297 cc @rxin @hvanhovell Can you review and comment this PR? Thanks. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request #15090: [SPARK-17073] [SQL] generate column-level statist...

2016-09-19 Thread ron8hu
Github user ron8hu commented on a diff in the pull request: https://github.com/apache/spark/pull/15090#discussion_r79508043 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/AnalyzeColumnCommand.scala --- @@ -0,0 +1,159 @@ +/* + * Licensed

<    1   2