[GitHub] spark issue #14876: showcase, DO NOT MERGE

2016-08-31 Thread cloud-fan
Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/14876 closing, @maropu will take over --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark issue #14876: showcase, DO NOT MERGE

2016-08-31 Thread cloud-fan
Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/14876 yea, pushing down partial aggregate below exchange is a good idea, but I think it's out of the scope of SPARK-12978, which is aim to remove unnecessary partial aggregate right? --- If your

[GitHub] spark issue #14876: showcase, DO NOT MERGE

2016-08-31 Thread maropu
Github user maropu commented on the issue: https://github.com/apache/spark/pull/14876 yea, I thinks so. I like the approach in this pr. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this

[GitHub] spark issue #14876: showcase, DO NOT MERGE

2016-08-31 Thread maropu
Github user maropu commented on the issue: https://github.com/apache/spark/pull/14876 On the other hand, when caching the already-partitioned input table, we cannot push-down them; ``` (0 to 1000).map(x => (x % 2, x.toString)).toDF("a",

[GitHub] spark issue #14876: showcase, DO NOT MERGE

2016-08-31 Thread maropu
Github user maropu commented on the issue: https://github.com/apache/spark/pull/14876 I found that we need to push-down partial aggregation below exchange operators instead of merging them? For example, in the spark v2.0 branch, ``` (0 to 1000).map(x => (x % 2,

[GitHub] spark issue #14876: showcase, DO NOT MERGE

2016-08-30 Thread maropu
Github user maropu commented on the issue: https://github.com/apache/spark/pull/14876 Thank you for your concrete example! I'll check in hours. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not

[GitHub] spark issue #14876: showcase, DO NOT MERGE

2016-08-30 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14876 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #14876: showcase, DO NOT MERGE

2016-08-30 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14876 **[Test build #64660 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64660/consoleFull)** for PR 14876 at commit

[GitHub] spark issue #14876: showcase, DO NOT MERGE

2016-08-30 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14876 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64660/ Test FAILed. ---

[GitHub] spark issue #14876: showcase, DO NOT MERGE

2016-08-30 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14876 **[Test build #64660 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64660/consoleFull)** for PR 14876 at commit

[GitHub] spark issue #14876: showcase, DO NOT MERGE

2016-08-30 Thread cloud-fan
Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/14876 cc @maropu @hvanhovell @yhuai @liancheng @clockfly --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have