[GitHub] spark issue #16985: [SPARK-19122][SQL] Unnecessary shuffle+sort added if joi...

2017-05-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16985 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76704/ Test PASSed. ---

[GitHub] spark issue #16985: [SPARK-19122][SQL] Unnecessary shuffle+sort added if joi...

2017-05-09 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16985 **[Test build #76704 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76704/testReport)** for PR 16985 at commit [`4f76bd0`](https://github.com/apache/spark/commit/4

[GitHub] spark issue #17916: [SPARK-20590][SQL] Use Spark internal datasource if mult...

2017-05-09 Thread HyukjinKwon
Github user HyukjinKwon commented on the issue: https://github.com/apache/spark/pull/17916 Thanks for approving this approach. I will handle the comment soon. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your projec

[GitHub] spark issue #17847: [SPARK-20590] Map default input data source formats to i...

2017-05-09 Thread sameeragarwal
Github user sameeragarwal commented on the issue: https://github.com/apache/spark/pull/17847 I'm closing this in favor of https://github.com/apache/spark/pull/17916

[GitHub] spark pull request #17847: [SPARK-20590] Map default input data source forma...

2017-05-09 Thread sameeragarwal
Github user sameeragarwal closed the pull request at: https://github.com/apache/spark/pull/17847

[GitHub] spark issue #17916: [SPARK-20590][SQL] Use Spark internal datasource if mult...

2017-05-09 Thread sameeragarwal
Github user sameeragarwal commented on the issue: https://github.com/apache/spark/pull/17916 LGTM

[GitHub] spark pull request #17916: [SPARK-20590][SQL] Use Spark internal datasource ...

2017-05-09 Thread sameeragarwal
Github user sameeragarwal commented on a diff in the pull request: https://github.com/apache/spark/pull/17916#discussion_r115626517 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/DataSource.scala --- @@ -570,10 +570,20 @@ object DataSource {

[GitHub] spark issue #17633: [SPARK-20331][SQL] Enhanced Hive partition pruning predi...

2017-05-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17633 Merged build finished. Test PASSed.

[GitHub] spark issue #17633: [SPARK-20331][SQL] Enhanced Hive partition pruning predi...

2017-05-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17633 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76706/ Test PASSed. ---

[GitHub] spark issue #17633: [SPARK-20331][SQL] Enhanced Hive partition pruning predi...

2017-05-09 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17633 **[Test build #76706 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76706/testReport)** for PR 17633 at commit [`a4cdfb0`](https://github.com/apache/spark/commit/a

[GitHub] spark pull request #17916: [SPARK-20590][SQL] Use Spark internal datasource ...

2017-05-09 Thread HyukjinKwon
Github user HyukjinKwon commented on a diff in the pull request: https://github.com/apache/spark/pull/17916#discussion_r115625060 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/sources/DDLSourceLoadSuite.scala --- @@ -19,26 +19,39 @@ package org.apache.spark.sql.sources

[GitHub] spark issue #17916: [SPARK-20590][SQL] Use Spark internal datasource if mult...

2017-05-09 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17916 **[Test build #76709 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76709/testReport)** for PR 17916 at commit [`4450da7`](https://github.com/apache/spark/commit/44

[GitHub] spark issue #15821: [SPARK-13534][PySpark] Using Apache Arrow to increase pe...

2017-05-09 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15821 **[Test build #76710 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76710/testReport)** for PR 15821 at commit [`934c147`](https://github.com/apache/spark/commit/93

[GitHub] spark issue #17925: [SPARK-20205][core] Make sure StageInfo is updated befor...

2017-05-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17925 Merged build finished. Test PASSed.

[GitHub] spark issue #17925: [SPARK-20205][core] Make sure StageInfo is updated befor...

2017-05-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17925 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76697/ Test PASSed. ---

[GitHub] spark issue #17925: [SPARK-20205][core] Make sure StageInfo is updated befor...

2017-05-09 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17925 **[Test build #76697 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76697/testReport)** for PR 17925 at commit [`0d8f717`](https://github.com/apache/spark/commit/0

[GitHub] spark issue #17666: [SPARK-20311][SQL] Support aliases for table value funct...

2017-05-09 Thread maropu
Github user maropu commented on the issue: https://github.com/apache/spark/pull/17666 @yhuai okay, thanks for letting me know! I'll make a new pr to fix.

[GitHub] spark issue #17680: [SPARK-20364][SQL] Support Parquet predicate pushdown on...

2017-05-09 Thread ash211
Github user ash211 commented on the issue: https://github.com/apache/spark/pull/17680 Are there any comments on this PR or is it ready to be merged?

[GitHub] spark issue #17927: [SPARK-20685] Fix BatchPythonEvaluation bug in case of s...

2017-05-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17927 Merged build finished. Test PASSed.

[GitHub] spark issue #17927: [SPARK-20685] Fix BatchPythonEvaluation bug in case of s...

2017-05-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17927 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76707/ Test PASSed. ---

[GitHub] spark issue #17927: [SPARK-20685] Fix BatchPythonEvaluation bug in case of s...

2017-05-09 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17927 **[Test build #76707 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76707/testReport)** for PR 17927 at commit [`17e69b5`](https://github.com/apache/spark/commit/1

[GitHub] spark issue #17916: [SPARK-20590][SQL] Use Spark internal datasource if mult...

2017-05-09 Thread HyukjinKwon
Github user HyukjinKwon commented on the issue: https://github.com/apache/spark/pull/17916 Yeah. I think it should probably also check that the length is exactly one, with another test as well; checking this would do no harm.

[GitHub] spark issue #15821: [SPARK-13534][PySpark] Using Apache Arrow to increase pe...

2017-05-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15821 Merged build finished. Test FAILed.

[GitHub] spark issue #15821: [SPARK-13534][PySpark] Using Apache Arrow to increase pe...

2017-05-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15821 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76708/ Test FAILed. ---

[GitHub] spark issue #15821: [SPARK-13534][PySpark] Using Apache Arrow to increase pe...

2017-05-09 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15821 **[Test build #76708 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76708/testReport)** for PR 15821 at commit [`a4d6057`](https://github.com/apache/spark/commit/a

[GitHub] spark issue #15821: [SPARK-13534][PySpark] Using Apache Arrow to increase pe...

2017-05-09 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15821 **[Test build #76708 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76708/testReport)** for PR 15821 at commit [`a4d6057`](https://github.com/apache/spark/commit/a4

[GitHub] spark issue #17916: [SPARK-20590][SQL] Use Spark internal datasource if mult...

2017-05-09 Thread sameeragarwal
Github user sameeragarwal commented on the issue: https://github.com/apache/spark/pull/17916 Thanks @HyukjinKwon, I like this approach better! One limitation of this patch however is that if there are ever two internal datasources in Spark with the same `shortName`, we might'v

[GitHub] spark issue #17916: [SPARK-20590][SQL] Use Spark internal datasource if mult...

2017-05-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17916 Merged build finished. Test PASSed.

[GitHub] spark issue #17916: [SPARK-20590][SQL] Use Spark internal datasource if mult...

2017-05-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17916 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76696/ Test PASSed. ---

[GitHub] spark issue #17916: [SPARK-20590][SQL] Use Spark internal datasource if mult...

2017-05-09 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17916 **[Test build #76696 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76696/testReport)** for PR 17916 at commit [`8c40eab`](https://github.com/apache/spark/commit/8

[GitHub] spark issue #17927: [SPARK-20685] Fix BatchPythonEvaluation bug in case of s...

2017-05-09 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17927 **[Test build #76707 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76707/testReport)** for PR 17927 at commit [`17e69b5`](https://github.com/apache/spark/commit/17

[GitHub] spark pull request #17927: [SPARK-20685] Fix BatchPythonEvaluation bug in ca...

2017-05-09 Thread JoshRosen
GitHub user JoshRosen opened a pull request: https://github.com/apache/spark/pull/17927 [SPARK-20685] Fix BatchPythonEvaluation bug in case of single UDF w/ repeated arg. ## What changes were proposed in this pull request? There's a latent corner-case bug in PySpark UDF ev

[GitHub] spark issue #17906: [SPARK-20665][SQL]"Bround" function return NULL

2017-05-09 Thread dongjoon-hyun
Github user dongjoon-hyun commented on the issue: https://github.com/apache/spark/pull/17906 Hi, @10110346 . What about `round`?

[GitHub] spark issue #17893: [SPARK-20633][SQL] FileFormatWriter wrap the FetchFailed...

2017-05-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17893 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76689/ Test FAILed. ---

[GitHub] spark issue #17893: [SPARK-20633][SQL] FileFormatWriter wrap the FetchFailed...

2017-05-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17893 Merged build finished. Test FAILed.

[GitHub] spark issue #17893: [SPARK-20633][SQL] FileFormatWriter wrap the FetchFailed...

2017-05-09 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17893 **[Test build #76689 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76689/testReport)** for PR 17893 at commit [`c869d9c`](https://github.com/apache/spark/commit/c

[GitHub] spark issue #17904: [SPARK-20630] [Web UI] Fixed column visibility in Execut...

2017-05-09 Thread ajbozarth
Github user ajbozarth commented on the issue: https://github.com/apache/spark/pull/17904 @srowen passed test, this good to merge?

[GitHub] spark pull request #17896: [SPARK-20373][SQL][SS] Batch queries with 'Datase...

2017-05-09 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/17896

[GitHub] spark issue #17904: [SPARK-20630] [Web UI] Fixed column visibility in Execut...

2017-05-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17904 Merged build finished. Test PASSed.

[GitHub] spark issue #17904: [SPARK-20630] [Web UI] Fixed column visibility in Execut...

2017-05-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17904 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76692/ Test PASSed. ---

[GitHub] spark issue #17904: [SPARK-20630] [Web UI] Fixed column visibility in Execut...

2017-05-09 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17904 **[Test build #76692 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76692/testReport)** for PR 17904 at commit [`766bfb0`](https://github.com/apache/spark/commit/7

[GitHub] spark issue #17896: [SPARK-20373][SQL][SS] Batch queries with 'Dataset/DataF...

2017-05-09 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/17896 @uncleGen Thanks! LGTM. Merging to master and 2.2.

[GitHub] spark pull request #17633: [SPARK-20331][SQL] Enhanced Hive partition prunin...

2017-05-09 Thread mallman
Github user mallman commented on a diff in the pull request: https://github.com/apache/spark/pull/17633#discussion_r115614088 --- Diff: sql/hive/src/test/scala/org/apache/spark/sql/hive/client/HiveClientSuite.scala --- @@ -43,19 +47,159 @@ class HiveClientSuite extends SparkFunSui

[GitHub] spark issue #17926: [MINOR][SQL][PYSPARK] Allow user to specify numSlices in...

2017-05-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17926 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76701/ Test PASSed. ---

[GitHub] spark issue #17926: [MINOR][SQL][PYSPARK] Allow user to specify numSlices in...

2017-05-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17926 Merged build finished. Test PASSed.

[GitHub] spark issue #17926: [MINOR][SQL][PYSPARK] Allow user to specify numSlices in...

2017-05-09 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17926 **[Test build #76701 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76701/testReport)** for PR 17926 at commit [`4a9d58d`](https://github.com/apache/spark/commit/4

[GitHub] spark issue #17896: [SPARK-20373][SQL][SS] Batch queries with 'Dataset/DataF...

2017-05-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17896 Merged build finished. Test PASSed.

[GitHub] spark issue #17896: [SPARK-20373][SQL][SS] Batch queries with 'Dataset/DataF...

2017-05-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17896 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76694/ Test PASSed. ---

[GitHub] spark issue #17633: [SPARK-20331][SQL] Enhanced Hive partition pruning predi...

2017-05-09 Thread mallman
Github user mallman commented on the issue: https://github.com/apache/spark/pull/17633 > Could you check whether there exists any limit on predicate we can pass to Hive? There are, and I found something in the way of documentation or a grammar a while back that specifies the

[GitHub] spark issue #17644: [SPARK-17729] [SQL] Enable creating hive bucketed tables

2017-05-09 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17644 **[Test build #76703 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76703/testReport)** for PR 17644 at commit [`8aaff4c`](https://github.com/apache/spark/commit/8

[GitHub] spark issue #17644: [SPARK-17729] [SQL] Enable creating hive bucketed tables

2017-05-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17644 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76703/ Test FAILed. ---

[GitHub] spark issue #17644: [SPARK-17729] [SQL] Enable creating hive bucketed tables

2017-05-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17644 Merged build finished. Test FAILed.

[GitHub] spark issue #17896: [SPARK-20373][SQL][SS] Batch queries with 'Dataset/DataF...

2017-05-09 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17896 **[Test build #76694 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76694/testReport)** for PR 17896 at commit [`5eb7dd4`](https://github.com/apache/spark/commit/5

[GitHub] spark issue #17633: [SPARK-20331][SQL] Enhanced Hive partition pruning predi...

2017-05-09 Thread mallman
Github user mallman commented on the issue: https://github.com/apache/spark/pull/17633 I've pushed a new commit removing the logic for handling "foldables", since these are evaluated earlier in planning. I've also removed the modifications I made to `FiltersSuite.scala` and

[GitHub] spark issue #17924: [SPARK-20682][SQL] Support a new faster ORC data source ...

2017-05-09 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17924 **[Test build #76705 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76705/testReport)** for PR 17924 at commit [`85ef731`](https://github.com/apache/spark/commit/85

[GitHub] spark issue #17633: [SPARK-20331][SQL] Enhanced Hive partition pruning predi...

2017-05-09 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17633 **[Test build #76706 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76706/testReport)** for PR 17633 at commit [`a4cdfb0`](https://github.com/apache/spark/commit/a4

[GitHub] spark issue #17666: [SPARK-20311][SQL] Support aliases for table value funct...

2017-05-09 Thread yhuai
Github user yhuai commented on the issue: https://github.com/apache/spark/pull/17666 I have reverted this change from both master and branch-2.2. I have reopened the jira.

[GitHub] spark issue #17923: [SPARK-20591][WEB UI] Succeeded tasks num not equal in a...

2017-05-09 Thread ajbozarth
Github user ajbozarth commented on the issue: https://github.com/apache/spark/pull/17923 This is a rollback of an old Spark 1.1 blocker by @pwendell [SPARK-3020](https://issues.apache.org/jira/browse/SPARK-3020). It seems this was intentionally done this way.

[GitHub] spark issue #16985: [SPARK-19122][SQL] Unnecessary shuffle+sort added if joi...

2017-05-09 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16985 **[Test build #76702 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76702/testReport)** for PR 16985 at commit [`4507511`](https://github.com/apache/spark/commit/4

[GitHub] spark issue #16985: [SPARK-19122][SQL] Unnecessary shuffle+sort added if joi...

2017-05-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16985 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76702/ Test FAILed. ---

[GitHub] spark issue #16985: [SPARK-19122][SQL] Unnecessary shuffle+sort added if joi...

2017-05-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16985 Merged build finished. Test FAILed.

[GitHub] spark issue #16985: [SPARK-19122][SQL] Unnecessary shuffle+sort added if joi...

2017-05-09 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16985 **[Test build #76704 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76704/testReport)** for PR 16985 at commit [`4f76bd0`](https://github.com/apache/spark/commit/4f

[GitHub] spark issue #17644: [SPARK-17729] [SQL] Enable creating hive bucketed tables

2017-05-09 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17644 **[Test build #76703 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76703/testReport)** for PR 17644 at commit [`8aaff4c`](https://github.com/apache/spark/commit/8a

[GitHub] spark issue #17666: [SPARK-20311][SQL] Support aliases for table value funct...

2017-05-09 Thread yhuai
Github user yhuai commented on the issue: https://github.com/apache/spark/pull/17666 I am going to revert this PR from master and branch-2.2.

[GitHub] spark issue #15009: [SPARK-17443][SPARK-11035] Stop Spark Application if lau...

2017-05-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15009 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76691/ Test PASSed. ---

[GitHub] spark issue #15009: [SPARK-17443][SPARK-11035] Stop Spark Application if lau...

2017-05-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15009 Merged build finished. Test PASSed.

[GitHub] spark issue #15009: [SPARK-17443][SPARK-11035] Stop Spark Application if lau...

2017-05-09 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15009 **[Test build #76691 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76691/testReport)** for PR 15009 at commit [`2996fb1`](https://github.com/apache/spark/commit/2

[GitHub] spark issue #16985: [SPARK-19122][SQL] Unnecessary shuffle+sort added if joi...

2017-05-09 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16985 **[Test build #76702 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76702/testReport)** for PR 16985 at commit [`4507511`](https://github.com/apache/spark/commit/45

[GitHub] spark issue #17666: [SPARK-20311][SQL] Support aliases for table value funct...

2017-05-09 Thread yhuai
Github user yhuai commented on the issue: https://github.com/apache/spark/pull/17666 @maropu Sorry. I think this PR introduces a regression. ``` scala> spark.sql("select * from range(1, 10) cross join range(1, 10)").explain == Physical Plan == org.apache.spark.sql.

[GitHub] spark issue #17926: [MINOR][SQL][PYSPARK] Allow user to specify numSlices in...

2017-05-09 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17926 **[Test build #76701 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76701/testReport)** for PR 17926 at commit [`4a9d58d`](https://github.com/apache/spark/commit/4a

[GitHub] spark issue #17924: [SPARK-20682][SQL] Support a new faster ORC data source ...

2017-05-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17924 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76699/ Test FAILed. ---

[GitHub] spark issue #17924: [SPARK-20682][SQL] Support a new faster ORC data source ...

2017-05-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17924 Merged build finished. Test FAILed.

[GitHub] spark issue #17924: [SPARK-20682][SQL] Support a new faster ORC data source ...

2017-05-09 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17924 **[Test build #76699 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76699/testReport)** for PR 17924 at commit [`4607e0e`](https://github.com/apache/spark/commit/4

[GitHub] spark issue #17926: [MINOR][SQL][PYSPARK] Allow user to specify numSlices in...

2017-05-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17926 Merged build finished. Test FAILed.

[GitHub] spark issue #17926: [MINOR][SQL][PYSPARK] Allow user to specify numSlices in...

2017-05-09 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17926 **[Test build #76700 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76700/testReport)** for PR 17926 at commit [`6ef9fdd`](https://github.com/apache/spark/commit/6

[GitHub] spark issue #17926: [MINOR][SQL][PYSPARK] Allow user to specify numSlices in...

2017-05-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17926 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76700/ Test FAILed. ---

[GitHub] spark issue #17926: [MINOR][SQL][PYSPARK] Allow user to specify numSlices in...

2017-05-09 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17926 **[Test build #76700 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76700/testReport)** for PR 17926 at commit [`6ef9fdd`](https://github.com/apache/spark/commit/6e

[GitHub] spark issue #17924: [SPARK-20682][SQL] Support a new faster ORC data source ...

2017-05-09 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17924 **[Test build #76699 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76699/testReport)** for PR 17924 at commit [`4607e0e`](https://github.com/apache/spark/commit/46

[GitHub] spark issue #17924: [SPARK-20682][SQL] Support a new faster ORC data source ...

2017-05-09 Thread dongjoon-hyun
Github user dongjoon-hyun commented on the issue: https://github.com/apache/spark/pull/17924 Retest this please.

[GitHub] spark issue #17899: [SPARK-20636] Add new optimization rule to flip adjacent...

2017-05-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17899 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76690/ Test PASSed. ---

[GitHub] spark issue #17899: [SPARK-20636] Add new optimization rule to flip adjacent...

2017-05-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17899 Merged build finished. Test PASSed.

[GitHub] spark issue #17899: [SPARK-20636] Add new optimization rule to flip adjacent...

2017-05-09 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17899 **[Test build #76690 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76690/testReport)** for PR 17899 at commit [`1ab81ca`](https://github.com/apache/spark/commit/1

[GitHub] spark issue #17926: [MINOR][SQL][PYSPARK] Allow user to specify numSlices in...

2017-05-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17926 Merged build finished. Test FAILed.

[GitHub] spark issue #17926: [MINOR][SQL][PYSPARK] Allow user to specify numSlices in...

2017-05-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17926 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76698/ Test FAILed. ---

[GitHub] spark issue #17926: [MINOR][SQL][PYSPARK] Allow user to specify numSlices in...

2017-05-09 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17926 **[Test build #76698 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76698/testReport)** for PR 17926 at commit [`c9a6348`](https://github.com/apache/spark/commit/c

[GitHub] spark issue #17926: [MINOR][SQL][PYSPARK] Allow user to specify numSlices in...

2017-05-09 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17926 **[Test build #76698 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76698/testReport)** for PR 17926 at commit [`c9a6348`](https://github.com/apache/spark/commit/c9

[GitHub] spark issue #17926: [MINOR][SQL][PYSPARK] Allow user to specify numSlices in...

2017-05-09 Thread gatorsmile
Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/17926 ok to test

[GitHub] spark issue #17926: [MINOR][SQL][PYSPARK] Allow user to specify numSlices in...

2017-05-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17926 Can one of the admins verify this patch?

[GitHub] spark pull request #17926: [MINOR][SQL][PYSPARK] Allow user to specify numSl...

2017-05-09 Thread patrick-nicholson
GitHub user patrick-nicholson opened a pull request: https://github.com/apache/spark/pull/17926 [MINOR][SQL][PYSPARK] Allow user to specify numSlices in SparkSession.createDataFrame ## What changes were proposed in this pull request? In my experience, pushing `pandas.DataFr

[GitHub] spark issue #17854: [SPARK-20564][Deploy] Reduce massive executor failures w...

2017-05-09 Thread vanzin
Github user vanzin commented on the issue: https://github.com/apache/spark/pull/17854 The reason `spark.yarn.containerLauncherMaxThreads` does not work here is that it only controls how many threads simultaneously send a container start command to YARN; that is usually a much qu

[GitHub] spark issue #17925: [SPARK-20205][core] Make sure StageInfo is updated befor...

2017-05-09 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17925 **[Test build #76697 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76697/testReport)** for PR 17925 at commit [`0d8f717`](https://github.com/apache/spark/commit/0d

[GitHub] spark pull request #17925: [SPARK-20205][core] Make sure StageInfo is update...

2017-05-09 Thread vanzin
GitHub user vanzin opened a pull request: https://github.com/apache/spark/pull/17925 [SPARK-20205][core] Make sure StageInfo is updated before sending event. The DAGScheduler was sending a "stage submitted" event before it properly updated the event's information. This meant that

[GitHub] spark issue #17925: [SPARK-20205][core] Make sure StageInfo is updated befor...

2017-05-09 Thread vanzin
Github user vanzin commented on the issue: https://github.com/apache/spark/pull/17925 @squito

[GitHub] spark issue #17916: [SPARK-20590][SQL] Use Spark internal datasource if mult...

2017-05-09 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17916 **[Test build #76696 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76696/testReport)** for PR 17916 at commit [`8c40eab`](https://github.com/apache/spark/commit/8c

[GitHub] spark issue #17916: [SPARK-20590][SQL] Use Spark internal datasource if mult...

2017-05-09 Thread HyukjinKwon
Github user HyukjinKwon commented on the issue: https://github.com/apache/spark/pull/17916 retest this please

[GitHub] spark issue #17854: [SPARK-20564][Deploy] Reduce massive executor failures w...

2017-05-09 Thread foxish
Github user foxish commented on the issue: https://github.com/apache/spark/pull/17854 In Kubernetes/Spark, we see fairly similar behavior in the scenario described. When the simultaneous container launching is not throttled, it is capable of DOSing the system. Our solution so far is t

[GitHub] spark issue #17924: [SPARK-20682][SQL] Support a new faster ORC data source ...

2017-05-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17924 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76695/ Test FAILed. ---

[GitHub] spark issue #17924: [SPARK-20682][SQL] Support a new faster ORC data source ...

2017-05-09 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17924 **[Test build #76695 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76695/testReport)** for PR 17924 at commit [`4607e0e`](https://github.com/apache/spark/commit/4

[GitHub] spark issue #17924: [SPARK-20682][SQL] Support a new faster ORC data source ...

2017-05-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17924 Merged build finished. Test FAILed.

[GitHub] spark issue #17924: [SPARK-20682][SQL] Support a new faster ORC data source ...

2017-05-09 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17924 **[Test build #76695 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76695/testReport)** for PR 17924 at commit [`4607e0e`](https://github.com/apache/spark/commit/46
