[GitHub] [spark] xccui commented on pull request #28523: [SPARK-31706][SQL] add back the support of streaming update mode

2020-05-14 Thread GitBox


xccui commented on pull request #28523:
URL: https://github.com/apache/spark/pull/28523#issuecomment-629043813


   @HeartSaVioR Yes. It seems the output mode option was mainly designed for 
stateful aggregations, which means it actually works in a restricted way.
   
   Ideally, to support complete mode, all the operators must be capable of 
outputting the "complete" result seen so far for each epoch. Personally, I'm in 
favor of removing this mode in a future version. But for now, I propose to add 
more restrictions while doing the plan check (e.g., disallowing the union 
situation you mentioned) and also a note to the documentation.
   
   IMO, the mode of the result table should only be decided by the operators in 
the plan and it could either be "append" or "update" (including the current 
"complete" mode). Basically, the designated sink should match the mode of the 
result table. Usually, supporting "update" needs more effort and that means 
only part of the sinks could be chosen for a plan containing an aggregation or 
some kind of joins.
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] yaooqinn commented on pull request #28537: [SPARK-31715][SQL][TEST] Fix flaky SparkSQLEnvSuite that sometimes varies single derby instance standard

2020-05-14 Thread GitBox


yaooqinn commented on pull request #28537:
URL: https://github.com/apache/spark/pull/28537#issuecomment-629040211


   cc @cloud-fan @HeartSaVioR thanks



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] dongjoon-hyun commented on pull request #25870: [SPARK-27936][K8S] Support python deps

2020-05-14 Thread GitBox


dongjoon-hyun commented on pull request #25870:
URL: https://github.com/apache/spark/pull/25870#issuecomment-629039005


   It's still assigned to `amp-jenkins-worker-05`.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #28537: [SPARK-31715][SQL][TEST] Fix flaky SparkSQLEnvSuite that sometimes varies single derby instance standard

2020-05-14 Thread GitBox


AmplabJenkins removed a comment on pull request #28537:
URL: https://github.com/apache/spark/pull/28537#issuecomment-629038579







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #28537: [SPARK-31715][SQL][TEST] Fix flaky SparkSQLEnvSuite that sometimes varies single derby instance standard

2020-05-14 Thread GitBox


AmplabJenkins commented on pull request #28537:
URL: https://github.com/apache/spark/pull/28537#issuecomment-629038579







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #28537: [SPARK-31715][SQL][TEST] Fix flaky SparkSQLEnvSuite that sometimes varies single derby instance standard

2020-05-14 Thread GitBox


SparkQA commented on pull request #28537:
URL: https://github.com/apache/spark/pull/28537#issuecomment-629038184


   **[Test build #122653 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/122653/testReport)**
 for PR 28537 at commit 
[`97597ac`](https://github.com/apache/spark/commit/97597ac2d958590eb6bacfc768ad023019b12ded).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #25870: [SPARK-27936][K8S] Support python deps

2020-05-14 Thread GitBox


AmplabJenkins removed a comment on pull request #25870:
URL: https://github.com/apache/spark/pull/25870#issuecomment-629036880


   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/122650/
   Test FAILed.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #25870: [SPARK-27936][K8S] Support python deps

2020-05-14 Thread GitBox


SparkQA commented on pull request #25870:
URL: https://github.com/apache/spark/pull/25870#issuecomment-629036862


   **[Test build #122650 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/122650/testReport)**
 for PR 25870 at commit 
[`a9f055b`](https://github.com/apache/spark/commit/a9f055b4642d487b724a628c4d6f682c71fd2d2c).
* This patch **fails build dependency tests**.
* This patch merges cleanly.
* This patch adds no public classes.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #25870: [SPARK-27936][K8S] Support python deps

2020-05-14 Thread GitBox


AmplabJenkins removed a comment on pull request #25870:
URL: https://github.com/apache/spark/pull/25870#issuecomment-629036875


   Merged build finished. Test FAILed.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA removed a comment on pull request #25870: [SPARK-27936][K8S] Support python deps

2020-05-14 Thread GitBox


SparkQA removed a comment on pull request #25870:
URL: https://github.com/apache/spark/pull/25870#issuecomment-629033573


   **[Test build #122650 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/122650/testReport)**
 for PR 25870 at commit 
[`a9f055b`](https://github.com/apache/spark/commit/a9f055b4642d487b724a628c4d6f682c71fd2d2c).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #25870: [SPARK-27936][K8S] Support python deps

2020-05-14 Thread GitBox


AmplabJenkins commented on pull request #25870:
URL: https://github.com/apache/spark/pull/25870#issuecomment-629036875







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #28528: [SPARK-31711][CORE] Register the executor source with the metrics system when running in local mode.

2020-05-14 Thread GitBox


AmplabJenkins removed a comment on pull request #28528:
URL: https://github.com/apache/spark/pull/28528#issuecomment-629036335







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #28509: [SPARK-31655][BUILD][3.0] Upgrade snappy-java to 1.1.7.5

2020-05-14 Thread GitBox


AmplabJenkins commented on pull request #28509:
URL: https://github.com/apache/spark/pull/28509#issuecomment-629036386







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #28528: [SPARK-31711][CORE] Register the executor source with the metrics system when running in local mode.

2020-05-14 Thread GitBox


AmplabJenkins commented on pull request #28528:
URL: https://github.com/apache/spark/pull/28528#issuecomment-629036335







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #28509: [SPARK-31655][BUILD][3.0] Upgrade snappy-java to 1.1.7.5

2020-05-14 Thread GitBox


AmplabJenkins removed a comment on pull request #28509:
URL: https://github.com/apache/spark/pull/28509#issuecomment-629036386







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #28528: [SPARK-31711][CORE] Register the executor source with the metrics system when running in local mode.

2020-05-14 Thread GitBox


SparkQA commented on pull request #28528:
URL: https://github.com/apache/spark/pull/28528#issuecomment-629035880


   **[Test build #122651 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/122651/testReport)**
 for PR 28528 at commit 
[`52c22ce`](https://github.com/apache/spark/commit/52c22ce34cf95b85e327391e554ea1f7c588dbe7).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #28509: [SPARK-31655][BUILD][3.0] Upgrade snappy-java to 1.1.7.5

2020-05-14 Thread GitBox


SparkQA commented on pull request #28509:
URL: https://github.com/apache/spark/pull/28509#issuecomment-629035863


   **[Test build #122652 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/122652/testReport)**
 for PR 28509 at commit 
[`692db6a`](https://github.com/apache/spark/commit/692db6aee585af63ce57cd22733b78beaf3d1b51).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] dongjoon-hyun commented on pull request #28509: [SPARK-31655][BUILD][3.0] Upgrade snappy-java to 1.1.7.5

2020-05-14 Thread GitBox


dongjoon-hyun commented on pull request #28509:
URL: https://github.com/apache/spark/pull/28509#issuecomment-629035653


   Retest this please.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] dongjoon-hyun commented on pull request #28528: [SPARK-31711][CORE] Register the executor source with the metrics system when running in local mode.

2020-05-14 Thread GitBox


dongjoon-hyun commented on pull request #28528:
URL: https://github.com/apache/spark/pull/28528#issuecomment-629035487


   Retest this please.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] dongjoon-hyun edited a comment on pull request #28526: [SPARK-31405][SQL][3.0] Fail by default when reading/writing legacy datetime values from/to Parquet/Avro files

2020-05-14 Thread GitBox


dongjoon-hyun edited a comment on pull request #28526:
URL: https://github.com/apache/spark/pull/28526#issuecomment-629034425


   Ya. The failure jobs are on the same broken Jenkins machine, 
`amp-jenkins-worker-05`.
   And, the running jobs are on `amp-jenkins-worker-03` and 
`amp-jenkins-worker-04`.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] dongjoon-hyun commented on pull request #28526: [SPARK-31405][SQL][3.0] Fail by default when reading/writing legacy datetime values from/to Parquet/Avro files

2020-05-14 Thread GitBox


dongjoon-hyun commented on pull request #28526:
URL: https://github.com/apache/spark/pull/28526#issuecomment-629034425


   Ya. The failure jobs are on the same broken Jenkins machine, 
`amp-jenkins-worker-05`.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #28517: [SPARK-31707][SQL] Revert SPARK-30098 Use default datasource as provider for CREATE TABLE syntax

2020-05-14 Thread GitBox


AmplabJenkins commented on pull request #28517:
URL: https://github.com/apache/spark/pull/28517#issuecomment-629033855







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #28517: [SPARK-31707][SQL] Revert SPARK-30098 Use default datasource as provider for CREATE TABLE syntax

2020-05-14 Thread GitBox


AmplabJenkins removed a comment on pull request #28517:
URL: https://github.com/apache/spark/pull/28517#issuecomment-629033855







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #25870: [SPARK-27936][K8S] Support python deps

2020-05-14 Thread GitBox


SparkQA commented on pull request #25870:
URL: https://github.com/apache/spark/pull/25870#issuecomment-629033573


   **[Test build #122650 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/122650/testReport)**
 for PR 25870 at commit 
[`a9f055b`](https://github.com/apache/spark/commit/a9f055b4642d487b724a628c4d6f682c71fd2d2c).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #28517: [SPARK-31707][SQL] Revert SPARK-30098 Use default datasource as provider for CREATE TABLE syntax

2020-05-14 Thread GitBox


SparkQA commented on pull request #28517:
URL: https://github.com/apache/spark/pull/28517#issuecomment-629033574


   **[Test build #122649 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/122649/testReport)**
 for PR 28517 at commit 
[`84c172f`](https://github.com/apache/spark/commit/84c172fb20321b082fc98fa419f9756ec44a6a7e).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] HeartSaVioR commented on a change in pull request #28517: [SPARK-31707][SQL] Revert SPARK-30098 Use default datasource as provider for CREATE TABLE syntax

2020-05-14 Thread GitBox


HeartSaVioR commented on a change in pull request #28517:
URL: https://github.com/apache/spark/pull/28517#discussion_r425571232



##
File path: 
sql/hive/src/test/scala/org/apache/spark/sql/hive/execution/HiveDDLSuite.scala
##
@@ -2706,33 +2706,6 @@ class HiveDDLSuite
 }
   }
 
-  test("SPARK-30098: create table without provider should " +

Review comment:
   These two tests are depending on the config, hence I'd rather not 
restore these tests.





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] dongjoon-hyun commented on pull request #25870: [SPARK-27936][K8S] Support python deps

2020-05-14 Thread GitBox


dongjoon-hyun commented on pull request #25870:
URL: https://github.com/apache/spark/pull/25870#issuecomment-629032842







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] xuanyuanking commented on a change in pull request #28501: [SPARK-31663][SQL] Grouping sets with having clause returns the wrong result

2020-05-14 Thread GitBox


xuanyuanking commented on a change in pull request #28501:
URL: https://github.com/apache/spark/pull/28501#discussion_r425571365



##
File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/parser/AstBuilder.scala
##
@@ -629,12 +629,7 @@ class AstBuilder(conf: SQLConf) extends 
SqlBaseBaseVisitor[AnyRef] with Logging
   case p: Predicate => p
   case e => Cast(e, BooleanType)
 }
-plan match {
-  case aggregate: Aggregate =>
-AggregateWithHaving(predicate, aggregate)
-  case _ =>
-Filter(predicate, plan)
-}
+UnresolvedHaving(predicate, plan)

Review comment:
   Yes, it still works, the `UnresolvedHaving` will be changed to Filter in 
rule `ResolveAggregateFunction`.





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #25870: [SPARK-27936][K8S] Support python deps

2020-05-14 Thread GitBox


AmplabJenkins removed a comment on pull request #25870:
URL: https://github.com/apache/spark/pull/25870#issuecomment-629031989


   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/122647/
   Test FAILed.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA removed a comment on pull request #25870: [SPARK-27936][K8S] Support python deps

2020-05-14 Thread GitBox


SparkQA removed a comment on pull request #25870:
URL: https://github.com/apache/spark/pull/25870#issuecomment-629028801


   **[Test build #122647 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/122647/testReport)**
 for PR 25870 at commit 
[`a9f055b`](https://github.com/apache/spark/commit/a9f055b4642d487b724a628c4d6f682c71fd2d2c).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #25870: [SPARK-27936][K8S] Support python deps

2020-05-14 Thread GitBox


SparkQA commented on pull request #25870:
URL: https://github.com/apache/spark/pull/25870#issuecomment-629031965


   **[Test build #122647 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/122647/testReport)**
 for PR 25870 at commit 
[`a9f055b`](https://github.com/apache/spark/commit/a9f055b4642d487b724a628c4d6f682c71fd2d2c).
* This patch **fails build dependency tests**.
* This patch merges cleanly.
* This patch adds no public classes.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #25870: [SPARK-27936][K8S] Support python deps

2020-05-14 Thread GitBox


AmplabJenkins commented on pull request #25870:
URL: https://github.com/apache/spark/pull/25870#issuecomment-629031981







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #25870: [SPARK-27936][K8S] Support python deps

2020-05-14 Thread GitBox


AmplabJenkins removed a comment on pull request #25870:
URL: https://github.com/apache/spark/pull/25870#issuecomment-629031981


   Merged build finished. Test FAILed.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #28501: [SPARK-31663][SQL] Grouping sets with having clause returns the wrong result

2020-05-14 Thread GitBox


AmplabJenkins commented on pull request #28501:
URL: https://github.com/apache/spark/pull/28501#issuecomment-629031566







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] xuanyuanking commented on a change in pull request #28501: [SPARK-31663][SQL] Grouping sets with having clause returns the wrong result

2020-05-14 Thread GitBox


xuanyuanking commented on a change in pull request #28501:
URL: https://github.com/apache/spark/pull/28501#discussion_r425570419



##
File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/Analyzer.scala
##
@@ -2163,22 +2235,29 @@ class Analyzer(
   alias.toAttribute
   }
   }
-
-  // Push the aggregate expressions into the aggregate (if any).
-  if (aggregateExpressions.nonEmpty) {
-Project(agg.output,
-  Filter(transformedAggregateFilter,
-agg.copy(aggregateExpressions = agg.aggregateExpressions ++ 
aggregateExpressions)))
-  } else {
-filter
-  }
+  (aggregateExpressions, Some(transformedAggregateFilter))
 } else {
-  filter
+  (aggregateExpressions, None)
 }
   } catch {
 // Attempting to resolve in the aggregate can result in ambiguity.  
When this happens,
 // just return the original plan.
-case ae: AnalysisException => filter
+case ae: AnalysisException => (aggregateExpressions, None)
+  }
+}
+
+def resolveHaving(filter: Filter, agg: Aggregate): LogicalPlan = {
+  // Try resolving the condition of the filter as though it is in the 
aggregate clause
+  val (aggregateExpressions, resolvedHavingCond) =
+resolveFilterCondInAggregate(filter.condition, agg)
+
+  // Push the aggregate expressions into the aggregate (if any).
+  if (aggregateExpressions.nonEmpty) {

Review comment:
   Copy, done in 3b48e38.





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] xuanyuanking commented on a change in pull request #28501: [SPARK-31663][SQL] Grouping sets with having clause returns the wrong result

2020-05-14 Thread GitBox


xuanyuanking commented on a change in pull request #28501:
URL: https://github.com/apache/spark/pull/28501#discussion_r425570296



##
File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/Analyzer.scala
##
@@ -2125,13 +2196,14 @@ class Analyzer(
   condition.find(_.isInstanceOf[AggregateExpression]).isDefined
 }
 
-def resolveHaving(filter: Filter, agg: Aggregate): LogicalPlan = {
-  // Try resolving the condition of the filter as though it is in the 
aggregate clause
+def resolveFilterCondInAggregate(
+filterCond: Expression, agg: Aggregate): (Seq[NamedExpression], 
Option[Expression]) = {

Review comment:
   Thanks, done in 3b48e38.





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #28501: [SPARK-31663][SQL] Grouping sets with having clause returns the wrong result

2020-05-14 Thread GitBox


AmplabJenkins removed a comment on pull request #28501:
URL: https://github.com/apache/spark/pull/28501#issuecomment-629031566







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] xuanyuanking commented on a change in pull request #28501: [SPARK-31663][SQL] Grouping sets with having clause returns the wrong result

2020-05-14 Thread GitBox


xuanyuanking commented on a change in pull request #28501:
URL: https://github.com/apache/spark/pull/28501#discussion_r425570468



##
File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/unresolved.scala
##
@@ -540,11 +540,12 @@ case class UnresolvedOrdinal(ordinal: Int)
 }
 
 /**
- * Represents unresolved aggregate with having clause, it is turned by the 
analyzer into a Filter.
+ * Represents unresolved having clause, the child for it can be Aggregate, 
Grouping Sets, Rollup

Review comment:
   Thanks, done in 3b48e38.





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] xuanyuanking commented on a change in pull request #28501: [SPARK-31663][SQL] Grouping sets with having clause returns the wrong result

2020-05-14 Thread GitBox


xuanyuanking commented on a change in pull request #28501:
URL: https://github.com/apache/spark/pull/28501#discussion_r425570127



##
File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/Analyzer.scala
##
@@ -595,8 +601,73 @@ class Analyzer(
   }
 }
 
-// This require transformUp to replace grouping()/grouping_id() in 
resolved Filter/Sort
-def apply(plan: LogicalPlan): LogicalPlan = plan resolveOperatorsUp {
+private def tryResolveHavingCondition(
+a: UnresolvedHaving, havingCond: Expression, agg: LogicalPlan): 
LogicalPlan = {
+  val aggForResolving = agg match {
+// For CUBE/ROLLUP expressions, to avoid resolving repeatedly, here we 
delete them from
+// groupingExpressions for condition resolving.
+case a @ Aggregate(Seq(c @ Cube(groupByExprs)), _, _) =>
+  a.copy(groupingExpressions = groupByExprs)
+case a @ Aggregate(Seq(r @ Rollup(groupByExprs)), _, _) =>
+  a.copy(groupingExpressions = groupByExprs)
+case g: GroupingSets =>
+  Aggregate(
+getFinalGroupByExpressions(g.selectedGroupByExprs, g.groupByExprs),
+g.aggregations, g.child)
+  }
+  // Try resolving the condition of the filter as though it is in the 
aggregate clause
+  val (extraAggExprs, resolvedHavingCond) =
+ResolveAggregateFunctions.resolveFilterCondInAggregate(havingCond, 
aggForResolving)
+
+  // Push the aggregate expressions into the aggregate (if any).
+  if (extraAggExprs.nonEmpty) {
+val newChild = agg match {
+  case Aggregate(Seq(c @ Cube(groupByExprs)), aggregateExpressions, 
child) =>
+constructAggregate(
+  cubeExprs(groupByExprs), groupByExprs, aggregateExpressions ++ 
extraAggExprs, child)
+  case Aggregate(Seq(r @ Rollup(groupByExprs)), aggregateExpressions, 
child) =>
+constructAggregate(
+  rollupExprs(groupByExprs), groupByExprs, aggregateExpressions ++ 
extraAggExprs, child)
+  case x: GroupingSets =>
+constructAggregate(
+  x.selectedGroupByExprs, x.groupByExprs, x.aggregations ++ 
extraAggExprs, x.child)
+}
+
+// Since the exprId of extraAggExprs will be changed in the 
constructed aggregate, and the
+// aggregateExpressions keeps the input order. So here we build an 
exprMap to resolve the
+// condition again.
+val exprMap = extraAggExprs.zip(
+  newChild.asInstanceOf[Aggregate].aggregateExpressions.takeRight(
+extraAggExprs.length)).toMap
+val newCond = resolvedHavingCond.get.transform {
+  case ne: NamedExpression if exprMap.contains(ne) => exprMap(ne)
+}
+Project(newChild.output.dropRight(extraAggExprs.length),
+  Filter(newCond, newChild))
+  } else {
+a
+  }
+}
+
+// This require transformDown to resolve having condition when generating 
aggregate node for
+// CUBE/ROLLUP/GROUPING SETS. This also replace grouping()/grouping_id() 
in resolved
+// Filter/Sort.
+def apply(plan: LogicalPlan): LogicalPlan = plan resolveOperatorsDown {
+  case a @ UnresolvedHaving(
+  havingCondition, agg @ Aggregate(Seq(c @ Cube(groupByExprs)), 
aggregateExpressions, _))
+  if agg.childrenResolved && 
!havingCondition.isInstanceOf[SubqueryExpression]

Review comment:
   Thanks for reminding. Yes, we don't need this. Deleted in 3b48e38.





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] xuanyuanking commented on a change in pull request #28501: [SPARK-31663][SQL] Grouping sets with having clause returns the wrong result

2020-05-14 Thread GitBox


xuanyuanking commented on a change in pull request #28501:
URL: https://github.com/apache/spark/pull/28501#discussion_r425569841



##
File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/Analyzer.scala
##
@@ -595,8 +601,73 @@ class Analyzer(
   }
 }
 
-// This require transformUp to replace grouping()/grouping_id() in 
resolved Filter/Sort
-def apply(plan: LogicalPlan): LogicalPlan = plan resolveOperatorsUp {
+private def tryResolveHavingCondition(
+a: UnresolvedHaving, havingCond: Expression, agg: LogicalPlan): 
LogicalPlan = {

Review comment:
   Thanks, done in 3b48e38.





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #28501: [SPARK-31663][SQL] Grouping sets with having clause returns the wrong result

2020-05-14 Thread GitBox


SparkQA commented on pull request #28501:
URL: https://github.com/apache/spark/pull/28501#issuecomment-629031174


   **[Test build #122648 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/122648/testReport)**
 for PR 28501 at commit 
[`3b48e38`](https://github.com/apache/spark/commit/3b48e38d72e35e24d5f47b0a1d36f27f876455d1).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #25870: [SPARK-27936][K8S] Support python deps

2020-05-14 Thread GitBox


SparkQA commented on pull request #25870:
URL: https://github.com/apache/spark/pull/25870#issuecomment-629028801


   **[Test build #122647 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/122647/testReport)**
 for PR 25870 at commit 
[`a9f055b`](https://github.com/apache/spark/commit/a9f055b4642d487b724a628c4d6f682c71fd2d2c).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] dongjoon-hyun commented on pull request #25870: [SPARK-27936][K8S] Support python deps

2020-05-14 Thread GitBox


dongjoon-hyun commented on pull request #25870:
URL: https://github.com/apache/spark/pull/25870#issuecomment-629027123


   The dependency issue is fixed.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] dongjoon-hyun commented on pull request #25870: [SPARK-27936][K8S] Support python deps

2020-05-14 Thread GitBox


dongjoon-hyun commented on pull request #25870:
URL: https://github.com/apache/spark/pull/25870#issuecomment-629027156


   Retest this please.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #28526: [SPARK-31405][SQL][3.0] Fail by default when reading/writing legacy datetime values from/to Parquet/Avro files

2020-05-14 Thread GitBox


AmplabJenkins removed a comment on pull request #28526:
URL: https://github.com/apache/spark/pull/28526#issuecomment-629022492


   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/122646/
   Test FAILed.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA removed a comment on pull request #28526: [SPARK-31405][SQL][3.0] Fail by default when reading/writing legacy datetime values from/to Parquet/Avro files

2020-05-14 Thread GitBox


SparkQA removed a comment on pull request #28526:
URL: https://github.com/apache/spark/pull/28526#issuecomment-629020341


   **[Test build #5006 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/5006/testReport)**
 for PR 28526 at commit 
[`9f14144`](https://github.com/apache/spark/commit/9f14144d191c8c41d8b3bd9585d78eb7ddae8407).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #28526: [SPARK-31405][SQL][3.0] Fail by default when reading/writing legacy datetime values from/to Parquet/Avro files

2020-05-14 Thread GitBox


SparkQA commented on pull request #28526:
URL: https://github.com/apache/spark/pull/28526#issuecomment-629022933


   **[Test build #5006 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/5006/testReport)**
 for PR 28526 at commit 
[`9f14144`](https://github.com/apache/spark/commit/9f14144d191c8c41d8b3bd9585d78eb7ddae8407).
* This patch **fails build dependency tests**.
* This patch merges cleanly.
* This patch adds the following public classes _(experimental)_:
 * `class AvroDeserializer(`



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #28537: [SPARK-31715][SQL][TEST] Fix flaky SparkSQLEnvSuite that sometimes varies single derby instance standard

2020-05-14 Thread GitBox


AmplabJenkins removed a comment on pull request #28537:
URL: https://github.com/apache/spark/pull/28537#issuecomment-629022582


   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/122645/
   Test FAILed.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #28537: [SPARK-31715][SQL][TEST] Fix flaky SparkSQLEnvSuite that sometimes varies single derby instance standard

2020-05-14 Thread GitBox


AmplabJenkins removed a comment on pull request #28537:
URL: https://github.com/apache/spark/pull/28537#issuecomment-629022578


   Merged build finished. Test FAILed.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA removed a comment on pull request #28526: [SPARK-31405][SQL][3.0] Fail by default when reading/writing legacy datetime values from/to Parquet/Avro files

2020-05-14 Thread GitBox


SparkQA removed a comment on pull request #28526:
URL: https://github.com/apache/spark/pull/28526#issuecomment-629019973


   **[Test build #122646 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/122646/testReport)**
 for PR 28526 at commit 
[`9f14144`](https://github.com/apache/spark/commit/9f14144d191c8c41d8b3bd9585d78eb7ddae8407).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA removed a comment on pull request #28537: [SPARK-31715][SQL][TEST] Fix flaky SparkSQLEnvSuite that sometimes varies single derby instance standard

2020-05-14 Thread GitBox


SparkQA removed a comment on pull request #28537:
URL: https://github.com/apache/spark/pull/28537#issuecomment-629012577


   **[Test build #122645 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/122645/testReport)**
 for PR 28537 at commit 
[`31d032e`](https://github.com/apache/spark/commit/31d032e8a8525a1527c666b9064d872e1f087460).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #28526: [SPARK-31405][SQL][3.0] Fail by default when reading/writing legacy datetime values from/to Parquet/Avro files

2020-05-14 Thread GitBox


AmplabJenkins removed a comment on pull request #28526:
URL: https://github.com/apache/spark/pull/28526#issuecomment-629022488


   Merged build finished. Test FAILed.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #28526: [SPARK-31405][SQL][3.0] Fail by default when reading/writing legacy datetime values from/to Parquet/Avro files

2020-05-14 Thread GitBox


SparkQA commented on pull request #28526:
URL: https://github.com/apache/spark/pull/28526#issuecomment-629022481


   **[Test build #122646 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/122646/testReport)**
 for PR 28526 at commit 
[`9f14144`](https://github.com/apache/spark/commit/9f14144d191c8c41d8b3bd9585d78eb7ddae8407).
* This patch **fails build dependency tests**.
* This patch merges cleanly.
* This patch adds the following public classes _(experimental)_:
 * `class AvroDeserializer(`



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #28526: [SPARK-31405][SQL][3.0] Fail by default when reading/writing legacy datetime values from/to Parquet/Avro files

2020-05-14 Thread GitBox


AmplabJenkins commented on pull request #28526:
URL: https://github.com/apache/spark/pull/28526#issuecomment-629022488







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #28537: [SPARK-31715][SQL][TEST] Fix flaky SparkSQLEnvSuite that sometimes varies single derby instance standard

2020-05-14 Thread GitBox


SparkQA commented on pull request #28537:
URL: https://github.com/apache/spark/pull/28537#issuecomment-629022542


   **[Test build #122645 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/122645/testReport)**
 for PR 28537 at commit 
[`31d032e`](https://github.com/apache/spark/commit/31d032e8a8525a1527c666b9064d872e1f087460).
* This patch **fails Spark unit tests**.
* This patch merges cleanly.
* This patch adds no public classes.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #28537: [SPARK-31715][SQL][TEST] Fix flaky SparkSQLEnvSuite that sometimes varies single derby instance standard

2020-05-14 Thread GitBox


AmplabJenkins commented on pull request #28537:
URL: https://github.com/apache/spark/pull/28537#issuecomment-629022578







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] HyukjinKwon edited a comment on pull request #28526: [SPARK-31405][SQL][3.0] Fail by default when reading/writing legacy datetime values from/to Parquet/Avro files

2020-05-14 Thread GitBox


HyukjinKwon edited a comment on pull request #28526:
URL: https://github.com/apache/spark/pull/28526#issuecomment-629021118


   FYI, the issue seems not completely fixed yet due to the cached 
`.m2/repository`, see also 
https://issues.apache.org/jira/browse/SPARK-31693?focusedCommentId=17107856=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-17107856



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] HyukjinKwon commented on pull request #28526: [SPARK-31405][SQL][3.0] Fail by default when reading/writing legacy datetime values from/to Parquet/Avro files

2020-05-14 Thread GitBox


HyukjinKwon commented on pull request #28526:
URL: https://github.com/apache/spark/pull/28526#issuecomment-629021118


   FYI,tThis seems not completely fixed yet due to the cached `.m2/repository`, 
see also 
https://issues.apache.org/jira/browse/SPARK-31693?focusedCommentId=17107856=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-17107856



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] HeartSaVioR commented on a change in pull request #28517: [SPARK-31707][SQL] Revert SPARK-30098 Use default datasource as provider for CREATE TABLE syntax

2020-05-14 Thread GitBox


HeartSaVioR commented on a change in pull request #28517:
URL: https://github.com/apache/spark/pull/28517#discussion_r425561029



##
File path: 
sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/parser/DDLParserSuite.scala
##
@@ -2195,22 +2195,4 @@ class DDLParserSuite extends AnalysisTest {
   parsePlan("COMMENT ON TABLE a.b.c IS 'xYz'"),
   CommentOnTable(UnresolvedTable(Seq("a", "b", "c")), "xYz"))
   }
-
-  test("create table - without using") {

Review comment:
   Yeah I see. That's one of valid approaches and good idea. Maybe then we 
need to remove some tests which depend on the config (two tests for on and off).





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #28526: [SPARK-31405][SQL][3.0] Fail by default when reading/writing legacy datetime values from/to Parquet/Avro files

2020-05-14 Thread GitBox


SparkQA commented on pull request #28526:
URL: https://github.com/apache/spark/pull/28526#issuecomment-629020394







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #28526: [SPARK-31405][SQL][3.0] Fail by default when reading/writing legacy datetime values from/to Parquet/Avro files

2020-05-14 Thread GitBox


SparkQA commented on pull request #28526:
URL: https://github.com/apache/spark/pull/28526#issuecomment-629020341


   **[Test build #5006 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/5006/testReport)**
 for PR 28526 at commit 
[`9f14144`](https://github.com/apache/spark/commit/9f14144d191c8c41d8b3bd9585d78eb7ddae8407).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #28526: [SPARK-31405][SQL][3.0] Fail by default when reading/writing legacy datetime values from/to Parquet/Avro files

2020-05-14 Thread GitBox


AmplabJenkins commented on pull request #28526:
URL: https://github.com/apache/spark/pull/28526#issuecomment-629020204







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #28526: [SPARK-31405][SQL][3.0] Fail by default when reading/writing legacy datetime values from/to Parquet/Avro files

2020-05-14 Thread GitBox


AmplabJenkins removed a comment on pull request #28526:
URL: https://github.com/apache/spark/pull/28526#issuecomment-629020204







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #28526: [SPARK-31405][SQL][3.0] Fail by default when reading/writing legacy datetime values from/to Parquet/Avro files

2020-05-14 Thread GitBox


SparkQA commented on pull request #28526:
URL: https://github.com/apache/spark/pull/28526#issuecomment-629019973


   **[Test build #122646 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/122646/testReport)**
 for PR 28526 at commit 
[`9f14144`](https://github.com/apache/spark/commit/9f14144d191c8c41d8b3bd9585d78eb7ddae8407).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] HeartSaVioR commented on a change in pull request #28517: [SPARK-31707][SQL] Revert SPARK-30098 Use default datasource as provider for CREATE TABLE syntax

2020-05-14 Thread GitBox


HeartSaVioR commented on a change in pull request #28517:
URL: https://github.com/apache/spark/pull/28517#discussion_r425560435



##
File path: docs/sql-ref-syntax-ddl-create-table-datasource.md
##
@@ -40,7 +40,7 @@ CREATE TABLE [ IF NOT EXISTS ] table_identifier
 [ AS select_statement ]
 ```
 
-Note that, the clauses between the USING clause and the AS SELECT clause can 
come in
+Note that, the clauses between the OPTIONS clause and the AS SELECT clause can 
come in

Review comment:
   Ah OK my bad. I found USING shouldn't com in any order so previous 
statement was correct. Sorry about that.





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan commented on pull request #28526: [SPARK-31405][SQL][3.0] Fail by default when reading/writing legacy datetime values from/to Parquet/Avro files

2020-05-14 Thread GitBox


cloud-fan commented on pull request #28526:
URL: https://github.com/apache/spark/pull/28526#issuecomment-629019709


   retest this please



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan commented on pull request #28501: [SPARK-31663][SQL] Grouping sets with having clause returns the wrong result

2020-05-14 Thread GitBox


cloud-fan commented on pull request #28501:
URL: https://github.com/apache/spark/pull/28501#issuecomment-629019390


   It's a long-standing bug, so it doesn't block 2.4.6. I'll see if I can merge 
it today.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan commented on a change in pull request #28517: [SPARK-31707][SQL] Revert SPARK-30098 Use default datasource as provider for CREATE TABLE syntax

2020-05-14 Thread GitBox


cloud-fan commented on a change in pull request #28517:
URL: https://github.com/apache/spark/pull/28517#discussion_r425559542



##
File path: 
sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/parser/DDLParserSuite.scala
##
@@ -2195,22 +2195,4 @@ class DDLParserSuite extends AnalysisTest {
   parsePlan("COMMENT ON TABLE a.b.c IS 'xYz'"),
   CommentOnTable(UnresolvedTable(Seq("a", "b", "c")), "xYz"))
   }
-
-  test("create table - without using") {

Review comment:
   I think he meant to remove 
`withSQLConf(SQLConf.LEGACY_CREATE_HIVE_TABLE_BY_DEFAULT_ENABLED.key -> 
"false")`. We don't need the config anyway, if we unify the syntax.





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan commented on a change in pull request #28517: [SPARK-31707][SQL] Revert SPARK-30098 Use default datasource as provider for CREATE TABLE syntax

2020-05-14 Thread GitBox


cloud-fan commented on a change in pull request #28517:
URL: https://github.com/apache/spark/pull/28517#discussion_r425559216



##
File path: docs/sql-ref-syntax-ddl-create-table-datasource.md
##
@@ -40,7 +40,7 @@ CREATE TABLE [ IF NOT EXISTS ] table_identifier
 [ AS select_statement ]
 ```
 
-Note that, the clauses between the USING clause and the AS SELECT clause can 
come in
+Note that, the clauses between the OPTIONS clause and the AS SELECT clause can 
come in

Review comment:
   > between the USING clause and the AS SELECT clause
   
   USING and AS SELECT are both not included, so this is still corrected.

##
File path: docs/sql-ref-syntax-ddl-create-table-datasource.md
##
@@ -40,7 +40,7 @@ CREATE TABLE [ IF NOT EXISTS ] table_identifier
 [ AS select_statement ]
 ```
 
-Note that, the clauses between the USING clause and the AS SELECT clause can 
come in
+Note that, the clauses between the OPTIONS clause and the AS SELECT clause can 
come in

Review comment:
   > between the USING clause and the AS SELECT clause
   
   USING and AS SELECT are both not included, so this is still corrected. We 
shouldn't change it.





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] HeartSaVioR commented on a change in pull request #28517: [SPARK-31707][SQL] Revert SPARK-30098 Use default datasource as provider for CREATE TABLE syntax

2020-05-14 Thread GitBox


HeartSaVioR commented on a change in pull request #28517:
URL: https://github.com/apache/spark/pull/28517#discussion_r425558929



##
File path: docs/sql-ref-syntax-ddl-create-table-datasource.md
##
@@ -40,7 +40,7 @@ CREATE TABLE [ IF NOT EXISTS ] table_identifier
 [ AS select_statement ]
 ```
 
-Note that, the clauses between the USING clause and the AS SELECT clause can 
come in
+Note that, the clauses between the OPTIONS clause and the AS SELECT clause can 
come in

Review comment:
   between is more likely be inclusive, otherwise previous statement was 
also wrong.





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan closed pull request #28531: [SPARK-31712][SQL][TESTS] Check casting timestamps before the epoch to Byte/Short/Int/Long types

2020-05-14 Thread GitBox


cloud-fan closed pull request #28531:
URL: https://github.com/apache/spark/pull/28531


   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] HeartSaVioR commented on a change in pull request #28517: [SPARK-31707][SQL] Revert SPARK-30098 Use default datasource as provider for CREATE TABLE syntax

2020-05-14 Thread GitBox


HeartSaVioR commented on a change in pull request #28517:
URL: https://github.com/apache/spark/pull/28517#discussion_r425558612



##
File path: 
sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/parser/DDLParserSuite.scala
##
@@ -2195,22 +2195,4 @@ class DDLParserSuite extends AnalysisTest {
   parsePlan("COMMENT ON TABLE a.b.c IS 'xYz'"),
   CommentOnTable(UnresolvedTable(Seq("a", "b", "c")), "xYz"))
   }
-
-  test("create table - without using") {

Review comment:
   Sorry but could you please elaborate? Removing config is what this patch 
proposed, and it's a part of revert.





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan commented on pull request #28531: [SPARK-31712][SQL][TESTS] Check casting timestamps before the epoch to Byte/Short/Int/Long types

2020-05-14 Thread GitBox


cloud-fan commented on pull request #28531:
URL: https://github.com/apache/spark/pull/28531#issuecomment-629018056


   thanks, merging to master/3.0!



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] HyukjinKwon commented on pull request #28535: add jvm opt -Xms to reduce FGC

2020-05-14 Thread GitBox


HyukjinKwon commented on pull request #28535:
URL: https://github.com/apache/spark/pull/28535#issuecomment-629014875


   @oikomi, please keep the PR description and file a JIRA. Also, I think you 
can specify Xms via `spark.driver.extraJavaOptions` and 
`spark.executor.extraJavaOptions` already. Let's don't assume all users want to 
set the minimum memory.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #28537: [SPARK-31715][SQL][TEST] Fix flaky SparkSQLEnvSuite that sometimes varies single derby instance standard

2020-05-14 Thread GitBox


AmplabJenkins commented on pull request #28537:
URL: https://github.com/apache/spark/pull/28537#issuecomment-629012794







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #28537: [SPARK-31715][SQL][TEST] Fix flaky SparkSQLEnvSuite that sometimes varies single derby instance standard

2020-05-14 Thread GitBox


AmplabJenkins removed a comment on pull request #28537:
URL: https://github.com/apache/spark/pull/28537#issuecomment-629012794







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #28537: [SPARK-31715][SQL][TEST] Fix flaky SparkSQLEnvSuite that sometimes varies single derby instance standard

2020-05-14 Thread GitBox


SparkQA commented on pull request #28537:
URL: https://github.com/apache/spark/pull/28537#issuecomment-629012577


   **[Test build #122645 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/122645/testReport)**
 for PR 28537 at commit 
[`31d032e`](https://github.com/apache/spark/commit/31d032e8a8525a1527c666b9064d872e1f087460).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] yaooqinn opened a new pull request #28537: [SPARK-31715][SQL][TEST] Fix flaky SparkSQLEnvSuite that sometimes varies single derby instance standard

2020-05-14 Thread GitBox


yaooqinn opened a new pull request #28537:
URL: https://github.com/apache/spark/pull/28537


   ### What changes were proposed in this pull request?
   
   
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/122622/testReport/junit/org.apache.spark.sql.hive.thriftserver/SparkSQLEnvSuite/SPARK_29604_external_listeners_should_be_initialized_with_Spark_classloader/history/?start=25
   
   According to the test report history of SparkSQLEnvSuite,this test fails 
frequently which is caused by single derby restriction.
   
   ```java
   Caused by: sbt.ForkMain$ForkError: 
org.apache.derby.iapi.error.StandardException: Another instance of Derby may 
have already booted the database 
/home/jenkins/workspace/SparkPullRequestBuilder/sql/hive-thriftserver/metastore_db.
at org.apache.derby.iapi.error.StandardException.newException(Unknown 
Source)
at 
org.apache.derby.impl.store.raw.data.BaseDataFileFactory.privGetJBMSLockOnDB(Unknown
 Source)
at org.apache.derby.impl.store.raw.data.BaseDataFileFactory.run(Unknown 
Source)
at java.security.AccessController.doPrivileged(Native Method)
at 
org.apache.derby.impl.store.raw.data.BaseDataFileFactory.getJBMSLockOnDB(Unknown
 Source)
at 
org.apache.derby.impl.store.raw.data.BaseDataFileFactory.boot(Unknown Source)
at org.apache.derby.impl.services.monitor.BaseMonitor.boot(Unknown 
Source)
at org.apache.derby.impl.services.monitor.TopService.bootModule(Unknown 
Source)
at 
org.apache.derby.impl.services.monitor.BaseMonitor.startModule(Unknown Source)
at 
org.apache.derby.iapi.services.monitor.Monitor.bootServiceModule(Unknown Source)
at org.apache.derby.impl.store.raw.RawStore.boot(Unknown Source)
at org.apache.derby.impl.services.monitor.BaseMonitor.boot(Unknown 
Source)
at org.apache.derby.impl.services.monitor.TopService.bootModule(Unknown 
Source)
at 
org.apache.derby.impl.services.monitor.BaseMonitor.startModule(Unknown Source)
at 
org.apache.derby.iapi.services.monitor.Monitor.bootServiceModule(Unknown Source)
at org.apache.derby.impl.store.access.RAMAccessManager.boot(Unknown 
Source)
at org.apache.derby.impl.services.monitor.BaseMonitor.boot(Unknown 
Source)
at org.apache.derby.impl.services.monitor.TopService.bootModule(Unknown 
Source)
at 
org.apache.derby.impl.services.monitor.BaseMonitor.startModule(Unknown Source)
at 
org.apache.derby.iapi.services.monitor.Monitor.bootServiceModule(Unknown Source)
at org.apache.derby.impl.db.BasicDatabase.bootStore(Unknown Source)
at org.apache.derby.impl.db.BasicDatabase.boot(Unknown Source)
at org.apache.derby.impl.services.monitor.BaseMonitor.boot(Unknown 
Source)
at org.apache.derby.impl.services.monitor.TopService.bootModule(Unknown 
Source)
at 
org.apache.derby.impl.services.monitor.BaseMonitor.bootService(Unknown Source)
at 
org.apache.derby.impl.services.monitor.BaseMonitor.startProviderService(Unknown 
Source)
at 
org.apache.derby.impl.services.monitor.BaseMonitor.findProviderAndStartService(Unknown
 Source)
at 
org.apache.derby.impl.services.monitor.BaseMonitor.startPersistentService(Unknown
 Source)
at 
org.apache.derby.iapi.services.monitor.Monitor.startPersistentService(Unknown 
Source)
... 138 more
   ```
   
   This PR adds a separate directory to locate the metastore_db for this test 
which runs in a dedicated JVM.
   
   Besides, diable the UI for the potential race on `spark.ui.port` which may 
also let the test case become flaky.
   
   ### Why are the changes needed?
   test fix
   
   
   ### Does this PR introduce _any_ user-facing change?
   NO
   
   
   ### How was this patch tested?
   SparkSQLEnvSuite itself.
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #28536: [WIP][SPARK-XXX][SQL] Use a fallback version in HiveExternalCatalogVersionsSuite

2020-05-14 Thread GitBox


AmplabJenkins removed a comment on pull request #28536:
URL: https://github.com/apache/spark/pull/28536#issuecomment-629009346


   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/27306/
   Test PASSed.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #28536: [WIP][SPARK-XXX][SQL] Use a fallback version in HiveExternalCatalogVersionsSuite

2020-05-14 Thread GitBox


AmplabJenkins commented on pull request #28536:
URL: https://github.com/apache/spark/pull/28536#issuecomment-629009343







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #28536: [WIP][SPARK-XXX][SQL] Use a fallback version in HiveExternalCatalogVersionsSuite

2020-05-14 Thread GitBox


AmplabJenkins removed a comment on pull request #28536:
URL: https://github.com/apache/spark/pull/28536#issuecomment-629009343


   Merged build finished. Test PASSed.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #28536: [WIP][SPARK-XXX][SQL] Use a fallback version in HiveExternalCatalogVersionsSuite

2020-05-14 Thread GitBox


SparkQA commented on pull request #28536:
URL: https://github.com/apache/spark/pull/28536#issuecomment-629009075


   **[Test build #122644 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/122644/testReport)**
 for PR 28536 at commit 
[`5505ab1`](https://github.com/apache/spark/commit/5505ab1c0d7890750f4a5bd4888befb4154450d6).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] yaooqinn commented on pull request #28222: SPARK-31447 Fix issue in ExtractIntervalPart expression

2020-05-14 Thread GitBox


yaooqinn commented on pull request #28222:
URL: https://github.com/apache/spark/pull/28222#issuecomment-629008491


   Hi @sathyaprakashg 
   This could be an API change and may cause performance regression for this 
operator. IMHO, we may need much stronger evidence to make the change, e.g. SQL 
standard, behaviors for `timestamp - timestamp` from other modern DBMS systems 
as much as possible.  Otherwise, returning only micros part may be 
spark-specific but efficient and causes no ambiguity. 



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #28536: [WIP][SPARK-XXX][SQL] Use a fallback version in HiveExternalCatalogVersionsSuite

2020-05-14 Thread GitBox


AmplabJenkins removed a comment on pull request #28536:
URL: https://github.com/apache/spark/pull/28536#issuecomment-629007660







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #28531: [SPARK-31712][SQL][TESTS] Check casting timestamps before the epoch to Byte/Short/Int/Long types

2020-05-14 Thread GitBox


AmplabJenkins removed a comment on pull request #28531:
URL: https://github.com/apache/spark/pull/28531#issuecomment-629007220







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #28536: [WIP][SPARK-XXX][SQL] Use a fallback version in HiveExternalCatalogVersionsSuite

2020-05-14 Thread GitBox


AmplabJenkins commented on pull request #28536:
URL: https://github.com/apache/spark/pull/28536#issuecomment-629007660







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #28531: [SPARK-31712][SQL][TESTS] Check casting timestamps before the epoch to Byte/Short/Int/Long types

2020-05-14 Thread GitBox


AmplabJenkins commented on pull request #28531:
URL: https://github.com/apache/spark/pull/28531#issuecomment-629007220







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #28536: [WIP][SPARK-XXX][SQL] Use a fallback version in HiveExternalCatalogVersionsSuite

2020-05-14 Thread GitBox


SparkQA commented on pull request #28536:
URL: https://github.com/apache/spark/pull/28536#issuecomment-629007295


   **[Test build #122643 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/122643/testReport)**
 for PR 28536 at commit 
[`6d44e9e`](https://github.com/apache/spark/commit/6d44e9e1e216a9135ab3e823358cc7d05788f33c).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] dongjoon-hyun opened a new pull request #28536: [WIP][SPARK-XXX][SQL] Use a fallback version in HiveExternalCatalogVersionsSuite

2020-05-14 Thread GitBox


dongjoon-hyun opened a new pull request #28536:
URL: https://github.com/apache/spark/pull/28536


   …
   
   
   
   ### What changes were proposed in this pull request?
   
   
   
   ### Why are the changes needed?
   
   
   
   ### Does this PR introduce _any_ user-facing change?
   
   
   
   ### How was this patch tested?
   
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA removed a comment on pull request #28531: [SPARK-31712][SQL][TESTS] Check casting timestamps before the epoch to Byte/Short/Int/Long types

2020-05-14 Thread GitBox


SparkQA removed a comment on pull request #28531:
URL: https://github.com/apache/spark/pull/28531#issuecomment-628901532


   **[Test build #122634 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/122634/testReport)**
 for PR 28531 at commit 
[`d6f1ec6`](https://github.com/apache/spark/commit/d6f1ec6ee9429abac6d52b0426706fba627fbaa9).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #28531: [SPARK-31712][SQL][TESTS] Check casting timestamps before the epoch to Byte/Short/Int/Long types

2020-05-14 Thread GitBox


SparkQA commented on pull request #28531:
URL: https://github.com/apache/spark/pull/28531#issuecomment-629006662


   **[Test build #122634 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/122634/testReport)**
 for PR 28531 at commit 
[`d6f1ec6`](https://github.com/apache/spark/commit/d6f1ec6ee9429abac6d52b0426706fba627fbaa9).
* This patch passes all tests.
* This patch merges cleanly.
* This patch adds no public classes.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] Ngone51 commented on a change in pull request #28517: [SPARK-31707][SQL] Revert SPARK-30098 Use default datasource as provider for CREATE TABLE syntax

2020-05-14 Thread GitBox


Ngone51 commented on a change in pull request #28517:
URL: https://github.com/apache/spark/pull/28517#discussion_r425546566



##
File path: docs/sql-ref-syntax-ddl-create-table-datasource.md
##
@@ -40,7 +40,7 @@ CREATE TABLE [ IF NOT EXISTS ] table_identifier
 [ AS select_statement ]
 ```
 
-Note that, the clauses between the USING clause and the AS SELECT clause can 
come in
+Note that, the clauses between the OPTIONS clause and the AS SELECT clause can 
come in

Review comment:
   Should still be `USING` since `OPTIONS` itself can also come in any 
order?

##
File path: 
sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/parser/DDLParserSuite.scala
##
@@ -2195,22 +2195,4 @@ class DDLParserSuite extends AnalysisTest {
   parsePlan("COMMENT ON TABLE a.b.c IS 'xYz'"),
   CommentOnTable(UnresolvedTable(Seq("a", "b", "c")), "xYz"))
   }
-
-  test("create table - without using") {

Review comment:
   So, just remove the conf after revert?





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] xuanyuanking commented on pull request #28501: [SPARK-31663][SQL] Grouping sets with having clause returns the wrong result

2020-05-14 Thread GitBox


xuanyuanking commented on pull request #28501:
URL: https://github.com/apache/spark/pull/28501#issuecomment-629004304


   @holdenk Thanks for notifying, I'll address all the comments today. Yep, if 
it can be merged before cutting 2.4.6, let's put it in 2.4.7.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #28496: [SPARK-31620][SQL] Fix reference binding failure in case of an final agg contains subquery

2020-05-14 Thread GitBox


AmplabJenkins commented on pull request #28496:
URL: https://github.com/apache/spark/pull/28496#issuecomment-629003873







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #28496: [SPARK-31620][SQL] Fix reference binding failure in case of an final agg contains subquery

2020-05-14 Thread GitBox


AmplabJenkins removed a comment on pull request #28496:
URL: https://github.com/apache/spark/pull/28496#issuecomment-629003873







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #28496: [SPARK-31620][SQL] Fix reference binding failure in case of an final agg contains subquery

2020-05-14 Thread GitBox


SparkQA commented on pull request #28496:
URL: https://github.com/apache/spark/pull/28496#issuecomment-629003544


   **[Test build #122642 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/122642/testReport)**
 for PR 28496 at commit 
[`493157a`](https://github.com/apache/spark/commit/493157a3b97616d221ec2b5ddf1a21cdf9a1a3f4).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #28496: [SPARK-31620][SQL] Fix reference binding failure in case of an final agg contains subquery

2020-05-14 Thread GitBox


AmplabJenkins removed a comment on pull request #28496:
URL: https://github.com/apache/spark/pull/28496#issuecomment-628738076


   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/122619/
   Test FAILed.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] Ngone51 commented on pull request #28496: [SPARK-31620][SQL] Fix reference binding failure in case of an final agg contains subquery

2020-05-14 Thread GitBox


Ngone51 commented on pull request #28496:
URL: https://github.com/apache/spark/pull/28496#issuecomment-629001780


   retest this please



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] HeartSaVioR commented on a change in pull request #28517: [SPARK-31707][SQL] Revert SPARK-30098 Use default datasource as provider for CREATE TABLE syntax

2020-05-14 Thread GitBox


HeartSaVioR commented on a change in pull request #28517:
URL: https://github.com/apache/spark/pull/28517#discussion_r425540192



##
File path: 
sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/parser/DDLParserSuite.scala
##
@@ -2195,22 +2195,4 @@ class DDLParserSuite extends AnalysisTest {
   parsePlan("COMMENT ON TABLE a.b.c IS 'xYz'"),
   CommentOnTable(UnresolvedTable(Seq("a", "b", "c")), "xYz"))
   }
-
-  test("create table - without using") {

Review comment:
   It cannot be simply changed to "ignore" instead of "test" as we're 
removing the config. I'll comment out instead.





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #28525: [SPARK-27562][Shuffle] Complete the verification mechanism for shuffle transmitted data

2020-05-14 Thread GitBox


SparkQA commented on pull request #28525:
URL: https://github.com/apache/spark/pull/28525#issuecomment-628995389


   **[Test build #122641 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/122641/testReport)**
 for PR 28525 at commit 
[`bb15a4d`](https://github.com/apache/spark/commit/bb15a4dab44fdba03cf8fcab06c591f16ca5c4e6).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



  1   2   3   4   5   6   >