[GitHub] [spark] uncleGen commented on a change in pull request #28391: [SPARK-31593][SS] Remove unnecessary streaming query progress update

2020-06-03 Thread GitBox
uncleGen commented on a change in pull request #28391: URL: https://github.com/apache/spark/pull/28391#discussion_r434380758 ## File path: sql/core/src/test/scala/org/apache/spark/sql/streaming/StreamingQueryListenerSuite.scala ## @@ -433,9 +433,13 @@ class

[GitHub] [spark] maropu commented on a change in pull request #28686: [SPARK-31877][SQL]Avoid stats computation for Hive table

2020-06-03 Thread GitBox
maropu commented on a change in pull request #28686: URL: https://github.com/apache/spark/pull/28686#discussion_r434381576 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/catalog/interface.scala ## @@ -654,7 +654,8 @@ case class HiveTableRelation(

[GitHub] [spark] cloud-fan opened a new pull request #28717: [SPARK-29947][SQL][followup] ResolveRelations should return relations with fresh attribute IDs

2020-06-03 Thread GitBox
cloud-fan opened a new pull request #28717: URL: https://github.com/apache/spark/pull/28717 ### What changes were proposed in this pull request? This is a followup of https://github.com/apache/spark/pull/26589, which caches the table relations to speed up the table lookup.

[GitHub] [spark] uncleGen commented on a change in pull request #28391: [SPARK-31593][SS] Remove unnecessary streaming query progress update

2020-06-03 Thread GitBox
uncleGen commented on a change in pull request #28391: URL: https://github.com/apache/spark/pull/28391#discussion_r434380758 ## File path: sql/core/src/test/scala/org/apache/spark/sql/streaming/StreamingQueryListenerSuite.scala ## @@ -433,9 +433,13 @@ class

[GitHub] [spark] cloud-fan commented on a change in pull request #28575: [SPARK-31705][SQL] Push predicate through join by rewriting join condition to conjunctive normal form

2020-06-03 Thread GitBox
cloud-fan commented on a change in pull request #28575: URL: https://github.com/apache/spark/pull/28575#discussion_r434399611 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/Optimizer.scala ## @@ -1372,6 +1376,108 @@ object

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28704: [SPARK-31777][ML][PySpark] Add user-specified fold column to CrossValidator

2020-06-03 Thread GitBox
AmplabJenkins removed a comment on pull request #28704: URL: https://github.com/apache/spark/pull/28704#issuecomment-638068044 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To

[GitHub] [spark] AmplabJenkins commented on pull request #28704: [SPARK-31777][ML][PySpark] Add user-specified fold column to CrossValidator

2020-06-03 Thread GitBox
AmplabJenkins commented on pull request #28704: URL: https://github.com/apache/spark/pull/28704#issuecomment-638068044 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] wangyum commented on a change in pull request #28575: [SPARK-31705][SQL] Push predicate through join by rewriting join condition to conjunctive normal form

2020-06-03 Thread GitBox
wangyum commented on a change in pull request #28575: URL: https://github.com/apache/spark/pull/28575#discussion_r434458943 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/Optimizer.scala ## @@ -1372,6 +1376,108 @@ object

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28713: [SPARK-31896][SQL] Handle am-pm timestamp parsing when hour is missing

2020-06-03 Thread GitBox
AmplabJenkins removed a comment on pull request #28713: URL: https://github.com/apache/spark/pull/28713#issuecomment-638105470 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] SparkQA commented on pull request #28575: [SPARK-31705][SQL] Push predicate through join by rewriting join condition to conjunctive normal form

2020-06-03 Thread GitBox
SparkQA commented on pull request #28575: URL: https://github.com/apache/spark/pull/28575#issuecomment-638114251 **[Test build #123483 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/123483/testReport)** for PR 28575 at commit

[GitHub] [spark] juliuszsompolski commented on a change in pull request #28705: [SPARK-30808][SQL] Enable Java 8 time API in Thrift server

2020-06-03 Thread GitBox
juliuszsompolski commented on a change in pull request #28705: URL: https://github.com/apache/spark/pull/28705#discussion_r434480076 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/HiveResult.scala ## @@ -37,30 +37,45 @@ object HiveResult { * Returns

[GitHub] [spark] rednaxelafx commented on a change in pull request #28707: [SPARK-31894][SS] Introduce UnsafeRow format validation for streaming state store

2020-06-03 Thread GitBox
rednaxelafx commented on a change in pull request #28707: URL: https://github.com/apache/spark/pull/28707#discussion_r434457124 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/StreamingAggregationStateManager.scala ## @@ -94,6 +110,28 @@

[GitHub] [spark] cloud-fan edited a comment on pull request #28692: [SPARK-31879][SQL] Using GB as default Locale for datetime formatters

2020-06-03 Thread GitBox
cloud-fan edited a comment on pull request #28692: URL: https://github.com/apache/spark/pull/28692#issuecomment-638147190 > please also create a PR to explain what gets changed in the meaning of u in the migration guide. @bart-samwel maybe you were confused by this. The meaning was

[GitHub] [spark] cloud-fan commented on pull request #28687: [SPARK-31878][SQL] Create date formatter only once in `HiveResult`

2020-06-03 Thread GitBox
cloud-fan commented on pull request #28687: URL: https://github.com/apache/spark/pull/28687#issuecomment-638160456 fixed. This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] maropu commented on a change in pull request #28490: [SPARK-31670][SQL]Struct Field in groupByExpr with CUBE

2020-06-03 Thread GitBox
maropu commented on a change in pull request #28490: URL: https://github.com/apache/spark/pull/28490#discussion_r434366090 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/Analyzer.scala ## @@ -506,31 +506,55 @@ class Analyzer(

[GitHub] [spark] maropu commented on a change in pull request #28490: [SPARK-31670][SQL]Struct Field in groupByExpr with CUBE

2020-06-03 Thread GitBox
maropu commented on a change in pull request #28490: URL: https://github.com/apache/spark/pull/28490#discussion_r434366334 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/Analyzer.scala ## @@ -506,31 +506,55 @@ class Analyzer(

[GitHub] [spark] gengliangwang closed pull request #28694: [SPARK-31886][WEBUI] Fix the wrong coloring of nodes in DAG-viz

2020-06-03 Thread GitBox
gengliangwang closed pull request #28694: URL: https://github.com/apache/spark/pull/28694 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28704: [SPARK-31777][ML][PySpark] Add user-specified fold column to CrossValidator

2020-06-03 Thread GitBox
AmplabJenkins removed a comment on pull request #28704: URL: https://github.com/apache/spark/pull/28704#issuecomment-638068063 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28713: [SPARK-31896][SQL] Handle am-pm timestamp parsing when hour is missing

2020-06-03 Thread GitBox
AmplabJenkins removed a comment on pull request #28713: URL: https://github.com/apache/spark/pull/28713#issuecomment-638091920 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28713: [SPARK-31896][SQL] Handle am-pm timestamp parsing when hour is missing

2020-06-03 Thread GitBox
AmplabJenkins removed a comment on pull request #28713: URL: https://github.com/apache/spark/pull/28713#issuecomment-638091912 Build finished. Test FAILed. This is an automated message from the Apache Git Service. To respond

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28713: [SPARK-31896][SQL] Handle am-pm timestamp parsing when hour is missing

2020-06-03 Thread GitBox
AmplabJenkins removed a comment on pull request #28713: URL: https://github.com/apache/spark/pull/28713#issuecomment-638098118 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] juliuszsompolski commented on a change in pull request #28705: [SPARK-30808][SQL] Enable Java 8 time API in Thrift server

2020-06-03 Thread GitBox
juliuszsompolski commented on a change in pull request #28705: URL: https://github.com/apache/spark/pull/28705#discussion_r434508113 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/HiveResult.scala ## @@ -37,30 +37,45 @@ object HiveResult { * Returns

[GitHub] [spark] maropu commented on a change in pull request #28490: [SPARK-31670][SQL]Struct Field in groupByExpr with CUBE

2020-06-03 Thread GitBox
maropu commented on a change in pull request #28490: URL: https://github.com/apache/spark/pull/28490#discussion_r434369978 ## File path: sql/core/src/test/scala/org/apache/spark/sql/SQLQuerySuite.scala ## @@ -3495,6 +3495,59 @@ class SQLQuerySuite extends QueryTest with

[GitHub] [spark] maropu commented on a change in pull request #28490: [SPARK-31670][SQL]Struct Field in groupByExpr with CUBE

2020-06-03 Thread GitBox
maropu commented on a change in pull request #28490: URL: https://github.com/apache/spark/pull/28490#discussion_r434366090 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/Analyzer.scala ## @@ -506,31 +506,55 @@ class Analyzer(

[GitHub] [spark] gengliangwang commented on pull request #28694: [SPARK-31886][WEBUI] Fix the wrong coloring of nodes in DAG-viz

2020-06-03 Thread GitBox
gengliangwang commented on pull request #28694: URL: https://github.com/apache/spark/pull/28694#issuecomment-638038924 Thanks, merging to master/3.0 This is an automated message from the Apache Git Service. To respond to the

[GitHub] [spark] xuanyuanking commented on a change in pull request #28707: [SPARK-31894][SS] Introduce UnsafeRow format validation for streaming state store

2020-06-03 Thread GitBox
xuanyuanking commented on a change in pull request #28707: URL: https://github.com/apache/spark/pull/28707#discussion_r434420491 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala ## @@ -1545,6 +1545,15 @@ object SQLConf {

[GitHub] [spark] SparkQA commented on pull request #28704: [SPARK-31777][ML][PySpark] Add user-specified fold column to CrossValidator

2020-06-03 Thread GitBox
SparkQA commented on pull request #28704: URL: https://github.com/apache/spark/pull/28704#issuecomment-638067129 **[Test build #123470 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/123470/testReport)** for PR 28704 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #28704: [SPARK-31777][ML][PySpark] Add user-specified fold column to CrossValidator

2020-06-03 Thread GitBox
SparkQA removed a comment on pull request #28704: URL: https://github.com/apache/spark/pull/28704#issuecomment-638021723 **[Test build #123470 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/123470/testReport)** for PR 28704 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28718: [SPARK-31892][SQL][FOLLOWUP] Improve test coverage for valid pattern …

2020-06-03 Thread GitBox
AmplabJenkins removed a comment on pull request #28718: URL: https://github.com/apache/spark/pull/28718#issuecomment-638077191 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28645: [SPARK-31826][SQL] Support composed type of case class for typed Scala UDF

2020-06-03 Thread GitBox
AmplabJenkins removed a comment on pull request #28645: URL: https://github.com/apache/spark/pull/28645#issuecomment-638077216 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #28707: [SPARK-31894][SS] Introduce UnsafeRow format validation for streaming state store

2020-06-03 Thread GitBox
AmplabJenkins commented on pull request #28707: URL: https://github.com/apache/spark/pull/28707#issuecomment-638077280 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins commented on pull request #27685: [SPARK-30940][SQL] Remove attributeId in auto-generated arguments when Explain SQL query

2020-06-03 Thread GitBox
AmplabJenkins commented on pull request #27685: URL: https://github.com/apache/spark/pull/27685#issuecomment-638077321 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins removed a comment on pull request #27685: [SPARK-30940][SQL] Remove attributeId in auto-generated arguments when Explain SQL query

2020-06-03 Thread GitBox
AmplabJenkins removed a comment on pull request #27685: URL: https://github.com/apache/spark/pull/27685#issuecomment-638077321 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28707: [SPARK-31894][SS] Introduce UnsafeRow format validation for streaming state store

2020-06-03 Thread GitBox
AmplabJenkins removed a comment on pull request #28707: URL: https://github.com/apache/spark/pull/28707#issuecomment-638077280 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #28645: [SPARK-31826][SQL] Support composed type of case class for typed Scala UDF

2020-06-03 Thread GitBox
AmplabJenkins commented on pull request #28645: URL: https://github.com/apache/spark/pull/28645#issuecomment-638077216 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins commented on pull request #28718: [SPARK-31892][SQL][FOLLOWUP] Improve test coverage for valid pattern …

2020-06-03 Thread GitBox
AmplabJenkins commented on pull request #28718: URL: https://github.com/apache/spark/pull/28718#issuecomment-638077191 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA removed a comment on pull request #28716: [SPARK-31881][K8S][TESTS][FOLLOWUP] Activate hadoop-2.7 by default in K8S IT

2020-06-03 Thread GitBox
SparkQA removed a comment on pull request #28716: URL: https://github.com/apache/spark/pull/28716#issuecomment-638002012 **[Test build #123466 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/123466/testReport)** for PR 28716 at commit

[GitHub] [spark] SparkQA commented on pull request #28718: [SPARK-31892][SQL][FOLLOWUP] Improve test coverage for valid pattern …

2020-06-03 Thread GitBox
SparkQA commented on pull request #28718: URL: https://github.com/apache/spark/pull/28718#issuecomment-638100797 **[Test build #123482 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/123482/testReport)** for PR 28718 at commit

[GitHub] [spark] SparkQA commented on pull request #28716: [SPARK-31881][K8S][TESTS][FOLLOWUP] Activate hadoop-2.7 by default in K8S IT

2020-06-03 Thread GitBox
SparkQA commented on pull request #28716: URL: https://github.com/apache/spark/pull/28716#issuecomment-638100839 **[Test build #123466 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/123466/testReport)** for PR 28716 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28718: [SPARK-31892][SQL][FOLLOWUP] Improve test coverage for valid pattern …

2020-06-03 Thread GitBox
AmplabJenkins removed a comment on pull request #28718: URL: https://github.com/apache/spark/pull/28718#issuecomment-638101322 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] yaooqinn commented on a change in pull request #28718: [SPARK-31892][SQL][FOLLOWUP] Improve test coverage for valid pattern …

2020-06-03 Thread GitBox
yaooqinn commented on a change in pull request #28718: URL: https://github.com/apache/spark/pull/28718#discussion_r434468970 ## File path: sql/core/src/test/resources/sql-tests/results/datetime.sql.out ## @@ -1013,3 +1013,110 @@ select date_format('2020-01-01', '-MM-dd

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28575: [SPARK-31705][SQL] Push predicate through join by rewriting join condition to conjunctive normal form

2020-06-03 Thread GitBox
AmplabJenkins removed a comment on pull request #28575: URL: https://github.com/apache/spark/pull/28575#issuecomment-638114794 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #28575: [SPARK-31705][SQL] Push predicate through join by rewriting join condition to conjunctive normal form

2020-06-03 Thread GitBox
AmplabJenkins commented on pull request #28575: URL: https://github.com/apache/spark/pull/28575#issuecomment-638114794 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] MaxGekk commented on pull request #28709: [WIP][SQL] Make legacy date formatters independent from the default JVM time zone

2020-06-03 Thread GitBox
MaxGekk commented on pull request #28709: URL: https://github.com/apache/spark/pull/28709#issuecomment-638123922 @cloud-fan @HyukjinKwon The changes can lead to inconsistent behaviour when JVM and session time zones are different: ```scala $ export TZ="Europe/Moscow" $

[GitHub] [spark] MaxGekk commented on a change in pull request #28705: [SPARK-30808][SQL] Enable Java 8 time API in Thrift server

2020-06-03 Thread GitBox
MaxGekk commented on a change in pull request #28705: URL: https://github.com/apache/spark/pull/28705#discussion_r434513532 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/HiveResult.scala ## @@ -37,30 +37,45 @@ object HiveResult { * Returns the

[GitHub] [spark] cloud-fan commented on pull request #28687: [SPARK-31878][SQL] Create date formatter only once in `HiveResult`

2020-06-03 Thread GitBox
cloud-fan commented on pull request #28687: URL: https://github.com/apache/spark/pull/28687#issuecomment-638150723 Spark 2.4 also creates date formatter only once, we need to backport it to fix perf regression. Since the last commit just updates the comment, we don't need to wait

[GitHub] [spark] SparkQA commented on pull request #28687: [SPARK-31878][SQL] Create date formatter only once in `HiveResult`

2020-06-03 Thread GitBox
SparkQA commented on pull request #28687: URL: https://github.com/apache/spark/pull/28687#issuecomment-638150770 **[Test build #123484 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/123484/testReport)** for PR 28687 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #28687: [SPARK-31878][SQL] Create date formatter only once in `HiveResult`

2020-06-03 Thread GitBox
AmplabJenkins commented on pull request #28687: URL: https://github.com/apache/spark/pull/28687#issuecomment-638150807 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA removed a comment on pull request #28687: [SPARK-31878][SQL] Create date formatter only once in `HiveResult`

2020-06-03 Thread GitBox
SparkQA removed a comment on pull request #28687: URL: https://github.com/apache/spark/pull/28687#issuecomment-638146646 **[Test build #123484 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/123484/testReport)** for PR 28687 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28687: [SPARK-31878][SQL] Create date formatter only once in `HiveResult`

2020-06-03 Thread GitBox
AmplabJenkins removed a comment on pull request #28687: URL: https://github.com/apache/spark/pull/28687#issuecomment-638150807 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To

[GitHub] [spark] dongjoon-hyun commented on pull request #28585: [SPARK-31765][WEBUI][test-maven] Upgrade HtmlUnit >= 2.37.0

2020-06-03 Thread GitBox
dongjoon-hyun commented on pull request #28585: URL: https://github.com/apache/spark/pull/28585#issuecomment-638015764 Sounds reasonable. Thanks! This is an automated message from the Apache Git Service. To respond to the

[GitHub] [spark] sarutak edited a comment on pull request #28585: [SPARK-31765][WEBUI][test-maven] Upgrade HtmlUnit >= 2.37.0

2020-06-03 Thread GitBox
sarutak edited a comment on pull request #28585: URL: https://github.com/apache/spark/pull/28585#issuecomment-638014904 @dongjoon-hyun @HyukjinKwon The reason of the failure seems that `bytebuddy` which upgraded-selenium depends on is much older than `mockito` depends on. I've excluded

[GitHub] [spark] maropu commented on a change in pull request #28490: [SPARK-31670][SQL]Struct Field in groupByExpr with CUBE

2020-06-03 Thread GitBox
maropu commented on a change in pull request #28490: URL: https://github.com/apache/spark/pull/28490#discussion_r434371772 ## File path: sql/core/src/test/scala/org/apache/spark/sql/SQLQuerySuite.scala ## @@ -3495,6 +3495,59 @@ class SQLQuerySuite extends QueryTest with

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28704: [SPARK-31777][ML][PySpark] Add user-specified fold column to CrossValidator

2020-06-03 Thread GitBox
AmplabJenkins removed a comment on pull request #28704: URL: https://github.com/apache/spark/pull/28704#issuecomment-638022253 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] maropu commented on a change in pull request #28490: [SPARK-31670][SQL]Struct Field in groupByExpr with CUBE

2020-06-03 Thread GitBox
maropu commented on a change in pull request #28490: URL: https://github.com/apache/spark/pull/28490#discussion_r434372717 ## File path: sql/core/src/test/scala/org/apache/spark/sql/SQLQuerySuite.scala ## @@ -3495,6 +3495,59 @@ class SQLQuerySuite extends QueryTest with

[GitHub] [spark] yaooqinn closed pull request #28673: [SPARK-31867][SQL] Fix silent data change for datetime formatting

2020-06-03 Thread GitBox
yaooqinn closed pull request #28673: URL: https://github.com/apache/spark/pull/28673 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

[GitHub] [spark] xuanyuanking commented on a change in pull request #28707: [SPARK-31894][SS] Introduce UnsafeRow format validation for streaming state store

2020-06-03 Thread GitBox
xuanyuanking commented on a change in pull request #28707: URL: https://github.com/apache/spark/pull/28707#discussion_r434419336 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/StreamingAggregationStateManager.scala ## @@ -77,13 +82,24 @@

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28707: [SPARK-31894][SS] Introduce UnsafeRow format validation for streaming state store

2020-06-03 Thread GitBox
AmplabJenkins removed a comment on pull request #28707: URL: https://github.com/apache/spark/pull/28707#issuecomment-638065366 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #28707: [SPARK-31894][SS] Introduce UnsafeRow format validation for streaming state store

2020-06-03 Thread GitBox
AmplabJenkins commented on pull request #28707: URL: https://github.com/apache/spark/pull/28707#issuecomment-638065366 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] MaxGekk commented on a change in pull request #28705: [SPARK-30808][SQL] Enable Java 8 time API in Thrift server

2020-06-03 Thread GitBox
MaxGekk commented on a change in pull request #28705: URL: https://github.com/apache/spark/pull/28705#discussion_r434443354 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/HiveResult.scala ## @@ -37,30 +37,45 @@ object HiveResult { * Returns the

[GitHub] [spark] SparkQA commented on pull request #28718: [SPARK-31892][SQL][FOLLOWUP] Improve test coverage for valid pattern …

2020-06-03 Thread GitBox
SparkQA commented on pull request #28718: URL: https://github.com/apache/spark/pull/28718#issuecomment-638087232 **[Test build #123480 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/123480/testReport)** for PR 28718 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #28718: [SPARK-31892][SQL][FOLLOWUP] Improve test coverage for valid pattern …

2020-06-03 Thread GitBox
AmplabJenkins commented on pull request #28718: URL: https://github.com/apache/spark/pull/28718#issuecomment-638087801 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] yaooqinn commented on pull request #28713: [SPARK-31896][SQL] Handle am-pm timestamp parsing when hour is missing

2020-06-03 Thread GitBox
yaooqinn commented on pull request #28713: URL: https://github.com/apache/spark/pull/28713#issuecomment-638097083 retest this please This is an automated message from the Apache Git Service. To respond to the message, please

[GitHub] [spark] AmplabJenkins commented on pull request #28707: [SPARK-31894][SS] Introduce UnsafeRow format validation for streaming state store

2020-06-03 Thread GitBox
AmplabJenkins commented on pull request #28707: URL: https://github.com/apache/spark/pull/28707#issuecomment-638119689 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA removed a comment on pull request #28707: [SPARK-31894][SS] Introduce UnsafeRow format validation for streaming state store

2020-06-03 Thread GitBox
SparkQA removed a comment on pull request #28707: URL: https://github.com/apache/spark/pull/28707#issuecomment-638076637 **[Test build #123477 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/123477/testReport)** for PR 28707 at commit

[GitHub] [spark] SparkQA commented on pull request #28707: [SPARK-31894][SS] Introduce UnsafeRow format validation for streaming state store

2020-06-03 Thread GitBox
SparkQA commented on pull request #28707: URL: https://github.com/apache/spark/pull/28707#issuecomment-638119541 **[Test build #123477 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/123477/testReport)** for PR 28707 at commit

[GitHub] [spark] cloud-fan commented on pull request #28692: [SPARK-31879][SQL] Using GB as default Locale for datetime formatters

2020-06-03 Thread GitBox
cloud-fan commented on pull request #28692: URL: https://github.com/apache/spark/pull/28692#issuecomment-638139165 As I mentioned in: https://github.com/apache/spark/pull/28692#issuecomment-637226576 There are only 2 localized pattern letter: `e: localized day-of-week` and `O:

[GitHub] [spark] cloud-fan closed pull request #28687: [SPARK-31878][SQL] Create date formatter only once in `HiveResult`

2020-06-03 Thread GitBox
cloud-fan closed pull request #28687: URL: https://github.com/apache/spark/pull/28687 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28687: [SPARK-31878][SQL] Create date formatter only once in `HiveResult`

2020-06-03 Thread GitBox
AmplabJenkins removed a comment on pull request #28687: URL: https://github.com/apache/spark/pull/28687#issuecomment-638150813 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] viirya commented on pull request #28704: [SPARK-31777][ML][PySpark] Add user-specified fold column to CrossValidator

2020-06-03 Thread GitBox
viirya commented on pull request #28704: URL: https://github.com/apache/spark/pull/28704#issuecomment-638020051 retest this please This is an automated message from the Apache Git Service. To respond to the message, please

[GitHub] [spark] cloud-fan commented on pull request #28717: [SPARK-29947][SQL][followup] ResolveRelations should return relations with fresh attribute IDs

2020-06-03 Thread GitBox
cloud-fan commented on pull request #28717: URL: https://github.com/apache/spark/pull/28717#issuecomment-638030909 @wangyum This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] SparkQA commented on pull request #28716: [SPARK-31881][K8S][TESTS][FOLLOWUP] Activate hadoop-2.7 by default in K8S IT

2020-06-03 Thread GitBox
SparkQA commented on pull request #28716: URL: https://github.com/apache/spark/pull/28716#issuecomment-638030508 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/28086/

[GitHub] [spark] dongjoon-hyun commented on pull request #28716: [SPARK-31881][K8S][TESTS][FOLLOWUP] Activate hadoop-2.7 by default in K8S IT

2020-06-03 Thread GitBox
dongjoon-hyun commented on pull request #28716: URL: https://github.com/apache/spark/pull/28716#issuecomment-638029939 cc @holdenk since she informed me this regression. This is an automated message from the Apache Git

[GitHub] [spark] AmplabJenkins commented on pull request #28391: [SPARK-31593][SS] Remove unnecessary streaming query progress update

2020-06-03 Thread GitBox
AmplabJenkins commented on pull request #28391: URL: https://github.com/apache/spark/pull/28391#issuecomment-638029901 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28391: [SPARK-31593][SS] Remove unnecessary streaming query progress update

2020-06-03 Thread GitBox
AmplabJenkins removed a comment on pull request #28391: URL: https://github.com/apache/spark/pull/28391#issuecomment-638029901 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28618: [SPARK-31801][WIP][API][SHUFFLE] Register map output metadata

2020-06-03 Thread GitBox
AmplabJenkins removed a comment on pull request #28618: URL: https://github.com/apache/spark/pull/28618#issuecomment-638045051 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #28618: [SPARK-31801][WIP][API][SHUFFLE] Register map output metadata

2020-06-03 Thread GitBox
AmplabJenkins commented on pull request #28618: URL: https://github.com/apache/spark/pull/28618#issuecomment-638045051 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] xuanyuanking commented on a change in pull request #28707: [SPARK-31894][SS] Introduce UnsafeRow format validation for streaming state store

2020-06-03 Thread GitBox
xuanyuanking commented on a change in pull request #28707: URL: https://github.com/apache/spark/pull/28707#discussion_r434418012 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/StreamingAggregationStateManager.scala ## @@ -94,6 +110,28 @@

[GitHub] [spark] SparkQA commented on pull request #28707: [SPARK-31894][SS] Introduce UnsafeRow format validation for streaming state store

2020-06-03 Thread GitBox
SparkQA commented on pull request #28707: URL: https://github.com/apache/spark/pull/28707#issuecomment-638064669 **[Test build #123475 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/123475/testReport)** for PR 28707 at commit

[GitHub] [spark] SparkQA commented on pull request #28687: [SPARK-31878][SQL] Create date formatter only once in `HiveResult`

2020-06-03 Thread GitBox
SparkQA commented on pull request #28687: URL: https://github.com/apache/spark/pull/28687#issuecomment-638146646 **[Test build #123484 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/123484/testReport)** for PR 28687 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28687: [SPARK-31878][SQL] Create date formatter only once in `HiveResult`

2020-06-03 Thread GitBox
AmplabJenkins removed a comment on pull request #28687: URL: https://github.com/apache/spark/pull/28687#issuecomment-638147120 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] cloud-fan edited a comment on pull request #28692: [SPARK-31879][SQL] Using GB as default Locale for datetime formatters

2020-06-03 Thread GitBox
cloud-fan edited a comment on pull request #28692: URL: https://github.com/apache/spark/pull/28692#issuecomment-638147190 > please also create a PR to explain what gets changed in the meaning of u in the migration guide. @bart-samwel maybe you were confused by this. The meaning was

[GitHub] [spark] AmplabJenkins commented on pull request #28687: [SPARK-31878][SQL] Create date formatter only once in `HiveResult`

2020-06-03 Thread GitBox
AmplabJenkins commented on pull request #28687: URL: https://github.com/apache/spark/pull/28687#issuecomment-638147120 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] cloud-fan commented on pull request #28687: [SPARK-31878][SQL] Create date formatter only once in `HiveResult`

2020-06-03 Thread GitBox
cloud-fan commented on pull request #28687: URL: https://github.com/apache/spark/pull/28687#issuecomment-638152357 it has a logical conflict with https://github.com/apache/spark/pull/28706, I'm fixing it This is an

[GitHub] [spark] maropu commented on a change in pull request #28490: [SPARK-31670][SQL]Struct Field in groupByExpr with CUBE

2020-06-03 Thread GitBox
maropu commented on a change in pull request #28490: URL: https://github.com/apache/spark/pull/28490#discussion_r434371366 ## File path: sql/core/src/test/scala/org/apache/spark/sql/SQLQuerySuite.scala ## @@ -3495,6 +3495,59 @@ class SQLQuerySuite extends QueryTest with

[GitHub] [spark] AmplabJenkins commented on pull request #28704: [SPARK-31777][ML][PySpark] Add user-specified fold column to CrossValidator

2020-06-03 Thread GitBox
AmplabJenkins commented on pull request #28704: URL: https://github.com/apache/spark/pull/28704#issuecomment-638022253 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #28704: [SPARK-31777][ML][PySpark] Add user-specified fold column to CrossValidator

2020-06-03 Thread GitBox
SparkQA commented on pull request #28704: URL: https://github.com/apache/spark/pull/28704#issuecomment-638021723 **[Test build #123470 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/123470/testReport)** for PR 28704 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #28497: [SPARK-31677][SS] Use kvstore to cache stream query progress

2020-06-03 Thread GitBox
AmplabJenkins commented on pull request #28497: URL: https://github.com/apache/spark/pull/28497#issuecomment-638051935 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28497: [SPARK-31677][SS] Use kvstore to cache stream query progress

2020-06-03 Thread GitBox
AmplabJenkins removed a comment on pull request #28497: URL: https://github.com/apache/spark/pull/28497#issuecomment-638051935 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] SparkQA commented on pull request #28497: [SPARK-31677][SS] Use kvstore to cache stream query progress

2020-06-03 Thread GitBox
SparkQA commented on pull request #28497: URL: https://github.com/apache/spark/pull/28497#issuecomment-638051393 **[Test build #123474 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/123474/testReport)** for PR 28497 at commit

[GitHub] [spark] ulysses-you commented on a change in pull request #28647: [SPARK-31828][SQL] Retain table properties at CreateTableLikeCommand

2020-06-03 Thread GitBox
ulysses-you commented on a change in pull request #28647: URL: https://github.com/apache/spark/pull/28647#discussion_r434403470 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/command/ddl.scala ## @@ -839,6 +839,19 @@ case class

[GitHub] [spark] dongjoon-hyun commented on pull request #28716: [SPARK-31881][K8S][TESTS][FOLLOWUP] Activate hadoop-2.7 by default in K8S IT

2020-06-03 Thread GitBox
dongjoon-hyun commented on pull request #28716: URL: https://github.com/apache/spark/pull/28716#issuecomment-638052650 Hi, @HyukjinKwon . Could you review this PR? This is an automated message from the Apache Git Service. To

[GitHub] [spark] cloud-fan commented on a change in pull request #28575: [SPARK-31705][SQL] Push predicate through join by rewriting join condition to conjunctive normal form

2020-06-03 Thread GitBox
cloud-fan commented on a change in pull request #28575: URL: https://github.com/apache/spark/pull/28575#discussion_r434425950 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/Optimizer.scala ## @@ -1372,6 +1376,108 @@ object

[GitHub] [spark] gengliangwang commented on a change in pull request #28575: [SPARK-31705][SQL] Push predicate through join by rewriting join condition to conjunctive normal form

2020-06-03 Thread GitBox
gengliangwang commented on a change in pull request #28575: URL: https://github.com/apache/spark/pull/28575#discussion_r434424853 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/Optimizer.scala ## @@ -1372,6 +1376,108 @@ object

[GitHub] [spark] xuanyuanking commented on pull request #28707: [SPARK-31894][SS] Introduce UnsafeRow format validation for streaming state store

2020-06-03 Thread GitBox
xuanyuanking commented on pull request #28707: URL: https://github.com/apache/spark/pull/28707#issuecomment-638072075 > @skambha it doesn't fix the issue, it gives a better error message when we hit the issue. Yep, I WIP for the integrated test of the state store format

[GitHub] [spark] dongjoon-hyun commented on pull request #28716: [SPARK-31881][K8S][TESTS][FOLLOWUP] Activate hadoop-2.7 by default in K8S IT

2020-06-03 Thread GitBox
dongjoon-hyun commented on pull request #28716: URL: https://github.com/apache/spark/pull/28716#issuecomment-638072809 Thank you, @HyukjinKwon . Merged to master/3.0. This is an automated message from the Apache Git Service.

[GitHub] [spark] dongjoon-hyun closed pull request #28716: [SPARK-31881][K8S][TESTS][FOLLOWUP] Activate hadoop-2.7 by default in K8S IT

2020-06-03 Thread GitBox
dongjoon-hyun closed pull request #28716: URL: https://github.com/apache/spark/pull/28716 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[GitHub] [spark] AmplabJenkins commented on pull request #28718: [SPARK-31892][SQL][FOLLOWUP] Improve test coverage for valid pattern …

2020-06-03 Thread GitBox
AmplabJenkins commented on pull request #28718: URL: https://github.com/apache/spark/pull/28718#issuecomment-638101322 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins commented on pull request #28716: [SPARK-31881][K8S][TESTS][FOLLOWUP] Activate hadoop-2.7 by default in K8S IT

2020-06-03 Thread GitBox
AmplabJenkins commented on pull request #28716: URL: https://github.com/apache/spark/pull/28716#issuecomment-638102073 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28716: [SPARK-31881][K8S][TESTS][FOLLOWUP] Activate hadoop-2.7 by default in K8S IT

2020-06-03 Thread GitBox
AmplabJenkins removed a comment on pull request #28716: URL: https://github.com/apache/spark/pull/28716#issuecomment-638102073 This is an automated message from the Apache Git Service. To respond to the message, please log on

  1   2   3   4   5   6   7   >