[GitHub] [spark] SparkQA commented on pull request #30138: [SPARK-33075][SQL] Enable auto bucketed scan by default (disable only for cached query)

2020-10-23 Thread GitBox
SparkQA commented on pull request #30138: URL: https://github.com/apache/spark/pull/30138#issuecomment-715749893 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34828/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30144: [SPARK-33229][SQL]Support GROUP BY use Separate columns and CUBE/ROLLUP

2020-10-23 Thread GitBox
AmplabJenkins removed a comment on pull request #30144: URL: https://github.com/apache/spark/pull/30144#issuecomment-715749524 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #30144: [SPARK-33229][SQL]Support GROUP BY use Separate columns and CUBE/ROLLUP

2020-10-23 Thread GitBox
AmplabJenkins commented on pull request #30144: URL: https://github.com/apache/spark/pull/30144#issuecomment-715749524 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #30144: [SPARK-33229][SQL]Support GROUP BY use Separate columns and CUBE/ROLLUP

2020-10-23 Thread GitBox
SparkQA commented on pull request #30144: URL: https://github.com/apache/spark/pull/30144#issuecomment-715749485 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34827/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #27333: [SPARK-29438][SS][FOLLOWUP] Add regression tests for Streaming Aggregation and flatMapGroupsWithState

2020-10-23 Thread GitBox
AmplabJenkins removed a comment on pull request #27333: URL: https://github.com/apache/spark/pull/27333#issuecomment-715740603 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] AmplabJenkins commented on pull request #27333: [SPARK-29438][SS][FOLLOWUP] Add regression tests for Streaming Aggregation and flatMapGroupsWithState

2020-10-23 Thread GitBox
AmplabJenkins commented on pull request #27333: URL: https://github.com/apache/spark/pull/27333#issuecomment-715740574 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins removed a comment on pull request #27333: [SPARK-29438][SS][FOLLOWUP] Add regression tests for Streaming Aggregation and flatMapGroupsWithState

2020-10-23 Thread GitBox
AmplabJenkins removed a comment on pull request #27333: URL: https://github.com/apache/spark/pull/27333#issuecomment-715740574 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To

[GitHub] [spark] SparkQA removed a comment on pull request #27333: [SPARK-29438][SS][FOLLOWUP] Add regression tests for Streaming Aggregation and flatMapGroupsWithState

2020-10-23 Thread GitBox
SparkQA removed a comment on pull request #27333: URL: https://github.com/apache/spark/pull/27333#issuecomment-715661004 **[Test build #130222 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130222/testReport)** for PR 27333 at commit

[GitHub] [spark] SparkQA commented on pull request #27333: [SPARK-29438][SS][FOLLOWUP] Add regression tests for Streaming Aggregation and flatMapGroupsWithState

2020-10-23 Thread GitBox
SparkQA commented on pull request #27333: URL: https://github.com/apache/spark/pull/27333#issuecomment-715739957 **[Test build #130222 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130222/testReport)** for PR 27333 at commit

[GitHub] [spark] SparkQA commented on pull request #30144: [SPARK-33229][SQL]Support GROUP BY use Separate columns and CUBE/ROLLUP

2020-10-23 Thread GitBox
SparkQA commented on pull request #30144: URL: https://github.com/apache/spark/pull/30144#issuecomment-715728829 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34827/

[GitHub] [spark] SparkQA commented on pull request #30138: [SPARK-33075][SQL] Enable auto bucketed scan by default (disable only for cached query)

2020-10-23 Thread GitBox
SparkQA commented on pull request #30138: URL: https://github.com/apache/spark/pull/30138#issuecomment-715710242 **[Test build #130228 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130228/testReport)** for PR 30138 at commit

[GitHub] [spark] c21 commented on a change in pull request #30138: [SPARK-33075][SQL] Enable auto bucketed scan by default (disable only for cached query)

2020-10-23 Thread GitBox
c21 commented on a change in pull request #30138: URL: https://github.com/apache/spark/pull/30138#discussion_r511321724 ## File path: sql/core/src/main/scala/org/apache/spark/sql/SparkSession.scala ## @@ -1077,6 +1077,27 @@ object SparkSession extends Logging { throw

[GitHub] [spark] viirya commented on a change in pull request #30138: [SPARK-33075][SQL] Enable auto bucketed scan by default (disable only for cached query)

2020-10-23 Thread GitBox
viirya commented on a change in pull request #30138: URL: https://github.com/apache/spark/pull/30138#discussion_r511321486 ## File path: sql/core/src/main/scala/org/apache/spark/sql/SparkSession.scala ## @@ -1077,6 +1077,27 @@ object SparkSession extends Logging { throw

[GitHub] [spark] viirya commented on a change in pull request #30138: [SPARK-33075][SQL] Enable auto bucketed scan by default (disable only for cached query)

2020-10-23 Thread GitBox
viirya commented on a change in pull request #30138: URL: https://github.com/apache/spark/pull/30138#discussion_r511321486 ## File path: sql/core/src/main/scala/org/apache/spark/sql/SparkSession.scala ## @@ -1077,6 +1077,27 @@ object SparkSession extends Logging { throw

[GitHub] [spark] SparkQA commented on pull request #30144: [SPARK-33229][SQL]Support GROUP BY use Separate columns and CUBE/ROLLUP

2020-10-23 Thread GitBox
SparkQA commented on pull request #30144: URL: https://github.com/apache/spark/pull/30144#issuecomment-715680465 **[Test build #130227 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130227/testReport)** for PR 30144 at commit

[GitHub] [spark] c21 commented on a change in pull request #30138: [SPARK-33075][SQL] Enable auto bucketed scan by default (disable only for cached query)

2020-10-23 Thread GitBox
c21 commented on a change in pull request #30138: URL: https://github.com/apache/spark/pull/30138#discussion_r511319323 ## File path: sql/core/src/main/scala/org/apache/spark/sql/SparkSession.scala ## @@ -1077,6 +1077,27 @@ object SparkSession extends Logging { throw

[GitHub] [spark] AngersZhuuuu opened a new pull request #30144: [SPARK-33229][SQL]Support GROUP BY use Separate columns and CUBE/ROLLUP

2020-10-23 Thread GitBox
AngersZh opened a new pull request #30144: URL: https://github.com/apache/spark/pull/30144 ### What changes were proposed in this pull request? ### Why are the changes needed? Support more SQL scenario. ### Does this PR introduce _any_ user-facing change?

[GitHub] [spark] viirya commented on a change in pull request #27333: [SPARK-29438][SS][FOLLOWUP] Add regression tests for Streaming Aggregation and flatMapGroupsWithState

2020-10-23 Thread GitBox
viirya commented on a change in pull request #27333: URL: https://github.com/apache/spark/pull/27333#discussion_r511309080 ## File path: sql/core/src/test/scala/org/apache/spark/sql/streaming/FlatMapGroupsWithStateSuite.scala ## @@ -1019,6 +1019,56 @@ class

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28363: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files

2020-10-23 Thread GitBox
AmplabJenkins removed a comment on pull request #28363: URL: https://github.com/apache/spark/pull/28363#issuecomment-715669520 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] SparkQA commented on pull request #28363: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files

2020-10-23 Thread GitBox
SparkQA commented on pull request #28363: URL: https://github.com/apache/spark/pull/28363#issuecomment-715669517 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34825/

[GitHub] [spark] AmplabJenkins commented on pull request #28363: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files

2020-10-23 Thread GitBox
AmplabJenkins commented on pull request #28363: URL: https://github.com/apache/spark/pull/28363#issuecomment-715669520 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins removed a comment on pull request #24173: [SPARK-27237][SS] Introduce State schema validation among query restart

2020-10-23 Thread GitBox
AmplabJenkins removed a comment on pull request #24173: URL: https://github.com/apache/spark/pull/24173#issuecomment-715668895 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29970: [SPARK-33087][SQL] DataFrameWriterV2 should delegate table resolution to the analyzer

2020-10-23 Thread GitBox
AmplabJenkins removed a comment on pull request #29970: URL: https://github.com/apache/spark/pull/29970#issuecomment-715668893 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To

[GitHub] [spark] AmplabJenkins commented on pull request #29970: [SPARK-33087][SQL] DataFrameWriterV2 should delegate table resolution to the analyzer

2020-10-23 Thread GitBox
AmplabJenkins commented on pull request #29970: URL: https://github.com/apache/spark/pull/29970#issuecomment-715668893 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins commented on pull request #24173: [SPARK-27237][SS] Introduce State schema validation among query restart

2020-10-23 Thread GitBox
AmplabJenkins commented on pull request #24173: URL: https://github.com/apache/spark/pull/24173#issuecomment-715668895 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #24173: [SPARK-27237][SS] Introduce State schema validation among query restart

2020-10-23 Thread GitBox
SparkQA commented on pull request #24173: URL: https://github.com/apache/spark/pull/24173#issuecomment-715668892 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34824/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28363: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files

2020-10-23 Thread GitBox
AmplabJenkins removed a comment on pull request #28363: URL: https://github.com/apache/spark/pull/28363#issuecomment-715668528 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] SparkQA commented on pull request #28363: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files

2020-10-23 Thread GitBox
SparkQA commented on pull request #28363: URL: https://github.com/apache/spark/pull/28363#issuecomment-715668522 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34821/

[GitHub] [spark] AmplabJenkins commented on pull request #28363: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files

2020-10-23 Thread GitBox
AmplabJenkins commented on pull request #28363: URL: https://github.com/apache/spark/pull/28363#issuecomment-715668528 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins removed a comment on pull request #27649: [SPARK-30900][SS] FileStreamSource: Avoid reading compact metadata log twice if the query restarts from compact batch

2020-10-23 Thread GitBox
AmplabJenkins removed a comment on pull request #27649: URL: https://github.com/apache/spark/pull/27649#issuecomment-715668308 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] HeartSaVioR commented on pull request #28841: [SPARK-31962][SQL] Provide modifiedAfter and modifiedBefore options when filtering from a batch-based file data source

2020-10-23 Thread GitBox
HeartSaVioR commented on pull request #28841: URL: https://github.com/apache/spark/pull/28841#issuecomment-715668416 I wouldn't request to an individual contributor doing the heavy work consistently - now this PR has nearly 300 comments. If the remaining comments are minors (not

[GitHub] [spark] AmplabJenkins commented on pull request #27649: [SPARK-30900][SS] FileStreamSource: Avoid reading compact metadata log twice if the query restarts from compact batch

2020-10-23 Thread GitBox
AmplabJenkins commented on pull request #27649: URL: https://github.com/apache/spark/pull/27649#issuecomment-715668308 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #27649: [SPARK-30900][SS] FileStreamSource: Avoid reading compact metadata log twice if the query restarts from compact batch

2020-10-23 Thread GitBox
SparkQA commented on pull request #27649: URL: https://github.com/apache/spark/pull/27649#issuecomment-715668305 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34822/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #27333: [SPARK-29438][SS][FOLLOWUP] Add regression tests for Streaming Aggregation and flatMapGroupsWithState

2020-10-23 Thread GitBox
AmplabJenkins removed a comment on pull request #27333: URL: https://github.com/apache/spark/pull/27333#issuecomment-715667921 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] SparkQA commented on pull request #27333: [SPARK-29438][SS][FOLLOWUP] Add regression tests for Streaming Aggregation and flatMapGroupsWithState

2020-10-23 Thread GitBox
SparkQA commented on pull request #27333: URL: https://github.com/apache/spark/pull/27333#issuecomment-715667918 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34823/

[GitHub] [spark] AmplabJenkins commented on pull request #27333: [SPARK-29438][SS][FOLLOWUP] Add regression tests for Streaming Aggregation and flatMapGroupsWithState

2020-10-23 Thread GitBox
AmplabJenkins commented on pull request #27333: URL: https://github.com/apache/spark/pull/27333#issuecomment-715667921 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] HeartSaVioR commented on pull request #29970: [SPARK-33087][SQL] DataFrameWriterV2 should delegate table resolution to the analyzer

2020-10-23 Thread GitBox
HeartSaVioR commented on pull request #29970: URL: https://github.com/apache/spark/pull/29970#issuecomment-715667405 Looks like this needs rebasing with recent master. @cloud-fan This is an automated message from the Apache

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29970: [SPARK-33087][SQL] DataFrameWriterV2 should delegate table resolution to the analyzer

2020-10-23 Thread GitBox
AmplabJenkins removed a comment on pull request #29970: URL: https://github.com/apache/spark/pull/29970#issuecomment-715667034 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] SparkQA removed a comment on pull request #29970: [SPARK-33087][SQL] DataFrameWriterV2 should delegate table resolution to the analyzer

2020-10-23 Thread GitBox
SparkQA removed a comment on pull request #29970: URL: https://github.com/apache/spark/pull/29970#issuecomment-715666260 **[Test build #130226 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130226/testReport)** for PR 29970 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29970: [SPARK-33087][SQL] DataFrameWriterV2 should delegate table resolution to the analyzer

2020-10-23 Thread GitBox
AmplabJenkins removed a comment on pull request #29970: URL: https://github.com/apache/spark/pull/29970#issuecomment-715667029 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To

[GitHub] [spark] manuzhang commented on pull request #29330: [SPARK-32432][SQL] Added support for reading ORC/Parquet files with SymlinkTextInputFormat

2020-10-23 Thread GitBox
manuzhang commented on pull request #29330: URL: https://github.com/apache/spark/pull/29330#issuecomment-715667150 cc @cloud-fan @gengliangwang @viirya This is an automated message from the Apache Git Service. To respond to

[GitHub] [spark] SparkQA commented on pull request #29970: [SPARK-33087][SQL] DataFrameWriterV2 should delegate table resolution to the analyzer

2020-10-23 Thread GitBox
SparkQA commented on pull request #29970: URL: https://github.com/apache/spark/pull/29970#issuecomment-715667019 **[Test build #130226 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130226/testReport)** for PR 29970 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #29970: [SPARK-33087][SQL] DataFrameWriterV2 should delegate table resolution to the analyzer

2020-10-23 Thread GitBox
AmplabJenkins commented on pull request #29970: URL: https://github.com/apache/spark/pull/29970#issuecomment-715667029 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #28363: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files

2020-10-23 Thread GitBox
SparkQA commented on pull request #28363: URL: https://github.com/apache/spark/pull/28363#issuecomment-715666709 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34825/

[GitHub] [spark] SparkQA commented on pull request #29970: [SPARK-33087][SQL] DataFrameWriterV2 should delegate table resolution to the analyzer

2020-10-23 Thread GitBox
SparkQA commented on pull request #29970: URL: https://github.com/apache/spark/pull/29970#issuecomment-715666260 **[Test build #130226 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130226/testReport)** for PR 29970 at commit

[GitHub] [spark] SparkQA commented on pull request #28363: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files

2020-10-23 Thread GitBox
SparkQA commented on pull request #28363: URL: https://github.com/apache/spark/pull/28363#issuecomment-715666228 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34821/

[GitHub] [spark] SparkQA commented on pull request #27649: [SPARK-30900][SS] FileStreamSource: Avoid reading compact metadata log twice if the query restarts from compact batch

2020-10-23 Thread GitBox
SparkQA commented on pull request #27649: URL: https://github.com/apache/spark/pull/27649#issuecomment-715665998 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34822/

[GitHub] [spark] SparkQA commented on pull request #24173: [SPARK-27237][SS] Introduce State schema validation among query restart

2020-10-23 Thread GitBox
SparkQA commented on pull request #24173: URL: https://github.com/apache/spark/pull/24173#issuecomment-715665907 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34824/

[GitHub] [spark] SparkQA commented on pull request #27333: [SPARK-29438][SS][FOLLOWUP] Add regression tests for Streaming Aggregation and flatMapGroupsWithState

2020-10-23 Thread GitBox
SparkQA commented on pull request #27333: URL: https://github.com/apache/spark/pull/27333#issuecomment-715665750 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34823/

[GitHub] [spark] HeartSaVioR commented on pull request #29970: [SPARK-33087][SQL] DataFrameWriterV2 should delegate table resolution to the analyzer

2020-10-23 Thread GitBox
HeartSaVioR commented on pull request #29970: URL: https://github.com/apache/spark/pull/29970#issuecomment-715665544 retest this, please This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29906: [SPARK-32037][CORE] Rename blacklisting feature

2020-10-23 Thread GitBox
AmplabJenkins removed a comment on pull request #29906: URL: https://github.com/apache/spark/pull/29906#issuecomment-715665066 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29906: [SPARK-32037][CORE] Rename blacklisting feature

2020-10-23 Thread GitBox
AmplabJenkins commented on pull request #29906: URL: https://github.com/apache/spark/pull/29906#issuecomment-715665066 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #29906: [SPARK-32037][CORE] Rename blacklisting feature

2020-10-23 Thread GitBox
SparkQA commented on pull request #29906: URL: https://github.com/apache/spark/pull/29906#issuecomment-715664806 **[Test build #130220 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130220/testReport)** for PR 29906 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #29906: [SPARK-32037][CORE] Rename blacklisting feature

2020-10-23 Thread GitBox
SparkQA removed a comment on pull request #29906: URL: https://github.com/apache/spark/pull/29906#issuecomment-715646698 **[Test build #130220 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130220/testReport)** for PR 29906 at commit

[GitHub] [spark] SparkQA commented on pull request #24173: [SPARK-27237][SS] Introduce State schema validation among query restart

2020-10-23 Thread GitBox
SparkQA commented on pull request #24173: URL: https://github.com/apache/spark/pull/24173#issuecomment-715661919 **[Test build #130225 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130225/testReport)** for PR 24173 at commit

[GitHub] [spark] SparkQA commented on pull request #28363: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files

2020-10-23 Thread GitBox
SparkQA commented on pull request #28363: URL: https://github.com/apache/spark/pull/28363#issuecomment-715661905 **[Test build #130224 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130224/testReport)** for PR 28363 at commit

[GitHub] [spark] HeartSaVioR commented on pull request #28363: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files

2020-10-23 Thread GitBox
HeartSaVioR commented on pull request #28363: URL: https://github.com/apache/spark/pull/28363#issuecomment-715661080 retest this please This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [spark] SparkQA commented on pull request #24173: [SPARK-27237][SS] Introduce State schema validation among query restart

2020-10-23 Thread GitBox
SparkQA commented on pull request #24173: URL: https://github.com/apache/spark/pull/24173#issuecomment-715661007 **[Test build #130223 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130223/testReport)** for PR 24173 at commit

[GitHub] [spark] HeartSaVioR edited a comment on pull request #28363: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files

2020-10-23 Thread GitBox
HeartSaVioR edited a comment on pull request #28363: URL: https://github.com/apache/spark/pull/28363#issuecomment-715661044 cc. @tdas @zsxwing @viirya @xuanyuanking Kindly reminder. This is an automated message from

[GitHub] [spark] HeartSaVioR commented on pull request #28363: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files

2020-10-23 Thread GitBox
HeartSaVioR commented on pull request #28363: URL: https://github.com/apache/spark/pull/28363#issuecomment-715661044 cc. @tdas @zsxwing @viirya @xuanyuanking This is an automated message from the Apache Git Service.

[GitHub] [spark] SparkQA commented on pull request #27333: [SPARK-29438][SS][FOLLOWUP] Add regression tests for Streaming Aggregation and flatMapGroupsWithState

2020-10-23 Thread GitBox
SparkQA commented on pull request #27333: URL: https://github.com/apache/spark/pull/27333#issuecomment-715661004 **[Test build #130222 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130222/testReport)** for PR 27333 at commit

[GitHub] [spark] SparkQA commented on pull request #27649: [SPARK-30900][SS] FileStreamSource: Avoid reading compact metadata log twice if the query restarts from compact batch

2020-10-23 Thread GitBox
SparkQA commented on pull request #27649: URL: https://github.com/apache/spark/pull/27649#issuecomment-715660984 **[Test build #130221 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130221/testReport)** for PR 27649 at commit

[GitHub] [spark] HeartSaVioR commented on pull request #24173: [SPARK-27237][SS] Introduce State schema validation among query restart

2020-10-23 Thread GitBox
HeartSaVioR commented on pull request #24173: URL: https://github.com/apache/spark/pull/24173#issuecomment-715660935 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub

[GitHub] [spark] HeartSaVioR commented on pull request #27649: [SPARK-30900][SS] FileStreamSource: Avoid reading compact metadata log twice if the query restarts from compact batch

2020-10-23 Thread GitBox
HeartSaVioR commented on pull request #27649: URL: https://github.com/apache/spark/pull/27649#issuecomment-715660720 retest this, please This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [spark] HeartSaVioR commented on pull request #27333: [SPARK-29438][SS][FOLLOWUP] Add regression tests for Streaming Aggregation and flatMapGroupsWithState

2020-10-23 Thread GitBox
HeartSaVioR commented on pull request #27333: URL: https://github.com/apache/spark/pull/27333#issuecomment-715660771 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub

[GitHub] [spark] HeartSaVioR edited a comment on pull request #27649: [SPARK-30900][SS] FileStreamSource: Avoid reading compact metadata log twice if the query restarts from compact batch

2020-10-23 Thread GitBox
HeartSaVioR edited a comment on pull request #27649: URL: https://github.com/apache/spark/pull/27649#issuecomment-715660680 cc. @viirya @xuanyuanking as well to expand the possibility of reviews. This is an automated message

[GitHub] [spark] HeartSaVioR commented on pull request #27649: [SPARK-30900][SS] FileStreamSource: Avoid reading compact metadata log twice if the query restarts from compact batch

2020-10-23 Thread GitBox
HeartSaVioR commented on pull request #27649: URL: https://github.com/apache/spark/pull/27649#issuecomment-715660680 cc. @viirya @xuanyuanking This is an automated message from the Apache Git Service. To respond to the

[GitHub] [spark] viirya commented on a change in pull request #30138: [SPARK-33075][SQL] Enable auto bucketed scan by default (disable only for cached query)

2020-10-23 Thread GitBox
viirya commented on a change in pull request #30138: URL: https://github.com/apache/spark/pull/30138#discussion_r511269228 ## File path: sql/core/src/main/scala/org/apache/spark/sql/SparkSession.scala ## @@ -1077,6 +1077,27 @@ object SparkSession extends Logging { throw

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30138: [SPARK-33075][SQL] Enable auto bucketed scan by default (disable only for cached query)

2020-10-23 Thread GitBox
AmplabJenkins removed a comment on pull request #30138: URL: https://github.com/apache/spark/pull/30138#issuecomment-715659061 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #30138: [SPARK-33075][SQL] Enable auto bucketed scan by default (disable only for cached query)

2020-10-23 Thread GitBox
AmplabJenkins commented on pull request #30138: URL: https://github.com/apache/spark/pull/30138#issuecomment-715659061 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA removed a comment on pull request #30138: [SPARK-33075][SQL] Enable auto bucketed scan by default (disable only for cached query)

2020-10-23 Thread GitBox
SparkQA removed a comment on pull request #30138: URL: https://github.com/apache/spark/pull/30138#issuecomment-715611786 **[Test build #130217 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130217/testReport)** for PR 30138 at commit

[GitHub] [spark] SparkQA commented on pull request #30138: [SPARK-33075][SQL] Enable auto bucketed scan by default (disable only for cached query)

2020-10-23 Thread GitBox
SparkQA commented on pull request #30138: URL: https://github.com/apache/spark/pull/30138#issuecomment-715658861 **[Test build #130217 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130217/testReport)** for PR 30138 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29906: [SPARK-32037][CORE] Rename blacklisting feature

2020-10-23 Thread GitBox
AmplabJenkins removed a comment on pull request #29906: URL: https://github.com/apache/spark/pull/29906#issuecomment-715656257 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] SparkQA commented on pull request #29906: [SPARK-32037][CORE] Rename blacklisting feature

2020-10-23 Thread GitBox
SparkQA commented on pull request #29906: URL: https://github.com/apache/spark/pull/29906#issuecomment-715656248 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34820/

[GitHub] [spark] AmplabJenkins commented on pull request #29906: [SPARK-32037][CORE] Rename blacklisting feature

2020-10-23 Thread GitBox
AmplabJenkins commented on pull request #29906: URL: https://github.com/apache/spark/pull/29906#issuecomment-715656257 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #29906: [SPARK-32037][CORE] Rename blacklisting feature

2020-10-23 Thread GitBox
SparkQA commented on pull request #29906: URL: https://github.com/apache/spark/pull/29906#issuecomment-715652629 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34820/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30093: [SPARK-33183][SQL] Fix EliminateSorts bug when removing global sorts

2020-10-23 Thread GitBox
AmplabJenkins removed a comment on pull request #30093: URL: https://github.com/apache/spark/pull/30093#issuecomment-715651332 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #30093: [SPARK-33183][SQL] Fix EliminateSorts bug when removing global sorts

2020-10-23 Thread GitBox
AmplabJenkins commented on pull request #30093: URL: https://github.com/apache/spark/pull/30093#issuecomment-715651332 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #30093: [SPARK-33183][SQL] Fix EliminateSorts bug when removing global sorts

2020-10-23 Thread GitBox
SparkQA commented on pull request #30093: URL: https://github.com/apache/spark/pull/30093#issuecomment-715651326 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34819/

[GitHub] [spark] HyukjinKwon commented on pull request #30140: [SPARK-33228][SQL] Don't uncache data when replacing a view having the same logical plan

2020-10-23 Thread GitBox
HyukjinKwon commented on pull request #30140: URL: https://github.com/apache/spark/pull/30140#issuecomment-715649695 Looks making sense to me. FYI @dongjoon-hyun. cc @cloud-fan since you're taking a look for views. This is

[GitHub] [spark] SparkQA commented on pull request #30093: [SPARK-33183][SQL] Fix EliminateSorts bug when removing global sorts

2020-10-23 Thread GitBox
SparkQA commented on pull request #30093: URL: https://github.com/apache/spark/pull/30093#issuecomment-715647873 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34819/

[GitHub] [spark] HyukjinKwon closed pull request #30094: [SPARK-33003][PYTHON][DOCS] Add type hints guidelines to the documentation

2020-10-23 Thread GitBox
HyukjinKwon closed pull request #30094: URL: https://github.com/apache/spark/pull/30094 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[GitHub] [spark] HyukjinKwon commented on pull request #30094: [SPARK-33003][PYTHON][DOCS] Add type hints guidelines to the documentation

2020-10-23 Thread GitBox
HyukjinKwon commented on pull request #30094: URL: https://github.com/apache/spark/pull/30094#issuecomment-715647562 Merged to master. This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [spark] HyukjinKwon commented on pull request #30143: [SPARK-32084][PYTHON][SQL] Expand dictionary functions

2020-10-23 Thread GitBox
HyukjinKwon commented on pull request #30143: URL: https://github.com/apache/spark/pull/30143#issuecomment-715647432 > Add proper NumPy-style docstrings to expanded functions. Oh, let's don't do this in this PR. It should add some dependencies into GitHub Actions, fix the script

[GitHub] [spark] HyukjinKwon commented on pull request #30143: [SPARK-32084][PYTHON][SQL] Expand dictionary functions

2020-10-23 Thread GitBox
HyukjinKwon commented on pull request #30143: URL: https://github.com/apache/spark/pull/30143#issuecomment-715647106 Nice, @zero323. Thanks for working on this. This is an automated message from the Apache Git Service. To

[GitHub] [spark] github-actions[bot] commented on pull request #29028: [SPARK-32212][CORE]RDD.takeOrdered can choose to merge intermediate r…

2020-10-23 Thread GitBox
github-actions[bot] commented on pull request #29028: URL: https://github.com/apache/spark/pull/29028#issuecomment-715647015 We're closing this PR because it hasn't been updated in a while. This isn't a judgement on the merit of the PR in any way. It's just a way of keeping the PR queue

[GitHub] [spark] SparkQA commented on pull request #29906: [SPARK-32037][CORE] Rename blacklisting feature

2020-10-23 Thread GitBox
SparkQA commented on pull request #29906: URL: https://github.com/apache/spark/pull/29906#issuecomment-715646698 **[Test build #130220 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130220/testReport)** for PR 29906 at commit

[GitHub] [spark] HyukjinKwon commented on pull request #29906: [SPARK-32037][CORE] Rename blacklisting feature

2020-10-23 Thread GitBox
HyukjinKwon commented on pull request #29906: URL: https://github.com/apache/spark/pull/29906#issuecomment-715646423 retest this please This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30143: [SPARK-32084][PYTHON][SQL] Expand dictionary functions

2020-10-23 Thread GitBox
AmplabJenkins removed a comment on pull request #30143: URL: https://github.com/apache/spark/pull/30143#issuecomment-715646173 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] AmplabJenkins commented on pull request #30143: [SPARK-32084][PYTHON][SQL] Expand dictionary functions

2020-10-23 Thread GitBox
AmplabJenkins commented on pull request #30143: URL: https://github.com/apache/spark/pull/30143#issuecomment-715646170 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30143: [SPARK-32084][PYTHON][SQL] Expand dictionary functions

2020-10-23 Thread GitBox
AmplabJenkins removed a comment on pull request #30143: URL: https://github.com/apache/spark/pull/30143#issuecomment-715646170 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To

[GitHub] [spark] HyukjinKwon commented on pull request #30101: [SPARK-33193][SQL][TEST] Hive ThriftServer JDBC Database MetaData API Behavior Auditing

2020-10-23 Thread GitBox
HyukjinKwon commented on pull request #30101: URL: https://github.com/apache/spark/pull/30101#issuecomment-715646198 +1, nice. This is an automated message from the Apache Git Service. To respond to the message, please log

[GitHub] [spark] SparkQA commented on pull request #30143: [SPARK-32084][PYTHON][SQL] Expand dictionary functions

2020-10-23 Thread GitBox
SparkQA commented on pull request #30143: URL: https://github.com/apache/spark/pull/30143#issuecomment-715646164 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34818/

[GitHub] [spark] HyukjinKwon commented on pull request #30133: [SPARK-33104][BUILD] Exclude 'org.apache.hadoop:hadoop-yarn-server-resourcemanager:jar:tests'

2020-10-23 Thread GitBox
HyukjinKwon commented on pull request #30133: URL: https://github.com/apache/spark/pull/30133#issuecomment-715645114 Okay .. it finally starts to pass: https://amplab.cs.berkeley.edu/jenkins/job/spark-master-test-sbt-hadoop-2.7-hive-2.3/1454/ 

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30093: [SPARK-33183][SQL] Fix EliminateSorts bug when removing global sorts

2020-10-23 Thread GitBox
AmplabJenkins removed a comment on pull request #30093: URL: https://github.com/apache/spark/pull/30093#issuecomment-715641035 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] SparkQA commented on pull request #30143: [SPARK-32084][PYTHON][SQL] Expand dictionary functions

2020-10-23 Thread GitBox
SparkQA commented on pull request #30143: URL: https://github.com/apache/spark/pull/30143#issuecomment-715641175 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34818/

[GitHub] [spark] SparkQA removed a comment on pull request #30093: [SPARK-33183][SQL] Fix EliminateSorts bug when removing global sorts

2020-10-23 Thread GitBox
SparkQA removed a comment on pull request #30093: URL: https://github.com/apache/spark/pull/30093#issuecomment-715640614 **[Test build #130219 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130219/testReport)** for PR 30093 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30093: [SPARK-33183][SQL] Fix EliminateSorts bug when removing global sorts

2020-10-23 Thread GitBox
AmplabJenkins removed a comment on pull request #30093: URL: https://github.com/apache/spark/pull/30093#issuecomment-715641031 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To

[GitHub] [spark] AmplabJenkins commented on pull request #30093: [SPARK-33183][SQL] Fix EliminateSorts bug when removing global sorts

2020-10-23 Thread GitBox
AmplabJenkins commented on pull request #30093: URL: https://github.com/apache/spark/pull/30093#issuecomment-715641031 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #30093: [SPARK-33183][SQL] Fix EliminateSorts bug when removing global sorts

2020-10-23 Thread GitBox
SparkQA commented on pull request #30093: URL: https://github.com/apache/spark/pull/30093#issuecomment-715641025 **[Test build #130219 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130219/testReport)** for PR 30093 at commit

  1   2   3   4   5   >