[GitHub] [spark] AmplabJenkins removed a comment on pull request #24173: [SPARK-27237][SS] Introduce State schema validation among query restart

2020-10-24 Thread GitBox
AmplabJenkins removed a comment on pull request #24173: URL: https://github.com/apache/spark/pull/24173#issuecomment-715832093 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28363: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files

2020-10-24 Thread GitBox
AmplabJenkins removed a comment on pull request #28363: URL: https://github.com/apache/spark/pull/28363#issuecomment-715832319 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] AmplabJenkins removed a comment on pull request #24173: [SPARK-27237][SS] Introduce State schema validation among query restart

2020-10-24 Thread GitBox
AmplabJenkins removed a comment on pull request #24173: URL: https://github.com/apache/spark/pull/24173#issuecomment-715832074 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30138: [SPARK-33075][SQL] Enable auto bucketed scan by default (disable only for cached query)

2020-10-24 Thread GitBox
AmplabJenkins removed a comment on pull request #30138: URL: https://github.com/apache/spark/pull/30138#issuecomment-715830786 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] SparkQA commented on pull request #30143: [SPARK-32084][PYTHON][SQL] Expand dictionary functions

2020-10-24 Thread GitBox
SparkQA commented on pull request #30143: URL: https://github.com/apache/spark/pull/30143#issuecomment-715833374 **[Test build #130230 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130230/testReport)** for PR 30143 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30145: [SPARK-33233][SQL]CUBE/ROLLUP/GROUPING SETS support GROUP BY ordinal

2020-10-24 Thread GitBox
AmplabJenkins removed a comment on pull request #30145: URL: https://github.com/apache/spark/pull/30145#issuecomment-715830055 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] AmplabJenkins commented on pull request #24173: [SPARK-27237][SS] Introduce State schema validation among query restart

2020-10-24 Thread GitBox
AmplabJenkins commented on pull request #24173: URL: https://github.com/apache/spark/pull/24173#issuecomment-715832074 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30144: [SPARK-33229][SQL]Support GROUP BY use Separate columns and CUBE/ROLLUP

2020-10-24 Thread GitBox
AmplabJenkins removed a comment on pull request #30144: URL: https://github.com/apache/spark/pull/30144#issuecomment-715830789 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] AmplabJenkins commented on pull request #28363: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files

2020-10-24 Thread GitBox
AmplabJenkins commented on pull request #28363: URL: https://github.com/apache/spark/pull/28363#issuecomment-715832296 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28363: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files

2020-10-24 Thread GitBox
AmplabJenkins removed a comment on pull request #28363: URL: https://github.com/apache/spark/pull/28363#issuecomment-715832296 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To

[GitHub] [spark] SparkQA removed a comment on pull request #28363: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files

2020-10-24 Thread GitBox
SparkQA removed a comment on pull request #28363: URL: https://github.com/apache/spark/pull/28363#issuecomment-715661905 **[Test build #130224 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130224/testReport)** for PR 28363 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30138: [SPARK-33075][SQL] Enable auto bucketed scan by default (disable only for cached query)

2020-10-24 Thread GitBox
AmplabJenkins removed a comment on pull request #30138: URL: https://github.com/apache/spark/pull/30138#issuecomment-715830756 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30145: [SPARK-33233][SQL]CUBE/ROLLUP/GROUPING SETS support GROUP BY ordinal

2020-10-24 Thread GitBox
AmplabJenkins removed a comment on pull request #30145: URL: https://github.com/apache/spark/pull/30145#issuecomment-715830054 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To

[GitHub] [spark] AmplabJenkins commented on pull request #30144: [SPARK-33229][SQL]Support GROUP BY use Separate columns and CUBE/ROLLUP

2020-10-24 Thread GitBox
AmplabJenkins commented on pull request #30144: URL: https://github.com/apache/spark/pull/30144#issuecomment-715830759 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins commented on pull request #30138: [SPARK-33075][SQL] Enable auto bucketed scan by default (disable only for cached query)

2020-10-24 Thread GitBox
AmplabJenkins commented on pull request #30138: URL: https://github.com/apache/spark/pull/30138#issuecomment-715830756 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30144: [SPARK-33229][SQL]Support GROUP BY use Separate columns and CUBE/ROLLUP

2020-10-24 Thread GitBox
AmplabJenkins removed a comment on pull request #30144: URL: https://github.com/apache/spark/pull/30144#issuecomment-715830759 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To

[GitHub] [spark] SparkQA removed a comment on pull request #30138: [SPARK-33075][SQL] Enable auto bucketed scan by default (disable only for cached query)

2020-10-24 Thread GitBox
SparkQA removed a comment on pull request #30138: URL: https://github.com/apache/spark/pull/30138#issuecomment-715710242 **[Test build #130228 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130228/testReport)** for PR 30138 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #30145: [SPARK-33233][SQL]CUBE/ROLLUP/GROUPING SETS support GROUP BY ordinal

2020-10-24 Thread GitBox
SparkQA removed a comment on pull request #30145: URL: https://github.com/apache/spark/pull/30145#issuecomment-715760936 **[Test build #130229 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130229/testReport)** for PR 30145 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #30144: [SPARK-33229][SQL]Support GROUP BY use Separate columns and CUBE/ROLLUP

2020-10-24 Thread GitBox
SparkQA removed a comment on pull request #30144: URL: https://github.com/apache/spark/pull/30144#issuecomment-715680465 **[Test build #130227 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130227/testReport)** for PR 30144 at commit

[GitHub] [spark] SparkQA commented on pull request #30138: [SPARK-33075][SQL] Enable auto bucketed scan by default (disable only for cached query)

2020-10-24 Thread GitBox
SparkQA commented on pull request #30138: URL: https://github.com/apache/spark/pull/30138#issuecomment-715829881 **[Test build #130228 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130228/testReport)** for PR 30138 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #24173: [SPARK-27237][SS] Introduce State schema validation among query restart

2020-10-24 Thread GitBox
SparkQA removed a comment on pull request #24173: URL: https://github.com/apache/spark/pull/24173#issuecomment-715661919 **[Test build #130225 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130225/testReport)** for PR 24173 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #30145: [SPARK-33233][SQL]CUBE/ROLLUP/GROUPING SETS support GROUP BY ordinal

2020-10-24 Thread GitBox
AmplabJenkins commented on pull request #30145: URL: https://github.com/apache/spark/pull/30145#issuecomment-715830054 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #30144: [SPARK-33229][SQL]Support GROUP BY use Separate columns and CUBE/ROLLUP

2020-10-24 Thread GitBox
SparkQA commented on pull request #30144: URL: https://github.com/apache/spark/pull/30144#issuecomment-715829884 **[Test build #130227 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130227/testReport)** for PR 30144 at commit

[GitHub] [spark] SparkQA commented on pull request #30145: [SPARK-33233][SQL]CUBE/ROLLUP/GROUPING SETS support GROUP BY ordinal

2020-10-24 Thread GitBox
SparkQA commented on pull request #30145: URL: https://github.com/apache/spark/pull/30145#issuecomment-715829887 **[Test build #130229 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130229/testReport)** for PR 30145 at commit

[GitHub] [spark] SparkQA commented on pull request #24173: [SPARK-27237][SS] Introduce State schema validation among query restart

2020-10-24 Thread GitBox
SparkQA commented on pull request #24173: URL: https://github.com/apache/spark/pull/24173#issuecomment-715829885 **[Test build #130225 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130225/testReport)** for PR 24173 at commit

[GitHub] [spark] SparkQA commented on pull request #28363: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files

2020-10-24 Thread GitBox
SparkQA commented on pull request #28363: URL: https://github.com/apache/spark/pull/28363#issuecomment-715829883 **[Test build #130224 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130224/testReport)** for PR 28363 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #24173: [SPARK-27237][SS] Introduce State schema validation among query restart

2020-10-24 Thread GitBox
AmplabJenkins commented on pull request #24173: URL: https://github.com/apache/spark/pull/24173#issuecomment-715828422 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins removed a comment on pull request #24173: [SPARK-27237][SS] Introduce State schema validation among query restart

2020-10-24 Thread GitBox
AmplabJenkins removed a comment on pull request #24173: URL: https://github.com/apache/spark/pull/24173#issuecomment-715828422 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] SparkQA removed a comment on pull request #24173: [SPARK-27237][SS] Introduce State schema validation among query restart

2020-10-24 Thread GitBox
SparkQA removed a comment on pull request #24173: URL: https://github.com/apache/spark/pull/24173#issuecomment-715661007 **[Test build #130223 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130223/testReport)** for PR 24173 at commit

[GitHub] [spark] SparkQA commented on pull request #24173: [SPARK-27237][SS] Introduce State schema validation among query restart

2020-10-24 Thread GitBox
SparkQA commented on pull request #24173: URL: https://github.com/apache/spark/pull/24173#issuecomment-715825878 **[Test build #130223 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130223/testReport)** for PR 24173 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #27649: [SPARK-30900][SS] FileStreamSource: Avoid reading compact metadata log twice if the query restarts from compact batch

2020-10-24 Thread GitBox
AmplabJenkins removed a comment on pull request #27649: URL: https://github.com/apache/spark/pull/27649#issuecomment-715822285 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #27649: [SPARK-30900][SS] FileStreamSource: Avoid reading compact metadata log twice if the query restarts from compact batch

2020-10-24 Thread GitBox
AmplabJenkins commented on pull request #27649: URL: https://github.com/apache/spark/pull/27649#issuecomment-715822285 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] HeartSaVioR commented on a change in pull request #27333: [SPARK-29438][SS][FOLLOWUP] Add regression tests for Streaming Aggregation and flatMapGroupsWithState

2020-10-24 Thread GitBox
HeartSaVioR commented on a change in pull request #27333: URL: https://github.com/apache/spark/pull/27333#discussion_r511329896 ## File path: sql/core/src/test/scala/org/apache/spark/sql/streaming/FlatMapGroupsWithStateSuite.scala ## @@ -1019,6 +1019,56 @@ class

[GitHub] [spark] SparkQA removed a comment on pull request #27649: [SPARK-30900][SS] FileStreamSource: Avoid reading compact metadata log twice if the query restarts from compact batch

2020-10-24 Thread GitBox
SparkQA removed a comment on pull request #27649: URL: https://github.com/apache/spark/pull/27649#issuecomment-715660984 **[Test build #130221 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130221/testReport)** for PR 27649 at commit

[GitHub] [spark] SparkQA commented on pull request #27649: [SPARK-30900][SS] FileStreamSource: Avoid reading compact metadata log twice if the query restarts from compact batch

2020-10-24 Thread GitBox
SparkQA commented on pull request #27649: URL: https://github.com/apache/spark/pull/27649#issuecomment-715819519 **[Test build #130221 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130221/testReport)** for PR 27649 at commit

[GitHub] [spark] HeartSaVioR opened a new pull request #27333: [SPARK-29438][SS][FOLLOWUP] Add regression tests for Streaming Aggregation and flatMapGroupsWithState

2020-10-24 Thread GitBox
HeartSaVioR opened a new pull request #27333: URL: https://github.com/apache/spark/pull/27333 ### What changes were proposed in this pull request? This patch adds new UTs to prevent SPARK-29438 for streaming aggregation as well as flatMapGroupsWithState, as we agree about the review

[GitHub] [spark] HeartSaVioR commented on pull request #27333: [SPARK-29438][SS][FOLLOWUP] Add regression tests for Streaming Aggregation and flatMapGroupsWithState

2020-10-24 Thread GitBox
HeartSaVioR commented on pull request #27333: URL: https://github.com/apache/spark/pull/27333#issuecomment-715810779 Accidentally closed this - don't get the UX on Github mobile app. (Don't know how to reopen in the app.) Will reopen soon.

[GitHub] [spark] SparkQA commented on pull request #30145: [SPARK-33233][SQL]CUBE/ROLLUP/GROUPING SETS support GROUP BY ordinal

2020-10-24 Thread GitBox
SparkQA commented on pull request #30145: URL: https://github.com/apache/spark/pull/30145#issuecomment-715809580 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34829/

[GitHub] [spark] HeartSaVioR closed pull request #27333: [SPARK-29438][SS][FOLLOWUP] Add regression tests for Streaming Aggregation and flatMapGroupsWithState

2020-10-24 Thread GitBox
HeartSaVioR closed pull request #27333: URL: https://github.com/apache/spark/pull/27333 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30138: [SPARK-33075][SQL] Enable auto bucketed scan by default (disable only for cached query)

2020-10-24 Thread GitBox
AmplabJenkins removed a comment on pull request #30138: URL: https://github.com/apache/spark/pull/30138#issuecomment-715772704 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #30138: [SPARK-33075][SQL] Enable auto bucketed scan by default (disable only for cached query)

2020-10-24 Thread GitBox
AmplabJenkins commented on pull request #30138: URL: https://github.com/apache/spark/pull/30138#issuecomment-715772704 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #30138: [SPARK-33075][SQL] Enable auto bucketed scan by default (disable only for cached query)

2020-10-24 Thread GitBox
SparkQA commented on pull request #30138: URL: https://github.com/apache/spark/pull/30138#issuecomment-715772657 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34828/

[GitHub] [spark] SparkQA commented on pull request #30145: [SPARK-33233][SQL]CUBE/ROLLUP/GROUPING SETS support GROUP BY ordinal

2020-10-24 Thread GitBox
SparkQA commented on pull request #30145: URL: https://github.com/apache/spark/pull/30145#issuecomment-715760936 **[Test build #130229 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130229/testReport)** for PR 30145 at commit

[GitHub] [spark] AngersZhuuuu opened a new pull request #30145: [SPARK-33233][SQL]CUBE/ROLLUP/GROUPING SETS support GROUP BY ordinal

2020-10-24 Thread GitBox
AngersZh opened a new pull request #30145: URL: https://github.com/apache/spark/pull/30145 ### What changes were proposed in this pull request? Currently, we can't support use ordinal in CUBE/ROLLUP/GROUPING SETS, this pr make CUBE/ROLLUP/GROUPING SETS support GROUP BY ordinal

[GitHub] [spark] SparkQA commented on pull request #30138: [SPARK-33075][SQL] Enable auto bucketed scan by default (disable only for cached query)

2020-10-23 Thread GitBox
SparkQA commented on pull request #30138: URL: https://github.com/apache/spark/pull/30138#issuecomment-715749893 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34828/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30144: [SPARK-33229][SQL]Support GROUP BY use Separate columns and CUBE/ROLLUP

2020-10-23 Thread GitBox
AmplabJenkins removed a comment on pull request #30144: URL: https://github.com/apache/spark/pull/30144#issuecomment-715749524 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #30144: [SPARK-33229][SQL]Support GROUP BY use Separate columns and CUBE/ROLLUP

2020-10-23 Thread GitBox
AmplabJenkins commented on pull request #30144: URL: https://github.com/apache/spark/pull/30144#issuecomment-715749524 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #30144: [SPARK-33229][SQL]Support GROUP BY use Separate columns and CUBE/ROLLUP

2020-10-23 Thread GitBox
SparkQA commented on pull request #30144: URL: https://github.com/apache/spark/pull/30144#issuecomment-715749485 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34827/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #27333: [SPARK-29438][SS][FOLLOWUP] Add regression tests for Streaming Aggregation and flatMapGroupsWithState

2020-10-23 Thread GitBox
AmplabJenkins removed a comment on pull request #27333: URL: https://github.com/apache/spark/pull/27333#issuecomment-715740603 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] AmplabJenkins commented on pull request #27333: [SPARK-29438][SS][FOLLOWUP] Add regression tests for Streaming Aggregation and flatMapGroupsWithState

2020-10-23 Thread GitBox
AmplabJenkins commented on pull request #27333: URL: https://github.com/apache/spark/pull/27333#issuecomment-715740574 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins removed a comment on pull request #27333: [SPARK-29438][SS][FOLLOWUP] Add regression tests for Streaming Aggregation and flatMapGroupsWithState

2020-10-23 Thread GitBox
AmplabJenkins removed a comment on pull request #27333: URL: https://github.com/apache/spark/pull/27333#issuecomment-715740574 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To

[GitHub] [spark] SparkQA removed a comment on pull request #27333: [SPARK-29438][SS][FOLLOWUP] Add regression tests for Streaming Aggregation and flatMapGroupsWithState

2020-10-23 Thread GitBox
SparkQA removed a comment on pull request #27333: URL: https://github.com/apache/spark/pull/27333#issuecomment-715661004 **[Test build #130222 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130222/testReport)** for PR 27333 at commit

[GitHub] [spark] SparkQA commented on pull request #27333: [SPARK-29438][SS][FOLLOWUP] Add regression tests for Streaming Aggregation and flatMapGroupsWithState

2020-10-23 Thread GitBox
SparkQA commented on pull request #27333: URL: https://github.com/apache/spark/pull/27333#issuecomment-715739957 **[Test build #130222 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130222/testReport)** for PR 27333 at commit

[GitHub] [spark] SparkQA commented on pull request #30144: [SPARK-33229][SQL]Support GROUP BY use Separate columns and CUBE/ROLLUP

2020-10-23 Thread GitBox
SparkQA commented on pull request #30144: URL: https://github.com/apache/spark/pull/30144#issuecomment-715728829 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34827/

[GitHub] [spark] SparkQA commented on pull request #30138: [SPARK-33075][SQL] Enable auto bucketed scan by default (disable only for cached query)

2020-10-23 Thread GitBox
SparkQA commented on pull request #30138: URL: https://github.com/apache/spark/pull/30138#issuecomment-715710242 **[Test build #130228 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130228/testReport)** for PR 30138 at commit

[GitHub] [spark] c21 commented on a change in pull request #30138: [SPARK-33075][SQL] Enable auto bucketed scan by default (disable only for cached query)

2020-10-23 Thread GitBox
c21 commented on a change in pull request #30138: URL: https://github.com/apache/spark/pull/30138#discussion_r511321724 ## File path: sql/core/src/main/scala/org/apache/spark/sql/SparkSession.scala ## @@ -1077,6 +1077,27 @@ object SparkSession extends Logging { throw

[GitHub] [spark] viirya commented on a change in pull request #30138: [SPARK-33075][SQL] Enable auto bucketed scan by default (disable only for cached query)

2020-10-23 Thread GitBox
viirya commented on a change in pull request #30138: URL: https://github.com/apache/spark/pull/30138#discussion_r511321486 ## File path: sql/core/src/main/scala/org/apache/spark/sql/SparkSession.scala ## @@ -1077,6 +1077,27 @@ object SparkSession extends Logging { throw

[GitHub] [spark] viirya commented on a change in pull request #30138: [SPARK-33075][SQL] Enable auto bucketed scan by default (disable only for cached query)

2020-10-23 Thread GitBox
viirya commented on a change in pull request #30138: URL: https://github.com/apache/spark/pull/30138#discussion_r511321486 ## File path: sql/core/src/main/scala/org/apache/spark/sql/SparkSession.scala ## @@ -1077,6 +1077,27 @@ object SparkSession extends Logging { throw

[GitHub] [spark] SparkQA commented on pull request #30144: [SPARK-33229][SQL]Support GROUP BY use Separate columns and CUBE/ROLLUP

2020-10-23 Thread GitBox
SparkQA commented on pull request #30144: URL: https://github.com/apache/spark/pull/30144#issuecomment-715680465 **[Test build #130227 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130227/testReport)** for PR 30144 at commit

[GitHub] [spark] c21 commented on a change in pull request #30138: [SPARK-33075][SQL] Enable auto bucketed scan by default (disable only for cached query)

2020-10-23 Thread GitBox
c21 commented on a change in pull request #30138: URL: https://github.com/apache/spark/pull/30138#discussion_r511319323 ## File path: sql/core/src/main/scala/org/apache/spark/sql/SparkSession.scala ## @@ -1077,6 +1077,27 @@ object SparkSession extends Logging { throw

[GitHub] [spark] AngersZhuuuu opened a new pull request #30144: [SPARK-33229][SQL]Support GROUP BY use Separate columns and CUBE/ROLLUP

2020-10-23 Thread GitBox
AngersZh opened a new pull request #30144: URL: https://github.com/apache/spark/pull/30144 ### What changes were proposed in this pull request? ### Why are the changes needed? Support more SQL scenario. ### Does this PR introduce _any_ user-facing change?

[GitHub] [spark] viirya commented on a change in pull request #27333: [SPARK-29438][SS][FOLLOWUP] Add regression tests for Streaming Aggregation and flatMapGroupsWithState

2020-10-23 Thread GitBox
viirya commented on a change in pull request #27333: URL: https://github.com/apache/spark/pull/27333#discussion_r511309080 ## File path: sql/core/src/test/scala/org/apache/spark/sql/streaming/FlatMapGroupsWithStateSuite.scala ## @@ -1019,6 +1019,56 @@ class

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28363: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files

2020-10-23 Thread GitBox
AmplabJenkins removed a comment on pull request #28363: URL: https://github.com/apache/spark/pull/28363#issuecomment-715669520 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] SparkQA commented on pull request #28363: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files

2020-10-23 Thread GitBox
SparkQA commented on pull request #28363: URL: https://github.com/apache/spark/pull/28363#issuecomment-715669517 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34825/

[GitHub] [spark] AmplabJenkins commented on pull request #28363: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files

2020-10-23 Thread GitBox
AmplabJenkins commented on pull request #28363: URL: https://github.com/apache/spark/pull/28363#issuecomment-715669520 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins removed a comment on pull request #24173: [SPARK-27237][SS] Introduce State schema validation among query restart

2020-10-23 Thread GitBox
AmplabJenkins removed a comment on pull request #24173: URL: https://github.com/apache/spark/pull/24173#issuecomment-715668895 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29970: [SPARK-33087][SQL] DataFrameWriterV2 should delegate table resolution to the analyzer

2020-10-23 Thread GitBox
AmplabJenkins removed a comment on pull request #29970: URL: https://github.com/apache/spark/pull/29970#issuecomment-715668893 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To

[GitHub] [spark] AmplabJenkins commented on pull request #29970: [SPARK-33087][SQL] DataFrameWriterV2 should delegate table resolution to the analyzer

2020-10-23 Thread GitBox
AmplabJenkins commented on pull request #29970: URL: https://github.com/apache/spark/pull/29970#issuecomment-715668893 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins commented on pull request #24173: [SPARK-27237][SS] Introduce State schema validation among query restart

2020-10-23 Thread GitBox
AmplabJenkins commented on pull request #24173: URL: https://github.com/apache/spark/pull/24173#issuecomment-715668895 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #24173: [SPARK-27237][SS] Introduce State schema validation among query restart

2020-10-23 Thread GitBox
SparkQA commented on pull request #24173: URL: https://github.com/apache/spark/pull/24173#issuecomment-715668892 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34824/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28363: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files

2020-10-23 Thread GitBox
AmplabJenkins removed a comment on pull request #28363: URL: https://github.com/apache/spark/pull/28363#issuecomment-715668528 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] SparkQA commented on pull request #28363: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files

2020-10-23 Thread GitBox
SparkQA commented on pull request #28363: URL: https://github.com/apache/spark/pull/28363#issuecomment-715668522 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34821/

[GitHub] [spark] AmplabJenkins commented on pull request #28363: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files

2020-10-23 Thread GitBox
AmplabJenkins commented on pull request #28363: URL: https://github.com/apache/spark/pull/28363#issuecomment-715668528 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins removed a comment on pull request #27649: [SPARK-30900][SS] FileStreamSource: Avoid reading compact metadata log twice if the query restarts from compact batch

2020-10-23 Thread GitBox
AmplabJenkins removed a comment on pull request #27649: URL: https://github.com/apache/spark/pull/27649#issuecomment-715668308 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] HeartSaVioR commented on pull request #28841: [SPARK-31962][SQL] Provide modifiedAfter and modifiedBefore options when filtering from a batch-based file data source

2020-10-23 Thread GitBox
HeartSaVioR commented on pull request #28841: URL: https://github.com/apache/spark/pull/28841#issuecomment-715668416 I wouldn't request to an individual contributor doing the heavy work consistently - now this PR has nearly 300 comments. If the remaining comments are minors (not

[GitHub] [spark] AmplabJenkins commented on pull request #27649: [SPARK-30900][SS] FileStreamSource: Avoid reading compact metadata log twice if the query restarts from compact batch

2020-10-23 Thread GitBox
AmplabJenkins commented on pull request #27649: URL: https://github.com/apache/spark/pull/27649#issuecomment-715668308 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #27649: [SPARK-30900][SS] FileStreamSource: Avoid reading compact metadata log twice if the query restarts from compact batch

2020-10-23 Thread GitBox
SparkQA commented on pull request #27649: URL: https://github.com/apache/spark/pull/27649#issuecomment-715668305 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34822/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #27333: [SPARK-29438][SS][FOLLOWUP] Add regression tests for Streaming Aggregation and flatMapGroupsWithState

2020-10-23 Thread GitBox
AmplabJenkins removed a comment on pull request #27333: URL: https://github.com/apache/spark/pull/27333#issuecomment-715667921 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] SparkQA commented on pull request #27333: [SPARK-29438][SS][FOLLOWUP] Add regression tests for Streaming Aggregation and flatMapGroupsWithState

2020-10-23 Thread GitBox
SparkQA commented on pull request #27333: URL: https://github.com/apache/spark/pull/27333#issuecomment-715667918 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34823/

[GitHub] [spark] AmplabJenkins commented on pull request #27333: [SPARK-29438][SS][FOLLOWUP] Add regression tests for Streaming Aggregation and flatMapGroupsWithState

2020-10-23 Thread GitBox
AmplabJenkins commented on pull request #27333: URL: https://github.com/apache/spark/pull/27333#issuecomment-715667921 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] HeartSaVioR commented on pull request #29970: [SPARK-33087][SQL] DataFrameWriterV2 should delegate table resolution to the analyzer

2020-10-23 Thread GitBox
HeartSaVioR commented on pull request #29970: URL: https://github.com/apache/spark/pull/29970#issuecomment-715667405 Looks like this needs rebasing with recent master. @cloud-fan This is an automated message from the Apache

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29970: [SPARK-33087][SQL] DataFrameWriterV2 should delegate table resolution to the analyzer

2020-10-23 Thread GitBox
AmplabJenkins removed a comment on pull request #29970: URL: https://github.com/apache/spark/pull/29970#issuecomment-715667034 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] SparkQA removed a comment on pull request #29970: [SPARK-33087][SQL] DataFrameWriterV2 should delegate table resolution to the analyzer

2020-10-23 Thread GitBox
SparkQA removed a comment on pull request #29970: URL: https://github.com/apache/spark/pull/29970#issuecomment-715666260 **[Test build #130226 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130226/testReport)** for PR 29970 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29970: [SPARK-33087][SQL] DataFrameWriterV2 should delegate table resolution to the analyzer

2020-10-23 Thread GitBox
AmplabJenkins removed a comment on pull request #29970: URL: https://github.com/apache/spark/pull/29970#issuecomment-715667029 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To

[GitHub] [spark] manuzhang commented on pull request #29330: [SPARK-32432][SQL] Added support for reading ORC/Parquet files with SymlinkTextInputFormat

2020-10-23 Thread GitBox
manuzhang commented on pull request #29330: URL: https://github.com/apache/spark/pull/29330#issuecomment-715667150 cc @cloud-fan @gengliangwang @viirya This is an automated message from the Apache Git Service. To respond to

[GitHub] [spark] SparkQA commented on pull request #29970: [SPARK-33087][SQL] DataFrameWriterV2 should delegate table resolution to the analyzer

2020-10-23 Thread GitBox
SparkQA commented on pull request #29970: URL: https://github.com/apache/spark/pull/29970#issuecomment-715667019 **[Test build #130226 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130226/testReport)** for PR 29970 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #29970: [SPARK-33087][SQL] DataFrameWriterV2 should delegate table resolution to the analyzer

2020-10-23 Thread GitBox
AmplabJenkins commented on pull request #29970: URL: https://github.com/apache/spark/pull/29970#issuecomment-715667029 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #28363: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files

2020-10-23 Thread GitBox
SparkQA commented on pull request #28363: URL: https://github.com/apache/spark/pull/28363#issuecomment-715666709 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34825/

[GitHub] [spark] SparkQA commented on pull request #29970: [SPARK-33087][SQL] DataFrameWriterV2 should delegate table resolution to the analyzer

2020-10-23 Thread GitBox
SparkQA commented on pull request #29970: URL: https://github.com/apache/spark/pull/29970#issuecomment-715666260 **[Test build #130226 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130226/testReport)** for PR 29970 at commit

[GitHub] [spark] SparkQA commented on pull request #28363: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files

2020-10-23 Thread GitBox
SparkQA commented on pull request #28363: URL: https://github.com/apache/spark/pull/28363#issuecomment-715666228 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34821/

[GitHub] [spark] SparkQA commented on pull request #27649: [SPARK-30900][SS] FileStreamSource: Avoid reading compact metadata log twice if the query restarts from compact batch

2020-10-23 Thread GitBox
SparkQA commented on pull request #27649: URL: https://github.com/apache/spark/pull/27649#issuecomment-715665998 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34822/

[GitHub] [spark] SparkQA commented on pull request #24173: [SPARK-27237][SS] Introduce State schema validation among query restart

2020-10-23 Thread GitBox
SparkQA commented on pull request #24173: URL: https://github.com/apache/spark/pull/24173#issuecomment-715665907 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34824/

[GitHub] [spark] SparkQA commented on pull request #27333: [SPARK-29438][SS][FOLLOWUP] Add regression tests for Streaming Aggregation and flatMapGroupsWithState

2020-10-23 Thread GitBox
SparkQA commented on pull request #27333: URL: https://github.com/apache/spark/pull/27333#issuecomment-715665750 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34823/

[GitHub] [spark] HeartSaVioR commented on pull request #29970: [SPARK-33087][SQL] DataFrameWriterV2 should delegate table resolution to the analyzer

2020-10-23 Thread GitBox
HeartSaVioR commented on pull request #29970: URL: https://github.com/apache/spark/pull/29970#issuecomment-715665544 retest this, please This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29906: [SPARK-32037][CORE] Rename blacklisting feature

2020-10-23 Thread GitBox
AmplabJenkins removed a comment on pull request #29906: URL: https://github.com/apache/spark/pull/29906#issuecomment-715665066 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29906: [SPARK-32037][CORE] Rename blacklisting feature

2020-10-23 Thread GitBox
AmplabJenkins commented on pull request #29906: URL: https://github.com/apache/spark/pull/29906#issuecomment-715665066 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #29906: [SPARK-32037][CORE] Rename blacklisting feature

2020-10-23 Thread GitBox
SparkQA commented on pull request #29906: URL: https://github.com/apache/spark/pull/29906#issuecomment-715664806 **[Test build #130220 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130220/testReport)** for PR 29906 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #29906: [SPARK-32037][CORE] Rename blacklisting feature

2020-10-23 Thread GitBox
SparkQA removed a comment on pull request #29906: URL: https://github.com/apache/spark/pull/29906#issuecomment-715646698 **[Test build #130220 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130220/testReport)** for PR 29906 at commit

[GitHub] [spark] SparkQA commented on pull request #24173: [SPARK-27237][SS] Introduce State schema validation among query restart

2020-10-23 Thread GitBox
SparkQA commented on pull request #24173: URL: https://github.com/apache/spark/pull/24173#issuecomment-715661919 **[Test build #130225 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130225/testReport)** for PR 24173 at commit

[GitHub] [spark] SparkQA commented on pull request #28363: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files

2020-10-23 Thread GitBox
SparkQA commented on pull request #28363: URL: https://github.com/apache/spark/pull/28363#issuecomment-715661905 **[Test build #130224 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/130224/testReport)** for PR 28363 at commit

  1   2   3   4   5   6   7   8   9   10   >