[GitHub] [spark] AmplabJenkins removed a comment on pull request #29332: [SPARK-32518][CORE] CoarseGrainedSchedulerBackend.maxNumConcurrentTasks should consider all kinds of resources

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29332: URL: https://github.com/apache/spark/pull/29332#issuecomment-668996964 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29332: [SPARK-32518][CORE] CoarseGrainedSchedulerBackend.maxNumConcurrentTasks should consider all kinds of resources

2020-08-04 Thread GitBox
AmplabJenkins commented on pull request #29332: URL: https://github.com/apache/spark/pull/29332#issuecomment-668996964 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA removed a comment on pull request #29332: [SPARK-32518][CORE] CoarseGrainedSchedulerBackend.maxNumConcurrentTasks should consider all kinds of resources

2020-08-04 Thread GitBox
SparkQA removed a comment on pull request #29332: URL: https://github.com/apache/spark/pull/29332#issuecomment-668959450 **[Test build #127074 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127074/testReport)** for PR 29332 at commit

[GitHub] [spark] SparkQA commented on pull request #29332: [SPARK-32518][CORE] CoarseGrainedSchedulerBackend.maxNumConcurrentTasks should consider all kinds of resources

2020-08-04 Thread GitBox
SparkQA commented on pull request #29332: URL: https://github.com/apache/spark/pull/29332#issuecomment-668996210 **[Test build #127074 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127074/testReport)** for PR 29332 at commit

[GitHub] [spark] cloud-fan commented on a change in pull request #29311: [SPARK-32501][SQL] Convert null to "null" in structs, maps and arrays while casting to strings

2020-08-04 Thread GitBox
cloud-fan commented on a change in pull request #29311: URL: https://github.com/apache/spark/pull/29311#discussion_r465489920 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/Cast.scala ## @@ -895,6 +905,10 @@ abstract class CastBase extends

[GitHub] [spark] cloud-fan commented on a change in pull request #29311: [SPARK-32501][SQL] Convert null to "null" in structs, maps and arrays while casting to strings

2020-08-04 Thread GitBox
cloud-fan commented on a change in pull request #29311: URL: https://github.com/apache/spark/pull/29311#discussion_r465489374 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/Cast.scala ## @@ -342,7 +344,9 @@ abstract class CastBase extends

[GitHub] [spark] cloud-fan commented on a change in pull request #29311: [SPARK-32501][SQL] Convert null to "null" in structs, maps and arrays while casting to strings

2020-08-04 Thread GitBox
cloud-fan commented on a change in pull request #29311: URL: https://github.com/apache/spark/pull/29311#discussion_r465489447 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/Cast.scala ## @@ -351,7 +355,9 @@ abstract class CastBase extends

[GitHub] [spark] cloud-fan commented on a change in pull request #29311: [SPARK-32501][SQL] Convert null to "null" in structs, maps and arrays while casting to strings

2020-08-04 Thread GitBox
cloud-fan commented on a change in pull request #29311: URL: https://github.com/apache/spark/pull/29311#discussion_r465489268 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/Cast.scala ## @@ -321,7 +321,9 @@ abstract class CastBase extends

[GitHub] [spark] cloud-fan commented on a change in pull request #29311: [SPARK-32501][SQL] Convert null to "null" in structs, maps and arrays while casting to strings

2020-08-04 Thread GitBox
cloud-fan commented on a change in pull request #29311: URL: https://github.com/apache/spark/pull/29311#discussion_r465488975 ## File path: docs/sql-migration-guide.md ## @@ -34,6 +34,8 @@ license: | - In Spark 3.1, structs and maps are wrapped by the `{}` brackets in

[GitHub] [spark] stijndehaes commented on pull request #28423: [SPARK-24266][k8s] Restart the watcher when we receive a version changed from k8s

2020-08-04 Thread GitBox
stijndehaes commented on pull request #28423: URL: https://github.com/apache/spark/pull/28423#issuecomment-668991161 @jkleckner I have never had a problem with the driver watching the executors. I think there was already a fallback mechanism there, but I never looked into the code for

[GitHub] [spark] JkSelf commented on pull request #29266: [SPARK-32464][SQL] Support skew handling on join that has one side wi…

2020-08-04 Thread GitBox
JkSelf commented on pull request #29266: URL: https://github.com/apache/spark/pull/29266#issuecomment-668989319 Can you show the plan changes in the UI? Also, does changing the partition number on the bucket side introduce an additional shuffle?

[GitHub] [spark] cloud-fan commented on pull request #29137: [SPARK-32337][SQL] Show initial plan in AQE plan tree string

2020-08-04 Thread GitBox
cloud-fan commented on pull request #29137: URL: https://github.com/apache/spark/pull/29137#issuecomment-668988346 ``` == Physical Plan == AdaptiveSparkPlan (9) +- == Current Plan == BroadcastHashJoin Inner BuildRight (8) :- Project (3) : +- Filter (2) +- ==

[GitHub] [spark] ScrapCodes commented on pull request #29334: [SPARK-32495][2.4] Update jackson versions to a maintained release, to fix various security vulnerabilities.

2020-08-04 Thread GitBox
ScrapCodes commented on pull request #29334: URL: https://github.com/apache/spark/pull/29334#issuecomment-668988454 Alrighty, then I will skip this for the 2.4.7 release, even though I still feel that this might be safe and good for people in general, provided jackson 2.6.7 had last release

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29353: [SPARK-32532][SQL] Improve ORC read/write performance on nested structs and array of structs

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29353: URL: https://github.com/apache/spark/pull/29353#issuecomment-668986802 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29352: [SPARK-32531][SQL][TEST] Add benchmarks for nested structs and arrays for different file formats

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29352: URL: https://github.com/apache/spark/pull/29352#issuecomment-668986647 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29353: [SPARK-32532][SQL] Improve ORC read/write performance on nested structs and array of structs

2020-08-04 Thread GitBox
AmplabJenkins commented on pull request #29353: URL: https://github.com/apache/spark/pull/29353#issuecomment-668986802 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins commented on pull request #29352: [SPARK-32531][SQL][TEST] Add benchmarks for nested structs and arrays for different file formats

2020-08-04 Thread GitBox
AmplabJenkins commented on pull request #29352: URL: https://github.com/apache/spark/pull/29352#issuecomment-668986647 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] HyukjinKwon commented on pull request #29334: [SPARK-32495][2.4] Update jackson versions to a maintained release, to fix various security vulnerabilities.

2020-08-04 Thread GitBox
HyukjinKwon commented on pull request #29334: URL: https://github.com/apache/spark/pull/29334#issuecomment-668986454 For the PR itself, I agree with @srowen's and @dongjoon-hyun's comments at https://github.com/apache/spark/pull/29334#issuecomment-668044607 and

[GitHub] [spark] SparkQA removed a comment on pull request #29353: [SPARK-32532][SQL] Improve ORC read/write performance on nested structs and array of structs

2020-08-04 Thread GitBox
SparkQA removed a comment on pull request #29353: URL: https://github.com/apache/spark/pull/29353#issuecomment-668914387 **[Test build #127071 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127071/testReport)** for PR 29353 at commit

[GitHub] [spark] SparkQA commented on pull request #29353: [SPARK-32532][SQL] Improve ORC read/write performance on nested structs and array of structs

2020-08-04 Thread GitBox
SparkQA commented on pull request #29353: URL: https://github.com/apache/spark/pull/29353#issuecomment-668986200 **[Test build #127071 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127071/testReport)** for PR 29353 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #29352: [SPARK-32531][SQL][TEST] Add benchmarks for nested structs and arrays for different file formats

2020-08-04 Thread GitBox
SparkQA removed a comment on pull request #29352: URL: https://github.com/apache/spark/pull/29352#issuecomment-668911605 **[Test build #127070 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127070/testReport)** for PR 29352 at commit

[GitHub] [spark] SparkQA commented on pull request #29352: [SPARK-32531][SQL][TEST] Add benchmarks for nested structs and arrays for different file formats

2020-08-04 Thread GitBox
SparkQA commented on pull request #29352: URL: https://github.com/apache/spark/pull/29352#issuecomment-668985998 **[Test build #127070 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127070/testReport)** for PR 29352 at commit

[GitHub] [spark] HyukjinKwon commented on pull request #29334: [SPARK-32495][2.4] Update jackson versions to a maintained release, to fix various security vulnerabilities.

2020-08-04 Thread GitBox
HyukjinKwon commented on pull request #29334: URL: https://github.com/apache/spark/pull/29334#issuecomment-668985839 Yes, I think it does. That was one of the reasons why I was hesitant. FYI, there was some discussion and a few updates about resources at SPARK-32264. Given that the PRs

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29350: [SPARK-32529][CORE] Fix Historyserver log scan aborted by application status change

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29350: URL: https://github.com/apache/spark/pull/29350#issuecomment-668985557 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29350: [SPARK-32529][CORE] Fix Historyserver log scan aborted by application status change

2020-08-04 Thread GitBox
AmplabJenkins commented on pull request #29350: URL: https://github.com/apache/spark/pull/29350#issuecomment-668985557 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #29350: [SPARK-32529][CORE] Fix Historyserver log scan aborted by application status change

2020-08-04 Thread GitBox
SparkQA commented on pull request #29350: URL: https://github.com/apache/spark/pull/29350#issuecomment-668985252 **[Test build #127079 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127079/testReport)** for PR 29350 at commit

[GitHub] [spark] HeartSaVioR commented on a change in pull request #29350: [SPARK-32529][CORE] Fix Historyserver log scan aborted by application status change

2020-08-04 Thread GitBox
HeartSaVioR commented on a change in pull request #29350: URL: https://github.com/apache/spark/pull/29350#discussion_r465477234 ## File path: core/src/main/scala/org/apache/spark/deploy/history/FsHistoryProvider.scala ## @@ -530,10 +530,16 @@ private[history] class

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29350: [SPARK-32529][CORE] Fix Historyserver log scan aborted by application status change

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29350: URL: https://github.com/apache/spark/pull/29350#issuecomment-668691724 Can one of the admins verify this patch?

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28761: [SPARK-25557][SQL][test-hive1.2] Nested column predicate pushdown for ORC

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #28761: URL: https://github.com/apache/spark/pull/28761#issuecomment-668983580 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] HeartSaVioR commented on pull request #29350: [SPARK-32529][CORE] Fix Historyserver log scan aborted by application status change

2020-08-04 Thread GitBox
HeartSaVioR commented on pull request #29350: URL: https://github.com/apache/spark/pull/29350#issuecomment-668983630 ok to test

[GitHub] [spark] ScrapCodes commented on pull request #29334: [SPARK-32495][2.4] Update jackson versions to a maintained release, to fix various security vulnerabilities.

2020-08-04 Thread GitBox
ScrapCodes commented on pull request #29334: URL: https://github.com/apache/spark/pull/29334#issuecomment-668983574 @HyukjinKwon Thanks for looking into it. It is my mistake; I did not know that GitHub Actions are not ported to other branches yet. I am not 100% sure that they should

[GitHub] [spark] AmplabJenkins commented on pull request #28761: [SPARK-25557][SQL][test-hive1.2] Nested column predicate pushdown for ORC

2020-08-04 Thread GitBox
AmplabJenkins commented on pull request #28761: URL: https://github.com/apache/spark/pull/28761#issuecomment-668983580 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #28761: [SPARK-25557][SQL][test-hive1.2] Nested column predicate pushdown for ORC

2020-08-04 Thread GitBox
SparkQA commented on pull request #28761: URL: https://github.com/apache/spark/pull/28761#issuecomment-668983196 **[Test build #127078 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127078/testReport)** for PR 28761 at commit

[GitHub] [spark] viirya commented on pull request #28761: [SPARK-25557][SQL][test-hive1.2] Nested column predicate pushdown for ORC

2020-08-04 Thread GitBox
viirya commented on pull request #28761: URL: https://github.com/apache/spark/pull/28761#issuecomment-668982975 retest this please

[GitHub] [spark] HyukjinKwon commented on pull request #29334: [SPARK-32495][2.4] Update jackson versions to a maintained release, to fix various security vulnerabilities.

2020-08-04 Thread GitBox
HyukjinKwon commented on pull request #29334: URL: https://github.com/apache/spark/pull/29334#issuecomment-668981971 @ScrapCodes, yes, the m2 is corrupted on the Jenkins machine. In master, this dependency check is skipped in Jenkins and the GitHub Actions build runs instead. In

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29031: [SPARK-32216][SQL] Remove redundant ProjectExec

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29031: URL: https://github.com/apache/spark/pull/29031#issuecomment-668981428 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29031: [SPARK-32216][SQL] Remove redundant ProjectExec

2020-08-04 Thread GitBox
AmplabJenkins commented on pull request #29031: URL: https://github.com/apache/spark/pull/29031#issuecomment-668981428 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #29031: [SPARK-32216][SQL] Remove redundant ProjectExec

2020-08-04 Thread GitBox
SparkQA commented on pull request #29031: URL: https://github.com/apache/spark/pull/29031#issuecomment-668981110 **[Test build #127077 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127077/testReport)** for PR 29031 at commit

[GitHub] [spark] cloud-fan commented on pull request #29031: [SPARK-32216][SQL] Remove redundant ProjectExec

2020-08-04 Thread GitBox
cloud-fan commented on pull request #29031: URL: https://github.com/apache/spark/pull/29031#issuecomment-668980317 add to whitelist

[GitHub] [spark] allisonwang-db opened a new pull request #29031: [SPARK-32216][SQL] Remove redundant ProjectExec

2020-08-04 Thread GitBox
allisonwang-db opened a new pull request #29031: URL: https://github.com/apache/spark/pull/29031 ### What changes were proposed in this pull request? This PR added a physical rule to remove redundant project nodes. A `ProjectExec` is redundant when 1. It has the same output

[GitHub] [spark] cloud-fan commented on pull request #29031: [SPARK-32216][SQL] Remove redundant ProjectExec

2020-08-04 Thread GitBox
cloud-fan commented on pull request #29031: URL: https://github.com/apache/spark/pull/29031#issuecomment-668980249 retest this please

[GitHub] [spark] cloud-fan closed pull request #29031: [SPARK-32216][SQL] Remove redundant ProjectExec

2020-08-04 Thread GitBox
cloud-fan closed pull request #29031: URL: https://github.com/apache/spark/pull/29031 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

[GitHub] [spark] cloud-fan commented on pull request #29125: [SPARK-32018][SQL][3.0] UnsafeRow.setDecimal should set null with overflowed value

2020-08-04 Thread GitBox
cloud-fan commented on pull request #29125: URL: https://github.com/apache/spark/pull/29125#issuecomment-668980063 @skambha you will still hit the sum bug when you disable whole-stage-codegen (or fall back to it because the generated code exceeds 64kb), right? We are not introducing a new

[GitHub] [spark] ScrapCodes commented on pull request #29334: [WIP][RFC][SPARK-32495][2.4] Update jackson versions to a maintained release, to fix various security vulnerabilities.

2020-08-04 Thread GitBox
ScrapCodes commented on pull request #29334: URL: https://github.com/apache/spark/pull/29334#issuecomment-668977421 Alright, one more: https://github.com/FasterXML/jackson-databind/issues/2798. Shall we consider 2.10.x? The change is the same and the latter is free from a whole storehouse of

[GitHub] [spark] ScrapCodes commented on pull request #29334: [WIP][RFC][SPARK-32495][2.4] Update jackson versions to a maintained release, to fix various security vulnerabilities.

2020-08-04 Thread GitBox
ScrapCodes commented on pull request #29334: URL: https://github.com/apache/spark/pull/29334#issuecomment-668976387 @Fokko, @srowen and @dongjoon-hyun Thanks for giving me the feedback. I agree with you guys. But I wanted to give this patch a try: can it be done in a clean way? This

[GitHub] [spark] MaxGekk commented on pull request #29311: [SPARK-32501][SQL] Convert null to "null" in structs, maps and arrays while casting to strings

2020-08-04 Thread GitBox
MaxGekk commented on pull request #29311: URL: https://github.com/apache/spark/pull/29311#issuecomment-668976181 @cloud-fan @maropu @HyukjinKwon Please, review this PR.

[GitHub] [spark] HyukjinKwon commented on pull request #29354: [WIP][Spark-32533][SQL] Improve Avro read/write performance on nested structs and array of structs

2020-08-04 Thread GitBox
HyukjinKwon commented on pull request #29354: URL: https://github.com/apache/spark/pull/29354#issuecomment-668975373 It might be great if we can elaborate on how it improves performance. We can focus on the fix only instead of mixing refactoring here.

[GitHub] [spark] HyukjinKwon commented on a change in pull request #29354: [WIP][Spark-32533][SQL] Improve Avro read/write performance on nested structs and array of structs

2020-08-04 Thread GitBox
HyukjinKwon commented on a change in pull request #29354: URL: https://github.com/apache/spark/pull/29354#discussion_r465466999 ## File path: external/avro/src/main/scala/org/apache/spark/sql/avro/AvroDeserializer.scala ## @@ -1,427 +0,0 @@ -/* Review comment: Hey,

[GitHub] [spark] HyukjinKwon commented on a change in pull request #29353: [SPARK-32532][SQL] Improve ORC read/write performance on nested structs and array of structs

2020-08-04 Thread GitBox
HyukjinKwon commented on a change in pull request #29353: URL: https://github.com/apache/spark/pull/29353#discussion_r465466392 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcDeserializer.scala ## @@ -72,137 +74,191 @@ class

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29211: [SPARK-31197][CORE] Shutdown executor once we are done decommissioning

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29211: URL: https://github.com/apache/spark/pull/29211#issuecomment-668973946 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29211: [SPARK-31197][CORE] Shutdown executor once we are done decommissioning

2020-08-04 Thread GitBox
AmplabJenkins commented on pull request #29211: URL: https://github.com/apache/spark/pull/29211#issuecomment-668973946 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #29211: [SPARK-31197][CORE] Shutdown executor once we are done decommissioning

2020-08-04 Thread GitBox
SparkQA commented on pull request #29211: URL: https://github.com/apache/spark/pull/29211#issuecomment-668973694 **[Test build #127076 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127076/testReport)** for PR 29211 at commit

[GitHub] [spark] SparkQA commented on pull request #29339: [Spark-32512][SQL] add alter table add/drop partition command for datasourcev2

2020-08-04 Thread GitBox
SparkQA commented on pull request #29339: URL: https://github.com/apache/spark/pull/29339#issuecomment-668973681 **[Test build #127075 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127075/testReport)** for PR 29339 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29339: [Spark-32512][SQL] add alter table add/drop partition command for datasourcev2

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29339: URL: https://github.com/apache/spark/pull/29339#issuecomment-668972189 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29339: [Spark-32512][SQL] add alter table add/drop partition command for datasourcev2

2020-08-04 Thread GitBox
AmplabJenkins commented on pull request #29339: URL: https://github.com/apache/spark/pull/29339#issuecomment-668972189 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] HyukjinKwon commented on a change in pull request #29353: [SPARK-32532][SQL] Improve ORC read/write performance on nested structs and array of structs

2020-08-04 Thread GitBox
HyukjinKwon commented on a change in pull request #29353: URL: https://github.com/apache/spark/pull/29353#discussion_r465460837 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcDeserializer.scala ## @@ -72,137 +74,191 @@ class

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29333: [WIP][SPARK-32357][INFRA] Publish failed and succeeded test reports in GitHub Actions

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29333: URL: https://github.com/apache/spark/pull/29333#issuecomment-668968321 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29333: [WIP][SPARK-32357][INFRA] Publish failed and succeeded test reports in GitHub Actions

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29333: URL: https://github.com/apache/spark/pull/29333#issuecomment-668968315 Merged build finished. Test FAILed.

[GitHub] [spark] AmplabJenkins commented on pull request #29333: [WIP][SPARK-32357][INFRA] Publish failed and succeeded test reports in GitHub Actions

2020-08-04 Thread GitBox
AmplabJenkins commented on pull request #29333: URL: https://github.com/apache/spark/pull/29333#issuecomment-668968315 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA removed a comment on pull request #29333: [WIP][SPARK-32357][INFRA] Publish failed and succeeded test reports in GitHub Actions

2020-08-04 Thread GitBox
SparkQA removed a comment on pull request #29333: URL: https://github.com/apache/spark/pull/29333#issuecomment-668937610 **[Test build #127073 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127073/testReport)** for PR 29333 at commit

[GitHub] [spark] SparkQA commented on pull request #29333: [WIP][SPARK-32357][INFRA] Publish failed and succeeded test reports in GitHub Actions

2020-08-04 Thread GitBox
SparkQA commented on pull request #29333: URL: https://github.com/apache/spark/pull/29333#issuecomment-668968035 **[Test build #127073 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127073/testReport)** for PR 29333 at commit

[GitHub] [spark] HyukjinKwon commented on a change in pull request #29349: [SPARK-32528][SQL][TEST] The analyze method should make sure the plan is analyzed

2020-08-04 Thread GitBox
HyukjinKwon commented on a change in pull request #29349: URL: https://github.com/apache/spark/pull/29349#discussion_r465458165 ## File path: sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/analysis/AnalysisSuite.scala ## @@ -47,6 +48,13 @@ import

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29332: [SPARK-32518][CORE] CoarseGrainedSchedulerBackend.maxNumConcurrentTasks should consider all kinds of resources

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29332: URL: https://github.com/apache/spark/pull/29332#issuecomment-668959724 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29332: [SPARK-32518][CORE] CoarseGrainedSchedulerBackend.maxNumConcurrentTasks should consider all kinds of resources

2020-08-04 Thread GitBox
AmplabJenkins commented on pull request #29332: URL: https://github.com/apache/spark/pull/29332#issuecomment-668959724 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29332: [SPARK-32518][CORE] CoarseGrainedSchedulerBackend.maxNumConcurrentTasks should consider all kinds of resources

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29332: URL: https://github.com/apache/spark/pull/29332#issuecomment-668845081 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] SparkQA commented on pull request #29332: [SPARK-32518][CORE] CoarseGrainedSchedulerBackend.maxNumConcurrentTasks should consider all kinds of resources

2020-08-04 Thread GitBox
SparkQA commented on pull request #29332: URL: https://github.com/apache/spark/pull/29332#issuecomment-668959450 **[Test build #127074 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127074/testReport)** for PR 29332 at commit

[GitHub] [spark] beliefer commented on pull request #27429: [SPARK-28330][SQL] Support ANSI SQL: result offset clause in query expression

2020-08-04 Thread GitBox
beliefer commented on pull request #27429: URL: https://github.com/apache/spark/pull/27429#issuecomment-668956644 cc @cloud-fan

[GitHub] [spark] stczwd commented on a change in pull request #29339: [Spark-32512][SQL] add alter table add/drop partition command for datasourcev2

2020-08-04 Thread GitBox
stczwd commented on a change in pull request #29339: URL: https://github.com/apache/spark/pull/29339#discussion_r465425990 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/AlterTableDropPartitionExec.scala ## @@ -0,0 +1,58 @@ +/* + *

[GitHub] [spark] HyukjinKwon commented on pull request #29333: [WIP][SPARK-32357][INFRA] Publish failed and succeeded test reports in GitHub Actions

2020-08-04 Thread GitBox
HyukjinKwon commented on pull request #29333: URL: https://github.com/apache/spark/pull/29333#issuecomment-668945164 The latest commit above (https://github.com/apache/spark/pull/29333/commits/2688f21f1852b3e6a577fa9292985b346b9bdf6d) contains the problem in terms of forked repos and PRs

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29333: [WIP][SPARK-32357][INFRA] Publish failed and succeeded test reports in GitHub Actions

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29333: URL: https://github.com/apache/spark/pull/29333#issuecomment-668938127 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29333: [WIP][SPARK-32357][INFRA] Publish failed and succeeded test reports in GitHub Actions

2020-08-04 Thread GitBox
AmplabJenkins commented on pull request #29333: URL: https://github.com/apache/spark/pull/29333#issuecomment-668938127 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #29333: [WIP][SPARK-32357][INFRA] Publish failed and succeeded test reports in GitHub Actions

2020-08-04 Thread GitBox
SparkQA commented on pull request #29333: URL: https://github.com/apache/spark/pull/29333#issuecomment-668937610 **[Test build #127073 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127073/testReport)** for PR 29333 at commit

[GitHub] [spark] HyukjinKwon commented on pull request #29333: [WIP][SPARK-32357][INFRA] Publish failed and succeeded test reports in GitHub Actions

2020-08-04 Thread GitBox
HyukjinKwon commented on pull request #29333: URL: https://github.com/apache/spark/pull/29333#issuecomment-668935709 I am making some changes to demonstrate the problem in terms of the fork and PRs. Please ignore the changes made from now on. I will switch back from the draft later when

[GitHub] [spark] HyukjinKwon closed pull request #29320: [SPARK-32507][DOCS][PYTHON] Add main page for PySpark documentation

2020-08-04 Thread GitBox
HyukjinKwon closed pull request #29320: URL: https://github.com/apache/spark/pull/29320 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[GitHub] [spark] HyukjinKwon commented on pull request #29320: [SPARK-32507][DOCS][PYTHON] Add main page for PySpark documentation

2020-08-04 Thread GitBox
HyukjinKwon commented on pull request #29320: URL: https://github.com/apache/spark/pull/29320#issuecomment-668934219 Thank you @viirya for approaching this. I am merging this to master. This is an automated message from the

[GitHub] [spark] HyukjinKwon commented on pull request #29333: [WIP][SPARK-32357][INFRA] Publish failed and succeeded test reports in GitHub Actions

2020-08-04 Thread GitBox
HyukjinKwon commented on pull request #29333: URL: https://github.com/apache/spark/pull/29333#issuecomment-668933522 Sure, let me take a closer look at that approach. In the worst case, we might have to drop this and go back to @viirya's original approach in #29169.

[GitHub] [spark] wangshisan commented on pull request #29266: [SPARK-32464][SQL] Support skew handling on join that has one side wi…

2020-08-04 Thread GitBox
wangshisan commented on pull request #29266: URL: https://github.com/apache/spark/pull/29266#issuecomment-668932395 > Is this with AQE? If so, can we please add that to the description? It might be nice to describe the approach taken to handle it in the description as well. Added.

[GitHub] [spark] wangshisan commented on a change in pull request #29266: [SPARK-32464][SQL] Support skew handling on join that has one side wi…

2020-08-04 Thread GitBox
wangshisan commented on a change in pull request #29266: URL: https://github.com/apache/spark/pull/29266#discussion_r465428028 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/adaptive/OptimizeSkewedJoin.scala ## @@ -250,6 +251,85 @@ case class

[GitHub] [spark] wangshisan edited a comment on pull request #29266: [SPARK-32464][SQL] Support skew handling on join that has one side wi…

2020-08-04 Thread GitBox
wangshisan edited a comment on pull request #29266: URL: https://github.com/apache/spark/pull/29266#issuecomment-668926319 > Yea, I'm also wondering about the approach here. The skew join handling needs to split the skew side, and repeat the other side. I don't think we can split the buckets of

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29339: [Spark-32512][SQL][WIP] add alter table add/drop partition command for datasourcev2

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29339: URL: https://github.com/apache/spark/pull/29339#issuecomment-668927448 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29339: [Spark-32512][SQL][WIP] add alter table add/drop partition command for datasourcev2

2020-08-04 Thread GitBox
AmplabJenkins commented on pull request #29339: URL: https://github.com/apache/spark/pull/29339#issuecomment-668927448 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] stczwd commented on a change in pull request #29339: [Spark-32512][SQL][WIP] add alter table add/drop partition command for datasourcev2

2020-08-04 Thread GitBox
stczwd commented on a change in pull request #29339: URL: https://github.com/apache/spark/pull/29339#discussion_r465425990 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/AlterTableDropPartitionExec.scala ## @@ -0,0 +1,58 @@ +/* + *

[GitHub] [spark] SparkQA commented on pull request #29339: [Spark-32512][SQL][WIP] add alter table add/drop partition command for datasourcev2

2020-08-04 Thread GitBox
SparkQA commented on pull request #29339: URL: https://github.com/apache/spark/pull/29339#issuecomment-668927081 **[Test build #127072 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127072/testReport)** for PR 29339 at commit

[GitHub] [spark] wangshisan commented on pull request #29266: [SPARK-32464][SQL] Support skew handling on join that has one side wi…

2020-08-04 Thread GitBox
wangshisan commented on pull request #29266: URL: https://github.com/apache/spark/pull/29266#issuecomment-668926319 > Yea, I'm also wondering about the approach here. The skew join handling needs to split the skew side, and repeat the other side. I don't think we can split the buckets of

[GitHub] [spark] wangshisan commented on a change in pull request #29266: [SPARK-32464][SQL] Support skew handling on join that has one side wi…

2020-08-04 Thread GitBox
wangshisan commented on a change in pull request #29266: URL: https://github.com/apache/spark/pull/29266#discussion_r465422329 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/adaptive/OptimizeSkewedJoin.scala ## @@ -250,6 +251,85 @@ case class

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29342: [SPARK-32399][SQL] Full outer shuffled hash join

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29342: URL: https://github.com/apache/spark/pull/29342#issuecomment-668916700 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29342: [SPARK-32399][SQL] Full outer shuffled hash join

2020-08-04 Thread GitBox
AmplabJenkins commented on pull request #29342: URL: https://github.com/apache/spark/pull/29342#issuecomment-668916700 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA removed a comment on pull request #29342: [SPARK-32399][SQL] Full outer shuffled hash join

2020-08-04 Thread GitBox
SparkQA removed a comment on pull request #29342: URL: https://github.com/apache/spark/pull/29342#issuecomment-668826573 **[Test build #127065 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127065/testReport)** for PR 29342 at commit

[GitHub] [spark] SparkQA commented on pull request #29342: [SPARK-32399][SQL] Full outer shuffled hash join

2020-08-04 Thread GitBox
SparkQA commented on pull request #29342: URL: https://github.com/apache/spark/pull/29342#issuecomment-668916087 **[Test build #127065 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127065/testReport)** for PR 29342 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29353: [Spark-32532][SQL] Improve ORC read/write performance on nested structs and array of structs

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29353: URL: https://github.com/apache/spark/pull/29353#issuecomment-668914789 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29353: [Spark-32532][SQL] Improve ORC read/write performance on nested structs and array of structs

2020-08-04 Thread GitBox
AmplabJenkins commented on pull request #29353: URL: https://github.com/apache/spark/pull/29353#issuecomment-668914789 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #29353: [Spark-32532][SQL] Improve ORC read/write performance on nested structs and array of structs

2020-08-04 Thread GitBox
SparkQA commented on pull request #29353: URL: https://github.com/apache/spark/pull/29353#issuecomment-668914387 **[Test build #127071 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127071/testReport)** for PR 29353 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29353: [Spark-32532][SQL] Improve ORC read/write performance on nested structs and array of structs

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29353: URL: https://github.com/apache/spark/pull/29353#issuecomment-668823788 Can one of the admins verify this patch? This is an automated message from the Apache Git

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29352: [SPARK-32531][SQL][TEST] Add benchmarks for nested structs and arrays for different file formats

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29352: URL: https://github.com/apache/spark/pull/29352#issuecomment-668912120 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] viirya commented on pull request #29353: [Spark-32532][SQL] Improve ORC read/write performance on nested structs and array of structs

2020-08-04 Thread GitBox
viirya commented on pull request #29353: URL: https://github.com/apache/spark/pull/29353#issuecomment-668912282 ok to test This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins commented on pull request #29352: [SPARK-32531][SQL][TEST] Add benchmarks for nested structs and arrays for different file formats

2020-08-04 Thread GitBox
AmplabJenkins commented on pull request #29352: URL: https://github.com/apache/spark/pull/29352#issuecomment-668912120 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #29352: [SPARK-32531][SQL][TEST] Add benchmarks for nested structs and arrays for different file formats

2020-08-04 Thread GitBox
SparkQA commented on pull request #29352: URL: https://github.com/apache/spark/pull/29352#issuecomment-668911605 **[Test build #127070 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127070/testReport)** for PR 29352 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29352: [SPARK-32531][SQL][TEST] Add benchmarks for nested structs and arrays for different file formats

2020-08-04 Thread GitBox
AmplabJenkins removed a comment on pull request #29352: URL: https://github.com/apache/spark/pull/29352#issuecomment-668813824 Can one of the admins verify this patch? This is an automated message from the Apache Git

[GitHub] [spark] viirya commented on pull request #29352: [SPARK-32531][SQL][TEST] Add benchmarks for nested structs and arrays for different file formats

2020-08-04 Thread GitBox
viirya commented on pull request #29352: URL: https://github.com/apache/spark/pull/29352#issuecomment-668910801 ok to test This is an automated message from the Apache Git Service. To respond to the message, please log on to
