[GitHub] [spark] AmplabJenkins removed a comment on pull request #28641: [SPARK-31824][CORE][TESTS] DAGSchedulerSuite: Improve and reuse completeShuffleMapStageSuccessfully

2020-06-12 Thread GitBox
AmplabJenkins removed a comment on pull request #28641: URL: https://github.com/apache/spark/pull/28641#issuecomment-643572368 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] SparkQA commented on pull request #28817: [WIP][SPARK-31197][CORE] Exit the executor once all tasks and migrations are finished built on top of on top of spark20629

2020-06-12 Thread GitBox
SparkQA commented on pull request #28817: URL: https://github.com/apache/spark/pull/28817#issuecomment-643573565 **[Test build #123957 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/123957/testReport)** for PR 28817 at commit [`ea8efc7`](https://github.co

[GitHub] [spark] SparkQA removed a comment on pull request #28817: [WIP][SPARK-31197][CORE] Exit the executor once all tasks and migrations are finished built on top of on top of spark20629

2020-06-12 Thread GitBox
SparkQA removed a comment on pull request #28817: URL: https://github.com/apache/spark/pull/28817#issuecomment-643555188 **[Test build #123957 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/123957/testReport)** for PR 28817 at commit [`ea8efc7`](https://gi

[GitHub] [spark] AmplabJenkins commented on pull request #28817: [WIP][SPARK-31197][CORE] Exit the executor once all tasks and migrations are finished built on top of on top of spark20629

2020-06-12 Thread GitBox
AmplabJenkins commented on pull request #28817: URL: https://github.com/apache/spark/pull/28817#issuecomment-643573679 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28817: [WIP][SPARK-31197][CORE] Exit the executor once all tasks and migrations are finished built on top of on top of spark20629

2020-06-12 Thread GitBox
AmplabJenkins removed a comment on pull request #28817: URL: https://github.com/apache/spark/pull/28817#issuecomment-643573679 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To r

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28817: [WIP][SPARK-31197][CORE] Exit the executor once all tasks and migrations are finished built on top of on top of spark20629

2020-06-12 Thread GitBox
AmplabJenkins removed a comment on pull request #28817: URL: https://github.com/apache/spark/pull/28817#issuecomment-643573682 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/123

[GitHub] [spark] SparkQA commented on pull request #28708: [SPARK-20629][CORE][K8S] Copy shuffle data when nodes are being shutdown

2020-06-12 Thread GitBox
SparkQA commented on pull request #28708: URL: https://github.com/apache/spark/pull/28708#issuecomment-643575263 **[Test build #123960 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/123960/testReport)** for PR 28708 at commit [`fe34308`](https://github.co

[GitHub] [spark] AmplabJenkins commented on pull request #28708: [SPARK-20629][CORE][K8S] Copy shuffle data when nodes are being shutdown

2020-06-12 Thread GitBox
AmplabJenkins commented on pull request #28708: URL: https://github.com/apache/spark/pull/28708#issuecomment-643575373 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

[GitHub] [spark] SparkQA removed a comment on pull request #28708: [SPARK-20629][CORE][K8S] Copy shuffle data when nodes are being shutdown

2020-06-12 Thread GitBox
SparkQA removed a comment on pull request #28708: URL: https://github.com/apache/spark/pull/28708#issuecomment-643560584 **[Test build #123960 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/123960/testReport)** for PR 28708 at commit [`fe34308`](https://gi

[GitHub] [spark] SparkQA commented on pull request #28708: [SPARK-20629][CORE][K8S] Copy shuffle data when nodes are being shutdown

2020-06-12 Thread GitBox
SparkQA commented on pull request #28708: URL: https://github.com/apache/spark/pull/28708#issuecomment-643575296 **[Test build #123964 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/123964/testReport)** for PR 28708 at commit [`da1db47`](https://github.com

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28708: [SPARK-20629][CORE][K8S] Copy shuffle data when nodes are being shutdown

2020-06-12 Thread GitBox
AmplabJenkins removed a comment on pull request #28708: URL: https://github.com/apache/spark/pull/28708#issuecomment-643575373 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To r

[GitHub] [spark] holdenk commented on pull request #28708: [SPARK-20629][CORE][K8S] Copy shuffle data when nodes are being shutdown

2020-06-12 Thread GitBox
holdenk commented on pull request #28708: URL: https://github.com/apache/spark/pull/28708#issuecomment-643575581 Yeah so the plan is to trigger an exit as soon as migrations are completed. I think a good follow up to https://issues.apache.org/jira/browse/SPARK-31197 would be adding a time

[GitHub] [spark] wangyum commented on pull request #28032: [SPARK-31264][SQL] Repartition by dynamic partition columns before insert partition table

2020-06-12 Thread GitBox
wangyum commented on pull request #28032: URL: https://github.com/apache/spark/pull/28032#issuecomment-643575562 Thank you @gengliangwang The root cause is repartition by dynamic partition columns can significantly reduce the number of files: ![image](https://user-images.githubuserconte

[GitHub] [spark] sarutak commented on pull request #28803: [SPARK-31971][WEBUI] Add pagination support for all jobs timeline

2020-06-12 Thread GitBox
sarutak commented on pull request #28803: URL: https://github.com/apache/spark/pull/28803#issuecomment-643576377 > I think we should make the two pagination sections consistent The upper text fields are for timeline, while the lower ones are for the table so I think they should be indepe

[GitHub] [spark] wangyum edited a comment on pull request #28032: [SPARK-31264][SQL] Repartition by dynamic partition columns before insert partition table

2020-06-12 Thread GitBox
wangyum edited a comment on pull request #28032: URL: https://github.com/apache/spark/pull/28032#issuecomment-643575562 Thank you @gengliangwang The root cause is it takes too much time to rename files: ![image](https://user-images.githubusercontent.com/5399861/84561473-4a773680-ad7f-11

[GitHub] [spark] sarutak edited a comment on pull request #28803: [SPARK-31971][WEBUI] Add pagination support for all jobs timeline

2020-06-12 Thread GitBox
sarutak edited a comment on pull request #28803: URL: https://github.com/apache/spark/pull/28803#issuecomment-643576377 > I think we should make the two pagination sections consistent The upper text fields are for timeline, while the lower ones are for the table so I think they shoul

[GitHub] [spark] SparkQA commented on pull request #28803: [SPARK-31971][WEBUI] Add pagination support for all jobs timeline

2020-06-12 Thread GitBox
SparkQA commented on pull request #28803: URL: https://github.com/apache/spark/pull/28803#issuecomment-643576740 **[Test build #123965 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/123965/testReport)** for PR 28803 at commit [`0270bb9`](https://github.com

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28803: [SPARK-31971][WEBUI] Add pagination support for all jobs timeline

2020-06-12 Thread GitBox
AmplabJenkins removed a comment on pull request #28803: URL: https://github.com/apache/spark/pull/28803#issuecomment-643576860 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #28803: [SPARK-31971][WEBUI] Add pagination support for all jobs timeline

2020-06-12 Thread GitBox
AmplabJenkins commented on pull request #28803: URL: https://github.com/apache/spark/pull/28803#issuecomment-643576860 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

[GitHub] [spark] SparkQA commented on pull request #28708: [SPARK-20629][CORE][K8S] Copy shuffle data when nodes are being shutdown

2020-06-12 Thread GitBox
SparkQA commented on pull request #28708: URL: https://github.com/apache/spark/pull/28708#issuecomment-643577401 **[Test build #123958 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/123958/testReport)** for PR 28708 at commit [`0ea927d`](https://github.co

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28708: [SPARK-20629][CORE][K8S] Copy shuffle data when nodes are being shutdown

2020-06-12 Thread GitBox
AmplabJenkins removed a comment on pull request #28708: URL: https://github.com/apache/spark/pull/28708#issuecomment-643575378 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/123

[GitHub] [spark] SparkQA removed a comment on pull request #28708: [SPARK-20629][CORE][K8S] Copy shuffle data when nodes are being shutdown

2020-06-12 Thread GitBox
SparkQA removed a comment on pull request #28708: URL: https://github.com/apache/spark/pull/28708#issuecomment-643555962 **[Test build #123958 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/123958/testReport)** for PR 28708 at commit [`0ea927d`](https://gi

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28708: [SPARK-20629][CORE][K8S] Copy shuffle data when nodes are being shutdown

2020-06-12 Thread GitBox
AmplabJenkins removed a comment on pull request #28708: URL: https://github.com/apache/spark/pull/28708#issuecomment-643577621 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #28708: [SPARK-20629][CORE][K8S] Copy shuffle data when nodes are being shutdown

2020-06-12 Thread GitBox
AmplabJenkins commented on pull request #28708: URL: https://github.com/apache/spark/pull/28708#issuecomment-643577621 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

[GitHub] [spark] viirya commented on a change in pull request #28123: [SPARK-31350][SQL] Coalesce bucketed tables for sort merge join if applicable

2020-06-12 Thread GitBox
viirya commented on a change in pull request #28123: URL: https://github.com/apache/spark/pull/28123#discussion_r439714718 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/bucketing/CoalesceBucketsInSortMergeJoin.scala ## @@ -0,0 +1,112 @@ +/* + * Licensed

[GitHub] [spark] SparkQA commented on pull request #28708: [SPARK-20629][CORE][K8S] Copy shuffle data when nodes are being shutdown

2020-06-12 Thread GitBox
SparkQA commented on pull request #28708: URL: https://github.com/apache/spark/pull/28708#issuecomment-643579696 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/28586/ -

[GitHub] [spark] sarutak opened a new pull request #28820: [SPARK-31632][CORE][WEBUI][FOLLOWUP] Enrich the exception message when application summary is unavailable

2020-06-12 Thread GitBox
sarutak opened a new pull request #28820: URL: https://github.com/apache/spark/pull/28820 ### What changes were proposed in this pull request? This PR enriches the exception message when application summary is not available. #28444 covers the case when application information is n

[GitHub] [spark] sarutak commented on pull request #28820: [SPARK-31632][CORE][WEBUI][FOLLOWUP] Enrich the exception message when application summary is unavailable

2020-06-12 Thread GitBox
sarutak commented on pull request #28820: URL: https://github.com/apache/spark/pull/28820#issuecomment-643581183 The difference in error message format between this PR and #28444 is due to the Jetty version. Jetty was upgraded in #28585.

[GitHub] [spark] AmplabJenkins commented on pull request #28820: [SPARK-31632][CORE][WEBUI][FOLLOWUP] Enrich the exception message when application summary is unavailable

2020-06-12 Thread GitBox
AmplabJenkins commented on pull request #28820: URL: https://github.com/apache/spark/pull/28820#issuecomment-643581126 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu
