[GitHub] [spark] AmplabJenkins removed a comment on pull request #30026: [SPARK-32978][SQL] Make sure the number of dynamic part metric is correct

2020-10-16 Thread GitBox
AmplabJenkins removed a comment on pull request #30026: URL: https://github.com/apache/spark/pull/30026#issuecomment-710303567 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To r

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30026: [SPARK-32978][SQL] Make sure the number of dynamic part metric is correct

2020-10-16 Thread GitBox
AmplabJenkins removed a comment on pull request #30026: URL: https://github.com/apache/spark/pull/30026#issuecomment-710303576 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/129

[GitHub] [spark] SparkQA commented on pull request #28618: [SPARK-31801][API][SHUFFLE] Register map output metadata

2020-10-16 Thread GitBox
SparkQA commented on pull request #28618: URL: https://github.com/apache/spark/pull/28618#issuecomment-710308440 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34512/ -

[GitHub] [spark] SparkQA commented on pull request #29843: [SPARK-29250][BUILD] Upgrade to Hadoop 3.2.1 and move to shaded client

2020-10-16 Thread GitBox
SparkQA commented on pull request #29843: URL: https://github.com/apache/spark/pull/29843#issuecomment-710315519 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34511/ -

[GitHub] [spark] SparkQA commented on pull request #30056: [WIP][SPARK-33160][SQL] Allow saving/loading INT96 in parquet w/o rebasing

2020-10-16 Thread GitBox
SparkQA commented on pull request #30056: URL: https://github.com/apache/spark/pull/30056#issuecomment-710337119 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34510/ ---

[GitHub] [spark] AmplabJenkins commented on pull request #30056: [WIP][SPARK-33160][SQL] Allow saving/loading INT96 in parquet w/o rebasing

2020-10-16 Thread GitBox
AmplabJenkins commented on pull request #30056: URL: https://github.com/apache/spark/pull/30056#issuecomment-710337186 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

[GitHub] [spark] SparkQA commented on pull request #30056: [WIP][SPARK-33160][SQL] Allow saving/loading INT96 in parquet w/o rebasing

2020-10-16 Thread GitBox
SparkQA commented on pull request #30056: URL: https://github.com/apache/spark/pull/30056#issuecomment-710338753 **[Test build #129897 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129897/testReport)** for PR 30056 at commit [`fb67c68`](https://github.co

[GitHub] [spark] SparkQA removed a comment on pull request #30056: [WIP][SPARK-33160][SQL] Allow saving/loading INT96 in parquet w/o rebasing

2020-10-16 Thread GitBox
SparkQA removed a comment on pull request #30056: URL: https://github.com/apache/spark/pull/30056#issuecomment-710045243 **[Test build #129897 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129897/testReport)** for PR 30056 at commit [`fb67c68`](https://gi

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30056: [WIP][SPARK-33160][SQL] Allow saving/loading INT96 in parquet w/o rebasing

2020-10-16 Thread GitBox
AmplabJenkins removed a comment on pull request #30056: URL: https://github.com/apache/spark/pull/30056#issuecomment-710337186 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #30056: [WIP][SPARK-33160][SQL] Allow saving/loading INT96 in parquet w/o rebasing

2020-10-16 Thread GitBox
AmplabJenkins commented on pull request #30056: URL: https://github.com/apache/spark/pull/30056#issuecomment-710346237 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

[GitHub] [spark] SparkQA commented on pull request #28618: [SPARK-31801][API][SHUFFLE] Register map output metadata

2020-10-16 Thread GitBox
SparkQA commented on pull request #28618: URL: https://github.com/apache/spark/pull/28618#issuecomment-710345518 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34512/ ---

[GitHub] [spark] SparkQA commented on pull request #29843: [SPARK-29250][BUILD] Upgrade to Hadoop 3.2.1 and move to shaded client

2020-10-16 Thread GitBox
SparkQA commented on pull request #29843: URL: https://github.com/apache/spark/pull/29843#issuecomment-710344681 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34513/ -

[GitHub] [spark] AmplabJenkins commented on pull request #28618: [SPARK-31801][API][SHUFFLE] Register map output metadata

2020-10-16 Thread GitBox
AmplabJenkins commented on pull request #28618: URL: https://github.com/apache/spark/pull/28618#issuecomment-710345625 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

[GitHub] [spark] dongjoon-hyun closed pull request #30012: [SPARK-XXX][INFRA] Rebalance GitHub Action jobs

2020-10-16 Thread GitBox
dongjoon-hyun closed pull request #30012: URL: https://github.com/apache/spark/pull/30012 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30056: [WIP][SPARK-33160][SQL] Allow saving/loading INT96 in parquet w/o rebasing

2020-10-16 Thread GitBox
AmplabJenkins removed a comment on pull request #30056: URL: https://github.com/apache/spark/pull/30056#issuecomment-710346237 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] SparkQA commented on pull request #30068: [SPARK-33171][INFRA] Mark ParquetV*FilterSuite/ParquetV*SchemaPruningSuite as ExtendedSQLTest

2020-10-16 Thread GitBox
SparkQA commented on pull request #30068: URL: https://github.com/apache/spark/pull/30068#issuecomment-710349087 **[Test build #129908 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129908/testReport)** for PR 30068 at commit [`767763b`](https://github.com

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28618: [SPARK-31801][API][SHUFFLE] Register map output metadata

2020-10-16 Thread GitBox
AmplabJenkins removed a comment on pull request #28618: URL: https://github.com/apache/spark/pull/28618#issuecomment-710345625 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29843: [SPARK-29250][BUILD] Upgrade to Hadoop 3.2.1 and move to shaded client

2020-10-16 Thread GitBox
AmplabJenkins commented on pull request #29843: URL: https://github.com/apache/spark/pull/29843#issuecomment-710356575 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

[GitHub] [spark] SparkQA commented on pull request #29843: [SPARK-29250][BUILD] Upgrade to Hadoop 3.2.1 and move to shaded client

2020-10-16 Thread GitBox
SparkQA commented on pull request #29843: URL: https://github.com/apache/spark/pull/29843#issuecomment-710356466 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34511/ ---

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29843: [SPARK-29250][BUILD] Upgrade to Hadoop 3.2.1 and move to shaded client

2020-10-16 Thread GitBox
AmplabJenkins removed a comment on pull request #29843: URL: https://github.com/apache/spark/pull/29843#issuecomment-710356575 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To r

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29843: [SPARK-29250][BUILD] Upgrade to Hadoop 3.2.1 and move to shaded client

2020-10-16 Thread GitBox
AmplabJenkins removed a comment on pull request #29843: URL: https://github.com/apache/spark/pull/29843#issuecomment-710356604 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8

[GitHub] [spark] SparkQA commented on pull request #29587: [SPARK-32376][SQL] Make unionByName null-filling behavior work with struct columns

2020-10-16 Thread GitBox
SparkQA commented on pull request #29587: URL: https://github.com/apache/spark/pull/29587#issuecomment-710378955 **[Test build #129909 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129909/testReport)** for PR 29587 at commit [`2ca1379`](https://github.com

[GitHub] [spark] AmplabJenkins commented on pull request #29843: [SPARK-29250][BUILD] Upgrade to Hadoop 3.2.1 and move to shaded client

2020-10-16 Thread GitBox
AmplabJenkins commented on pull request #29843: URL: https://github.com/apache/spark/pull/29843#issuecomment-710385639 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

[GitHub] [spark] SparkQA commented on pull request #29843: [SPARK-29250][BUILD] Upgrade to Hadoop 3.2.1 and move to shaded client

2020-10-16 Thread GitBox
SparkQA commented on pull request #29843: URL: https://github.com/apache/spark/pull/29843#issuecomment-710385603 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34513/ ---

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29843: [SPARK-29250][BUILD] Upgrade to Hadoop 3.2.1 and move to shaded client

2020-10-16 Thread GitBox
AmplabJenkins removed a comment on pull request #29843: URL: https://github.com/apache/spark/pull/29843#issuecomment-710385639 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To r

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29843: [SPARK-29250][BUILD] Upgrade to Hadoop 3.2.1 and move to shaded client

2020-10-16 Thread GitBox
AmplabJenkins removed a comment on pull request #29843: URL: https://github.com/apache/spark/pull/29843#issuecomment-710385652 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8

[GitHub] [spark] lidavidm commented on pull request #29818: [SPARK-32953][PYTHON] Add Arrow self_destruct support to toPandas

2020-10-16 Thread GitBox
lidavidm commented on pull request #29818: URL: https://github.com/apache/spark/pull/29818#issuecomment-710415159 Running the demo again gives these two plots. While the memory usage looks identical, in the no-self-destruct case, Python gets OOMKilled, while it does not get OOMKilled in th

[GitHub] [spark] SparkQA commented on pull request #29818: [SPARK-32953][PYTHON] Add Arrow self_destruct support to toPandas

2020-10-16 Thread GitBox
SparkQA commented on pull request #29818: URL: https://github.com/apache/spark/pull/29818#issuecomment-710422635 **[Test build #129910 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129910/testReport)** for PR 29818 at commit [`4fef9d9`](https://github.com

[GitHub] [spark] SparkQA commented on pull request #30068: [SPARK-33171][INFRA] Mark ParquetV*FilterSuite/ParquetV*SchemaPruningSuite as ExtendedSQLTest

2020-10-16 Thread GitBox
SparkQA commented on pull request #30068: URL: https://github.com/apache/spark/pull/30068#issuecomment-710427152 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34514/ -

[GitHub] [spark] SparkQA commented on pull request #29587: [SPARK-32376][SQL] Make unionByName null-filling behavior work with struct columns

2020-10-16 Thread GitBox
SparkQA commented on pull request #29587: URL: https://github.com/apache/spark/pull/29587#issuecomment-710448782 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34515/ -

[GitHub] [spark] sunchao commented on a change in pull request #29843: [SPARK-29250][BUILD] Upgrade to Hadoop 3.2.1 and move to shaded client

2020-10-16 Thread GitBox
sunchao commented on a change in pull request #29843: URL: https://github.com/apache/spark/pull/29843#discussion_r506671572 ## File path: pom.xml ## @@ -2393,17 +2435,6 @@ - Review comment: I had to remove this

[GitHub] [spark] AmplabJenkins commented on pull request #30068: [SPARK-33171][INFRA] Mark ParquetV*FilterSuite/ParquetV*SchemaPruningSuite as ExtendedSQLTest

2020-10-16 Thread GitBox
AmplabJenkins commented on pull request #30068: URL: https://github.com/apache/spark/pull/30068#issuecomment-710456182 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

[GitHub] [spark] SparkQA commented on pull request #30068: [SPARK-33171][INFRA] Mark ParquetV*FilterSuite/ParquetV*SchemaPruningSuite as ExtendedSQLTest

2020-10-16 Thread GitBox
SparkQA commented on pull request #30068: URL: https://github.com/apache/spark/pull/30068#issuecomment-710456094 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34514/ ---

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30068: [SPARK-33171][INFRA] Mark ParquetV*FilterSuite/ParquetV*SchemaPruningSuite as ExtendedSQLTest

2020-10-16 Thread GitBox
AmplabJenkins removed a comment on pull request #30068: URL: https://github.com/apache/spark/pull/30068#issuecomment-710456182 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] dongjoon-hyun commented on pull request #29843: [SPARK-29250][BUILD][test-maven][test-hadoop2.7] Upgrade to Hadoop 3.2.1 and move to shaded client

2020-10-16 Thread GitBox
dongjoon-hyun commented on pull request #29843: URL: https://github.com/apache/spark/pull/29843#issuecomment-710459754 Retest this please This is an automated message from the Apache Git Service. To respond to the message, pl

[GitHub] [spark] SparkQA commented on pull request #29843: [SPARK-29250][BUILD][test-maven][test-hadoop2.7] Upgrade to Hadoop 3.2.1 and move to shaded client

2020-10-16 Thread GitBox
SparkQA commented on pull request #29843: URL: https://github.com/apache/spark/pull/29843#issuecomment-710462150 **[Test build #129911 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129911/testReport)** for PR 29843 at commit [`5d27163`](https://github.com

[GitHub] [spark] SparkQA commented on pull request #29587: [SPARK-32376][SQL] Make unionByName null-filling behavior work with struct columns

2020-10-16 Thread GitBox
SparkQA commented on pull request #29587: URL: https://github.com/apache/spark/pull/29587#issuecomment-710474858 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34515/ ---

[GitHub] [spark] AmplabJenkins commented on pull request #29587: [SPARK-32376][SQL] Make unionByName null-filling behavior work with struct columns

2020-10-16 Thread GitBox
AmplabJenkins commented on pull request #29587: URL: https://github.com/apache/spark/pull/29587#issuecomment-710474963 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

[GitHub] [spark] SparkQA commented on pull request #30066: [SPARK-XXX][INFRA] Use pre-built image at GitHub Action SparkR job

2020-10-16 Thread GitBox
SparkQA commented on pull request #30066: URL: https://github.com/apache/spark/pull/30066#issuecomment-710476062 **[Test build #129912 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129912/testReport)** for PR 30066 at commit [`ff1cc7c`](https://github.com

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29587: [SPARK-32376][SQL] Make unionByName null-filling behavior work with struct columns

2020-10-16 Thread GitBox
AmplabJenkins removed a comment on pull request #29587: URL: https://github.com/apache/spark/pull/29587#issuecomment-710474963 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] dongjoon-hyun commented on pull request #30068: [SPARK-33171][INFRA] Mark ParquetV*FilterSuite/ParquetV*SchemaPruningSuite as ExtendedSQLTest

2020-10-16 Thread GitBox
dongjoon-hyun commented on pull request #30068: URL: https://github.com/apache/spark/pull/30068#issuecomment-710486854 Could you review this, @srowen , @gengliangwang , @HyukjinKwon , @viirya ? This is an automated message fr

[GitHub] [spark] SparkQA commented on pull request #29818: [SPARK-32953][PYTHON] Add Arrow self_destruct support to toPandas

2020-10-16 Thread GitBox
SparkQA commented on pull request #29818: URL: https://github.com/apache/spark/pull/29818#issuecomment-710488670 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34516/ -

[GitHub] [spark] SparkQA commented on pull request #30045: [SPARK-32991][SQL] Use conf in shared state as the original configuraion for RESET

2020-10-16 Thread GitBox
SparkQA commented on pull request #30045: URL: https://github.com/apache/spark/pull/30045#issuecomment-710492296 **[Test build #129900 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129900/testReport)** for PR 30045 at commit [`91c2e91`](https://github.co

[GitHub] [spark] SparkQA removed a comment on pull request #30045: [SPARK-32991][SQL] Use conf in shared state as the original configuraion for RESET

2020-10-16 Thread GitBox
SparkQA removed a comment on pull request #30045: URL: https://github.com/apache/spark/pull/30045#issuecomment-710114143 **[Test build #129900 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129900/testReport)** for PR 30045 at commit [`91c2e91`](https://gi

[GitHub] [spark] AmplabJenkins commented on pull request #30045: [SPARK-32991][SQL] Use conf in shared state as the original configuraion for RESET

2020-10-16 Thread GitBox
AmplabJenkins commented on pull request #30045: URL: https://github.com/apache/spark/pull/30045#issuecomment-710495658 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30045: [SPARK-32991][SQL] Use conf in shared state as the original configuraion for RESET

2020-10-16 Thread GitBox
AmplabJenkins removed a comment on pull request #30045: URL: https://github.com/apache/spark/pull/30045#issuecomment-710495658 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] MaxGekk opened a new pull request #30069: [MINOR][SQL] Re-use `binaryToSQLTimestamp()` in `ParquetRowConverter`

2020-10-16 Thread GitBox
MaxGekk opened a new pull request #30069: URL: https://github.com/apache/spark/pull/30069 ### What changes were proposed in this pull request? The function `binaryToSQLTimestamp()` is used by Parquet Vectorized reader. Parquet MR reader has similar code for de-serialization of INT96 time

[GitHub] [spark] dongjoon-hyun commented on a change in pull request #30069: [MINOR][SQL] Re-use `binaryToSQLTimestamp()` in `ParquetRowConverter`

2020-10-16 Thread GitBox
dongjoon-hyun commented on a change in pull request #30069: URL: https://github.com/apache/spark/pull/30069#discussion_r506688087 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetRowConverter.scala ## @@ -300,15 +300,7 @@ private[

[GitHub] [spark] dongjoon-hyun commented on a change in pull request #30069: [MINOR][SQL] Re-use `binaryToSQLTimestamp()` in `ParquetRowConverter`

2020-10-16 Thread GitBox
dongjoon-hyun commented on a change in pull request #30069: URL: https://github.com/apache/spark/pull/30069#discussion_r506688087 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetRowConverter.scala ## @@ -300,15 +300,7 @@ private[

[GitHub] [spark] dongjoon-hyun commented on a change in pull request #30069: [MINOR][SQL] Re-use `binaryToSQLTimestamp()` in `ParquetRowConverter`

2020-10-16 Thread GitBox
dongjoon-hyun commented on a change in pull request #30069: URL: https://github.com/apache/spark/pull/30069#discussion_r506688087 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetRowConverter.scala ## @@ -300,15 +300,7 @@ private[

[GitHub] [spark] dongjoon-hyun commented on a change in pull request #30069: [MINOR][SQL] Re-use `binaryToSQLTimestamp()` in `ParquetRowConverter`

2020-10-16 Thread GitBox
dongjoon-hyun commented on a change in pull request #30069: URL: https://github.com/apache/spark/pull/30069#discussion_r506688087 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetRowConverter.scala ## @@ -300,15 +300,7 @@ private[

[GitHub] [spark] SparkQA commented on pull request #29843: [SPARK-29250][BUILD][test-maven][test-hadoop2.7] Upgrade to Hadoop 3.2.1 and move to shaded client

2020-10-16 Thread GitBox
SparkQA commented on pull request #29843: URL: https://github.com/apache/spark/pull/29843#issuecomment-710510564 **[Test build #129907 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129907/testReport)** for PR 29843 at commit [`5d27163`](https://github.co

[GitHub] [spark] SparkQA removed a comment on pull request #29843: [SPARK-29250][BUILD][test-maven][test-hadoop2.7] Upgrade to Hadoop 3.2.1 and move to shaded client

2020-10-16 Thread GitBox
SparkQA removed a comment on pull request #29843: URL: https://github.com/apache/spark/pull/29843#issuecomment-710268992 **[Test build #129907 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129907/testReport)** for PR 29843 at commit [`5d27163`](https://gi

[GitHub] [spark] AmplabJenkins commented on pull request #29843: [SPARK-29250][BUILD][test-maven][test-hadoop2.7] Upgrade to Hadoop 3.2.1 and move to shaded client

2020-10-16 Thread GitBox
AmplabJenkins commented on pull request #29843: URL: https://github.com/apache/spark/pull/29843#issuecomment-710512983 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

[GitHub] [spark] dongjoon-hyun commented on pull request #30068: [SPARK-33171][INFRA] Mark ParquetV*FilterSuite/ParquetV*SchemaPruningSuite as ExtendedSQLTest

2020-10-16 Thread GitBox
dongjoon-hyun commented on pull request #30068: URL: https://github.com/apache/spark/pull/30068#issuecomment-710514192 Ya. We can. Currently, it seems that @HyukjinKwon designed this for easy consistent categorization in Hive/SQL. ![Screen Shot 2020-10-16 at 12 50 36 PM](https://user-im

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29843: [SPARK-29250][BUILD][test-maven][test-hadoop2.7] Upgrade to Hadoop 3.2.1 and move to shaded client

2020-10-16 Thread GitBox
AmplabJenkins removed a comment on pull request #29843: URL: https://github.com/apache/spark/pull/29843#issuecomment-710512983 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To r

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29843: [SPARK-29250][BUILD][test-maven][test-hadoop2.7] Upgrade to Hadoop 3.2.1 and move to shaded client

2020-10-16 Thread GitBox
AmplabJenkins removed a comment on pull request #29843: URL: https://github.com/apache/spark/pull/29843#issuecomment-710512991 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/129

[GitHub] [spark] dongjoon-hyun commented on pull request #30068: [SPARK-33171][INFRA] Mark ParquetV*FilterSuite/ParquetV*SchemaPruningSuite as ExtendedSQLTest

2020-10-16 Thread GitBox
dongjoon-hyun commented on pull request #30068: URL: https://github.com/apache/spark/pull/30068#issuecomment-710515093 For naming things, we can switch it with a follow-up as a easy fix without CI passing. So, I'll merge this first to save our community time. Thank you so much, @viirya !

[GitHub] [spark] SparkQA commented on pull request #29587: [SPARK-32376][SQL] Make unionByName null-filling behavior work with struct columns

2020-10-16 Thread GitBox
SparkQA commented on pull request #29587: URL: https://github.com/apache/spark/pull/29587#issuecomment-710516840 **[Test build #129914 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129914/testReport)** for PR 29587 at commit [`3d907d0`](https://github.com

[GitHub] [spark] SparkQA commented on pull request #30069: [MINOR][SQL] Re-use `binaryToSQLTimestamp()` in `ParquetRowConverter`

2020-10-16 Thread GitBox
SparkQA commented on pull request #30069: URL: https://github.com/apache/spark/pull/30069#issuecomment-710516751 **[Test build #129913 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129913/testReport)** for PR 30069 at commit [`f80ab26`](https://github.com

[GitHub] [spark] dongjoon-hyun commented on pull request #30068: [SPARK-33171][INFRA] Mark ParquetV*FilterSuite/ParquetV*SchemaPruningSuite as ExtendedSQLTest

2020-10-16 Thread GitBox
dongjoon-hyun commented on pull request #30068: URL: https://github.com/apache/spark/pull/30068#issuecomment-710518573 Merged to master This is an automated message from the Apache Git Service. To respond to the message, plea

[GitHub] [spark] SparkQA commented on pull request #29818: [SPARK-32953][PYTHON] Add Arrow self_destruct support to toPandas

2020-10-16 Thread GitBox
SparkQA commented on pull request #29818: URL: https://github.com/apache/spark/pull/29818#issuecomment-710524710 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34516/ ---

[GitHub] [spark] AmplabJenkins commented on pull request #29818: [SPARK-32953][PYTHON] Add Arrow self_destruct support to toPandas

2020-10-16 Thread GitBox
AmplabJenkins commented on pull request #29818: URL: https://github.com/apache/spark/pull/29818#issuecomment-710524785 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29818: [SPARK-32953][PYTHON] Add Arrow self_destruct support to toPandas

2020-10-16 Thread GitBox
AmplabJenkins removed a comment on pull request #29818: URL: https://github.com/apache/spark/pull/29818#issuecomment-710524785 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] dongjoon-hyun commented on pull request #30068: [SPARK-33171][INFRA] Mark ParquetV*FilterSuite/ParquetV*SchemaPruningSuite as ExtendedSQLTest

2020-10-16 Thread GitBox
dongjoon-hyun commented on pull request #30068: URL: https://github.com/apache/spark/pull/30068#issuecomment-710529083 Also, cherry-picked to branch-3.0 to reduce the waiting time there. This is an automated message from the

[GitHub] [spark] SparkQA commented on pull request #29843: [SPARK-29250][BUILD][test-maven][test-hadoop2.7] Upgrade to Hadoop 3.2.1 and move to shaded client

2020-10-16 Thread GitBox
SparkQA commented on pull request #29843: URL: https://github.com/apache/spark/pull/29843#issuecomment-710533518 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34517/ -

[GitHub] [spark] SparkQA commented on pull request #30066: [SPARK-XXX][INFRA] Use pre-built image at GitHub Action SparkR job

2020-10-16 Thread GitBox
SparkQA commented on pull request #30066: URL: https://github.com/apache/spark/pull/30066#issuecomment-710540563 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34518/ -

[GitHub] [spark] SparkQA commented on pull request #30066: [SPARK-XXX][INFRA] Use pre-built image at GitHub Action SparkR job

2020-10-16 Thread GitBox
SparkQA commented on pull request #30066: URL: https://github.com/apache/spark/pull/30066#issuecomment-710541150 **[Test build #129915 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129915/testReport)** for PR 30066 at commit [`3e86b6e`](https://github.com

[GitHub] [spark] dongjoon-hyun commented on pull request #29843: [SPARK-29250][BUILD] Upgrade to Hadoop 3.2.1 and move to shaded client

2020-10-16 Thread GitBox
dongjoon-hyun commented on pull request #29843: URL: https://github.com/apache/spark/pull/29843#issuecomment-710559764 Retest this please. This is an automated message from the Apache Git Service. To respond to the message, p

[GitHub] [spark] dongjoon-hyun removed a comment on pull request #29843: [SPARK-29250][BUILD] Upgrade to Hadoop 3.2.1 and move to shaded client

2020-10-16 Thread GitBox
dongjoon-hyun removed a comment on pull request #29843: URL: https://github.com/apache/spark/pull/29843#issuecomment-710559764 Retest this please. This is an automated message from the Apache Git Service. To respond to the me

[GitHub] [spark] SparkQA commented on pull request #29843: [SPARK-29250][BUILD] Upgrade to Hadoop 3.2.1 and move to shaded client

2020-10-16 Thread GitBox
SparkQA commented on pull request #29843: URL: https://github.com/apache/spark/pull/29843#issuecomment-710562102 Kubernetes integration test status success URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34517/ ---

[GitHub] [spark] AmplabJenkins commented on pull request #29843: [SPARK-29250][BUILD] Upgrade to Hadoop 3.2.1 and move to shaded client

2020-10-16 Thread GitBox
AmplabJenkins commented on pull request #29843: URL: https://github.com/apache/spark/pull/29843#issuecomment-710562172 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29843: [SPARK-29250][BUILD] Upgrade to Hadoop 3.2.1 and move to shaded client

2020-10-16 Thread GitBox
AmplabJenkins removed a comment on pull request #29843: URL: https://github.com/apache/spark/pull/29843#issuecomment-710562172 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] SparkQA commented on pull request #30069: [MINOR][SQL] Re-use `binaryToSQLTimestamp()` in `ParquetRowConverter`

2020-10-16 Thread GitBox
SparkQA commented on pull request #30069: URL: https://github.com/apache/spark/pull/30069#issuecomment-710569313 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34519/ -

[GitHub] [spark] dongjoon-hyun commented on pull request #29231: [SPARK-32436][CORE] Initialize numNonEmptyBlocks in HighlyCompressedMapStatus.readExternal

2020-10-16 Thread GitBox
dongjoon-hyun commented on pull request #29231: URL: https://github.com/apache/spark/pull/29231#issuecomment-710572952 I agree with you guys, @srowen and @dossett ! Sure, I'll test and backport this. This is an automated mes

[GitHub] [spark] SparkQA commented on pull request #30066: [SPARK-XXX][INFRA] Use pre-built image at GitHub Action SparkR job

2020-10-16 Thread GitBox
SparkQA commented on pull request #30066: URL: https://github.com/apache/spark/pull/30066#issuecomment-710573048 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34518/ ---

[GitHub] [spark] AmplabJenkins commented on pull request #30066: [SPARK-XXX][INFRA] Use pre-built image at GitHub Action SparkR job

2020-10-16 Thread GitBox
AmplabJenkins commented on pull request #30066: URL: https://github.com/apache/spark/pull/30066#issuecomment-710573110 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30066: [SPARK-XXX][INFRA] Use pre-built image at GitHub Action SparkR job

2020-10-16 Thread GitBox
AmplabJenkins removed a comment on pull request #30066: URL: https://github.com/apache/spark/pull/30066#issuecomment-710573110 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To r

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30066: [SPARK-XXX][INFRA] Use pre-built image at GitHub Action SparkR job

2020-10-16 Thread GitBox
AmplabJenkins removed a comment on pull request #30066: URL: https://github.com/apache/spark/pull/30066#issuecomment-710573138 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8

[GitHub] [spark] SparkQA commented on pull request #29587: [SPARK-32376][SQL] Make unionByName null-filling behavior work with struct columns

2020-10-16 Thread GitBox
SparkQA commented on pull request #29587: URL: https://github.com/apache/spark/pull/29587#issuecomment-710582095 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34520/ -

[GitHub] [spark] SparkQA commented on pull request #29843: [SPARK-29250][BUILD] Upgrade to Hadoop 3.2.1 and move to shaded client

2020-10-16 Thread GitBox
SparkQA commented on pull request #29843: URL: https://github.com/apache/spark/pull/29843#issuecomment-710584861 **[Test build #129911 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129911/testReport)** for PR 29843 at commit [`5d27163`](https://github.co

[GitHub] [spark] AmplabJenkins commented on pull request #29843: [SPARK-29250][BUILD] Upgrade to Hadoop 3.2.1 and move to shaded client

2020-10-16 Thread GitBox
AmplabJenkins commented on pull request #29843: URL: https://github.com/apache/spark/pull/29843#issuecomment-710585049 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29843: [SPARK-29250][BUILD] Upgrade to Hadoop 3.2.1 and move to shaded client

2020-10-16 Thread GitBox
AmplabJenkins removed a comment on pull request #29843: URL: https://github.com/apache/spark/pull/29843#issuecomment-710585049 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To r

[GitHub] [spark] SparkQA removed a comment on pull request #29843: [SPARK-29250][BUILD] Upgrade to Hadoop 3.2.1 and move to shaded client

2020-10-16 Thread GitBox
SparkQA removed a comment on pull request #29843: URL: https://github.com/apache/spark/pull/29843#issuecomment-710462150 **[Test build #129911 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129911/testReport)** for PR 29843 at commit [`5d27163`](https://gi

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29843: [SPARK-29250][BUILD] Upgrade to Hadoop 3.2.1 and move to shaded client

2020-10-16 Thread GitBox
AmplabJenkins removed a comment on pull request #29843: URL: https://github.com/apache/spark/pull/29843#issuecomment-710585079 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/129

[GitHub] [spark] dongjoon-hyun commented on pull request #29231: [SPARK-32436][CORE] Initialize numNonEmptyBlocks in HighlyCompressedMapStatus.readExternal

2020-10-16 Thread GitBox
dongjoon-hyun commented on pull request #29231: URL: https://github.com/apache/spark/pull/29231#issuecomment-710590910 BTW, for the related Scala issues, I linked [here](https://github.com/apache/spark/pull/29231#issuecomment-671626985). ---

[GitHub] [spark] SparkQA commented on pull request #30066: [SPARK-XXX][INFRA] Use pre-built image at GitHub Action SparkR job

2020-10-16 Thread GitBox
SparkQA commented on pull request #30066: URL: https://github.com/apache/spark/pull/30066#issuecomment-710596554 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34521/ -

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30069: [MINOR][SQL] Re-use `binaryToSQLTimestamp()` in `ParquetRowConverter`

2020-10-16 Thread GitBox
AmplabJenkins removed a comment on pull request #30069: URL: https://github.com/apache/spark/pull/30069#issuecomment-710614284 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To r

[GitHub] [spark] AmplabJenkins commented on pull request #30069: [MINOR][SQL] Re-use `binaryToSQLTimestamp()` in `ParquetRowConverter`

2020-10-16 Thread GitBox
AmplabJenkins commented on pull request #30069: URL: https://github.com/apache/spark/pull/30069#issuecomment-710614284 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

[GitHub] [spark] AmplabJenkins commented on pull request #29587: [SPARK-32376][SQL] Make unionByName null-filling behavior work with struct columns

2020-10-16 Thread GitBox
AmplabJenkins commented on pull request #29587: URL: https://github.com/apache/spark/pull/29587#issuecomment-710615294 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHu

[GitHub] [spark] SparkQA commented on pull request #30069: [MINOR][SQL] Re-use `binaryToSQLTimestamp()` in `ParquetRowConverter`

2020-10-16 Thread GitBox
SparkQA commented on pull request #30069: URL: https://github.com/apache/spark/pull/30069#issuecomment-710614234 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34519/ ---

[GitHub] [spark] gemelen opened a new pull request #30070: [WIP][SPARK-33109][BUILD] Upgrade to sbt 1.4.0

2020-10-16 Thread GitBox
gemelen opened a new pull request #30070: URL: https://github.com/apache/spark/pull/30070 ### What changes were proposed in this pull request? Upgrade sbt to release 1.4.0 ### Why are the changes needed? Bring built-in `dependencyTree` instead of removed `sbt-dependency-

[GitHub] [spark] SparkQA commented on pull request #29587: [SPARK-32376][SQL] Make unionByName null-filling behavior work with struct columns

2020-10-16 Thread GitBox
SparkQA commented on pull request #29587: URL: https://github.com/apache/spark/pull/29587#issuecomment-710615247 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34520/ ---

[GitHub] [spark] AmplabJenkins removed a comment on pull request #30069: [MINOR][SQL] Re-use `binaryToSQLTimestamp()` in `ParquetRowConverter`

2020-10-16 Thread GitBox
AmplabJenkins removed a comment on pull request #30069: URL: https://github.com/apache/spark/pull/30069#issuecomment-710614293 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8

[GitHub] [spark] SparkQA commented on pull request #30070: [WIP][SPARK-33109][BUILD] Upgrade to sbt 1.4.0

2020-10-16 Thread GitBox
SparkQA commented on pull request #30070: URL: https://github.com/apache/spark/pull/30070#issuecomment-710617035 **[Test build #129916 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129916/testReport)** for PR 30070 at commit [`1183ea9`](https://github.com

[GitHub] [spark] gemelen commented on pull request #30070: [WIP][SPARK-33109][BUILD] Upgrade to sbt 1.4.0

2020-10-16 Thread GitBox
gemelen commented on pull request #30070: URL: https://github.com/apache/spark/pull/30070#issuecomment-710616557 I'd like to see GH and Jenkins tests, cause I had seen very strange behaviour locally (one of the tasks went into endless loop) and I'm not sure that it is reproducible somewher

[GitHub] [spark] SparkQA commented on pull request #30066: [SPARK-XXX][INFRA] Use pre-built image at GitHub Action SparkR job

2020-10-16 Thread GitBox
SparkQA commented on pull request #30066: URL: https://github.com/apache/spark/pull/30066#issuecomment-710617216 **[Test build #129917 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129917/testReport)** for PR 30066 at commit [`a40555e`](https://github.com

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29587: [SPARK-32376][SQL] Make unionByName null-filling behavior work with struct columns

2020-10-16 Thread GitBox
AmplabJenkins removed a comment on pull request #29587: URL: https://github.com/apache/spark/pull/29587#issuecomment-710615294 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To r

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29587: [SPARK-32376][SQL] Make unionByName null-filling behavior work with struct columns

2020-10-16 Thread GitBox
AmplabJenkins removed a comment on pull request #29587: URL: https://github.com/apache/spark/pull/29587#issuecomment-710615346 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8

[GitHub] [spark] dongjoon-hyun commented on pull request #29231: [SPARK-32436][CORE] Initialize numNonEmptyBlocks in HighlyCompressedMapStatus.readExternal

2020-10-16 Thread GitBox
dongjoon-hyun commented on pull request #29231: URL: https://github.com/apache/spark/pull/29231#issuecomment-710633885 This lands at `branch-3.0` now. This is an automated message from the Apache Git Service. To respond to th

<    1   2   3   4   5   6   7   8   9   10   >