[GitHub] [spark] SparkQA removed a comment on pull request #32499: [SPARK-29145][SQL][FOLLOWUP] Support sub-queries in join conditions

2021-05-11 Thread GitBox
SparkQA removed a comment on pull request #32499: URL: https://github.com/apache/spark/pull/32499#issuecomment-838571340 **[Test build #138381 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/138381/testReport)** for PR 32499 at commit [`d9cad03`](https://gi

[GitHub] [spark] SparkQA commented on pull request #32499: [SPARK-29145][SQL][FOLLOWUP] Support sub-queries in join conditions

2021-05-11 Thread GitBox
SparkQA commented on pull request #32499: URL: https://github.com/apache/spark/pull/32499#issuecomment-839004916 **[Test build #138381 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/138381/testReport)** for PR 32499 at commit [`d9cad03`](https://github.co

[GitHub] [spark] AmplabJenkins removed a comment on pull request #32292: [WIP][SPARK-35162][SQL] New SQL functions: TRY_ADD/TRY_DIVIDE

2021-05-11 Thread GitBox
AmplabJenkins removed a comment on pull request #32292: URL: https://github.com/apache/spark/pull/32292#issuecomment-838973502 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/42914/

[GitHub] [spark] AmplabJenkins commented on pull request #32292: [WIP][SPARK-35162][SQL] New SQL functions: TRY_ADD/TRY_DIVIDE

2021-05-11 Thread GitBox
AmplabJenkins commented on pull request #32292: URL: https://github.com/apache/spark/pull/32292#issuecomment-838973502 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/42914/ -- T

[GitHub] [spark] SparkQA commented on pull request #32292: [WIP][SPARK-35162][SQL] New SQL functions: TRY_ADD/TRY_DIVIDE

2021-05-11 Thread GitBox
SparkQA commented on pull request #32292: URL: https://github.com/apache/spark/pull/32292#issuecomment-838973377 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries

[GitHub] [spark] AmplabJenkins removed a comment on pull request #32436: [SPARK-34764][CORE][K8S][UI] Propagate reason for exec loss to Web UI

2021-05-11 Thread GitBox
AmplabJenkins removed a comment on pull request #32436: URL: https://github.com/apache/spark/pull/32436#issuecomment-838966502 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/42913/

[GitHub] [spark] SparkQA commented on pull request #32436: [SPARK-34764][CORE][K8S][UI] Propagate reason for exec loss to Web UI

2021-05-11 Thread GitBox
SparkQA commented on pull request #32436: URL: https://github.com/apache/spark/pull/32436#issuecomment-838966445 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/42913/ -- This is an automated message from the A

[GitHub] [spark] AmplabJenkins commented on pull request #32436: [SPARK-34764][CORE][K8S][UI] Propagate reason for exec loss to Web UI

2021-05-11 Thread GitBox
AmplabJenkins commented on pull request #32436: URL: https://github.com/apache/spark/pull/32436#issuecomment-838966502 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/42913/ -- T

[GitHub] [spark] SparkQA commented on pull request #32178: [DONOTMERGE] initial commit for skeleton ansible for jenkins worker config

2021-05-11 Thread GitBox
SparkQA commented on pull request #32178: URL: https://github.com/apache/spark/pull/32178#issuecomment-838965272 **[Test build #138395 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/138395/testReport)** for PR 32178 at commit [`89b36c9`](https://github.com

[GitHub] [spark] SparkQA commented on pull request #32507: [SPARK-35361][SQL] Improve performance for ApplyFunctionExpression

2021-05-11 Thread GitBox
SparkQA commented on pull request #32507: URL: https://github.com/apache/spark/pull/32507#issuecomment-838964138 **[Test build #138394 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/138394/testReport)** for PR 32507 at commit [`900f34c`](https://github.com

[GitHub] [spark] sunchao opened a new pull request #32507: [SPARK-35361][SQL] Improve performance for ApplyFunctionExpression

2021-05-11 Thread GitBox
sunchao opened a new pull request #32507: URL: https://github.com/apache/spark/pull/32507 ### What changes were proposed in this pull request? In `ApplyFunctionExpression`, move `zipWithIndex` out of the loop for each input row. ### Why are the changes needed?
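A minimal sketch of the idea described in this PR summary, written in Python as an analogy rather than as Spark's actual Scala code. The function names, the `extractors` list, and the row shape are hypothetical; `enumerate` stands in for Scala's `zipWithIndex`. The point is only that the index pairing depends on the argument list and not on the row, so it can be built once outside the per-row loop.

```python
from typing import Any, Callable, List, Sequence, Tuple

Row = Sequence[Any]
Extractor = Callable[[Row], Any]


def apply_per_row_pairing(rows: List[Row], extractors: List[Extractor]) -> List[Tuple[Any, ...]]:
    """Rebuilds the (index, extractor) pairs for every input row."""
    results = []
    for row in rows:
        indexed = list(enumerate(extractors))  # redundant work repeated per row
        args = [None] * len(extractors)
        for i, extract in indexed:
            args[i] = extract(row)
        results.append(tuple(args))
    return results


def apply_hoisted_pairing(rows: List[Row], extractors: List[Extractor]) -> List[Tuple[Any, ...]]:
    """Builds the (index, extractor) pairs once, then reuses them for each row."""
    indexed = list(enumerate(extractors))  # hoisted out of the row loop
    results = []
    for row in rows:
        args = [None] * len(extractors)
        for i, extract in indexed:
            args[i] = extract(row)
        results.append(tuple(args))
    return results
```

Both functions return the same values; the hoisted form simply avoids allocating the paired list once per row, which is the kind of saving the PR title refers to.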

[GitHub] [spark] SparkQA commented on pull request #32178: [DONOTMERGE] initial commit for skeleton ansible for jenkins worker config

2021-05-11 Thread GitBox
SparkQA commented on pull request #32178: URL: https://github.com/apache/spark/pull/32178#issuecomment-838959391 **[Test build #138393 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/138393/testReport)** for PR 32178 at commit [`2c3ebfd`](https://github.com

[GitHub] [spark] xinrong-databricks commented on pull request #32469: [SPARK-35338][PYTHON] Separate arithmetic operations into data type based structures

2021-05-11 Thread GitBox
xinrong-databricks commented on pull request #32469: URL: https://github.com/apache/spark/pull/32469#issuecomment-838957172 CC @ueshin @HyukjinKwon @itholic -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL abo

[GitHub] [spark] SparkQA commented on pull request #32436: [SPARK-34764][CORE][K8S][UI] Propagate reason for exec loss to Web UI

2021-05-11 Thread GitBox
SparkQA commented on pull request #32436: URL: https://github.com/apache/spark/pull/32436#issuecomment-838956958 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/42913/ -- This is an automated message from the Apache

[GitHub] [spark] shaneknapp commented on pull request #32178: [DONOTMERGE] initial commit for skeleton ansible for jenkins worker config

2021-05-11 Thread GitBox
shaneknapp commented on pull request #32178: URL: https://github.com/apache/spark/pull/32178#issuecomment-838952376 > > @shaneknapp Hi, shane, do you have any plan to append Arm node [1] setup related configuration on your patch? Thanks > > [1] https://amplab.cs.berkeley.edu/jenkins/labe

[GitHub] [spark] shaneknapp commented on pull request #32178: [DONOTMERGE] initial commit for skeleton ansible for jenkins worker config

2021-05-11 Thread GitBox
shaneknapp commented on pull request #32178: URL: https://github.com/apache/spark/pull/32178#issuecomment-838950631 > @shaneknapp Hi, shane, do you have any plan to append Arm node [1] setup related configuration on your patch? Thanks > > [1] https://amplab.cs.berkeley.edu/jenkins/la

[GitHub] [spark] SparkQA commented on pull request #32292: [WIP][SPARK-35162][SQL] New SQL functions: TRY_ADD/TRY_DIVIDE

2021-05-11 Thread GitBox
SparkQA commented on pull request #32292: URL: https://github.com/apache/spark/pull/32292#issuecomment-838947759 **[Test build #138392 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/138392/testReport)** for PR 32292 at commit [`75dee4a`](https://github.com

[GitHub] [spark] AmplabJenkins removed a comment on pull request #32506: [SPARK-35374][SQL] Add string-to-number conversion support to JacksonParser

2021-05-11 Thread GitBox
AmplabJenkins removed a comment on pull request #32506: URL: https://github.com/apache/spark/pull/32506#issuecomment-838944933 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/42911/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #32452: [SPARK-35243][SQL] Support columnar execution on ANSI interval types

2021-05-11 Thread GitBox
AmplabJenkins removed a comment on pull request #32452: URL: https://github.com/apache/spark/pull/32452#issuecomment-838944934 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/138378/ -

[GitHub] [spark] AmplabJenkins removed a comment on pull request #32504: [Minor][SPARK-35013]Don't allow to set spark.driver.cores=0

2021-05-11 Thread GitBox
AmplabJenkins removed a comment on pull request #32504: URL: https://github.com/apache/spark/pull/32504#issuecomment-838944936 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/138383/ -

[GitHub] [spark] AmplabJenkins removed a comment on pull request #32469: [WIP][SPARK-35338][PYTHON] Separate arithmetic operations into data type based structures

2021-05-11 Thread GitBox
AmplabJenkins removed a comment on pull request #32469: URL: https://github.com/apache/spark/pull/32469#issuecomment-838944932 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/42912/

[GitHub] [spark] AmplabJenkins commented on pull request #32452: [SPARK-35243][SQL] Support columnar execution on ANSI interval types

2021-05-11 Thread GitBox
AmplabJenkins commented on pull request #32452: URL: https://github.com/apache/spark/pull/32452#issuecomment-838944934 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/138378/ -- This

[GitHub] [spark] AmplabJenkins commented on pull request #32504: [Minor][SPARK-35013]Don't allow to set spark.driver.cores=0

2021-05-11 Thread GitBox
AmplabJenkins commented on pull request #32504: URL: https://github.com/apache/spark/pull/32504#issuecomment-838944936 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/138383/ -- This

[GitHub] [spark] AmplabJenkins commented on pull request #32469: [WIP][SPARK-35338][PYTHON] Separate arithmetic operations into data type based structures

2021-05-11 Thread GitBox
AmplabJenkins commented on pull request #32469: URL: https://github.com/apache/spark/pull/32469#issuecomment-838944932 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/42912/ -- T

[GitHub] [spark] AmplabJenkins commented on pull request #32506: [SPARK-35374][SQL] Add string-to-number conversion support to JacksonParser

2021-05-11 Thread GitBox
AmplabJenkins commented on pull request #32506: URL: https://github.com/apache/spark/pull/32506#issuecomment-838944933 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/42911/ -- T

[GitHub] [spark] sigmod commented on a change in pull request #32298: [WIP][SPARK-34079][SQL] Merge non-correlated scalar subqueries to multi-column scalar subqueries for better reuse

2021-05-11 Thread GitBox
sigmod commented on a change in pull request #32298: URL: https://github.com/apache/spark/pull/32298#discussion_r630412347 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/MergeScalarSubqueries.scala ## @@ -0,0 +1,184 @@ +/* + * Licensed to the

[GitHub] [spark] SparkQA commented on pull request #32506: [SPARK-35374][SQL] Add string-to-number conversion support to JacksonParser

2021-05-11 Thread GitBox
SparkQA commented on pull request #32506: URL: https://github.com/apache/spark/pull/32506#issuecomment-838932173 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries

[GitHub] [spark] SparkQA commented on pull request #32469: [WIP][SPARK-35338][PYTHON] Separate arithmetic operations into data type based structures

2021-05-11 Thread GitBox
SparkQA commented on pull request #32469: URL: https://github.com/apache/spark/pull/32469#issuecomment-838920549 Kubernetes integration test unable to build dist. exiting with code: 1 URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/42912/ -- This

[GitHub] [spark] dongjoon-hyun commented on pull request #32502: [SPARK-35372][BUILD] Increase stack size for Scala compilation in Maven build

2021-05-11 Thread GitBox
dongjoon-hyun commented on pull request #32502: URL: https://github.com/apache/spark/pull/32502#issuecomment-838915154 +1, LGTM. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comm

[GitHub] [spark] SparkQA removed a comment on pull request #32504: [Minor][SPARK-35013]Don't allow to set spark.driver.cores=0

2021-05-11 Thread GitBox
SparkQA removed a comment on pull request #32504: URL: https://github.com/apache/spark/pull/32504#issuecomment-838669907 **[Test build #138383 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/138383/testReport)** for PR 32504 at commit [`b615db6`](https://gi

[GitHub] [spark] SparkQA commented on pull request #32504: [Minor][SPARK-35013]Don't allow to set spark.driver.cores=0

2021-05-11 Thread GitBox
SparkQA commented on pull request #32504: URL: https://github.com/apache/spark/pull/32504#issuecomment-838902238 **[Test build #138383 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/138383/testReport)** for PR 32504 at commit [`b615db6`](https://github.co

[GitHub] [spark] SparkQA removed a comment on pull request #32452: [SPARK-35243][SQL] Support columnar execution on ANSI interval types

2021-05-11 Thread GitBox
SparkQA removed a comment on pull request #32452: URL: https://github.com/apache/spark/pull/32452#issuecomment-838464756 **[Test build #138378 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/138378/testReport)** for PR 32452 at commit [`d911e8a`](https://gi

[GitHub] [spark] SparkQA commented on pull request #32452: [SPARK-35243][SQL] Support columnar execution on ANSI interval types

2021-05-11 Thread GitBox
SparkQA commented on pull request #32452: URL: https://github.com/apache/spark/pull/32452#issuecomment-83319 **[Test build #138378 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/138378/testReport)** for PR 32452 at commit [`d911e8a`](https://github.co

[GitHub] [spark] SparkQA commented on pull request #32292: [WIP][SPARK-35162][SQL] New SQL functions: TRY_ADD/TRY_DIVIDE

2021-05-11 Thread GitBox
SparkQA commented on pull request #32292: URL: https://github.com/apache/spark/pull/32292#issuecomment-838883496 **[Test build #138391 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/138391/testReport)** for PR 32292 at commit [`a1d5742`](https://github.com

[GitHub] [spark] AmplabJenkins removed a comment on pull request #32411: [SPARK-28551][SQL]CTAS with LOCATION , should not allow to a non-empty directory.

2021-05-11 Thread GitBox
AmplabJenkins removed a comment on pull request #32411: URL: https://github.com/apache/spark/pull/32411#issuecomment-838856363 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/42908/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #32436: [SPARK-34764][CORE][K8S][UI] Propagate reason for exec loss to Web UI

2021-05-11 Thread GitBox
AmplabJenkins removed a comment on pull request #32436: URL: https://github.com/apache/spark/pull/32436#issuecomment-832864910 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/42699/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #32389: [SPARK-35263] [TEST] Refactor ShuffleBlockFetcherIteratorSuite to reduce duplicated code

2021-05-11 Thread GitBox
AmplabJenkins removed a comment on pull request #32389: URL: https://github.com/apache/spark/pull/32389#issuecomment-838856369 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/42909/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #32505: [SPARK-35373][BUILD] Check Maven artifact checksum in build/mvn

2021-05-11 Thread GitBox
AmplabJenkins removed a comment on pull request #32505: URL: https://github.com/apache/spark/pull/32505#issuecomment-838856364 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/42907/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #32498: [SPARK-35368][SQL] Update histogram statistics for RANGE operator for stats estimation

2021-05-11 Thread GitBox
AmplabJenkins removed a comment on pull request #32498: URL: https://github.com/apache/spark/pull/32498#issuecomment-838856365 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/42910/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #32397: [SPARK-35084][CORE] Spark 3: supporting "--packages" in k8s cluster mode

2021-05-11 Thread GitBox
AmplabJenkins removed a comment on pull request #32397: URL: https://github.com/apache/spark/pull/32397#issuecomment-838856366 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/138380/ -

[GitHub] [spark] SparkQA commented on pull request #32436: [SPARK-34764][CORE][K8S][UI] Propagate reason for exec loss to Web UI

2021-05-11 Thread GitBox
SparkQA commented on pull request #32436: URL: https://github.com/apache/spark/pull/32436#issuecomment-838867786 **[Test build #138390 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/138390/testReport)** for PR 32436 at commit [`a4263cd`](https://github.com

[GitHub] [spark] sarutak closed pull request #32502: [SPARK-35372][BUILD] Increase stack size for Scala compilation in Maven build

2021-05-11 Thread GitBox
sarutak closed pull request #32502: URL: https://github.com/apache/spark/pull/32502 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please

[GitHub] [spark] SparkQA commented on pull request #32469: [WIP][SPARK-35338][PYTHON] Separate arithmetic operations into data type based structures

2021-05-11 Thread GitBox
SparkQA commented on pull request #32469: URL: https://github.com/apache/spark/pull/32469#issuecomment-838861089 **[Test build #138389 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/138389/testReport)** for PR 32469 at commit [`a4a6581`](https://github.com

[GitHub] [spark] SparkQA commented on pull request #32506: [SPARK-35374][SQL] Add string-to-number conversion support to JacksonParser

2021-05-11 Thread GitBox
SparkQA commented on pull request #32506: URL: https://github.com/apache/spark/pull/32506#issuecomment-838860395 **[Test build #138388 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/138388/testReport)** for PR 32506 at commit [`45a6833`](https://github.com

[GitHub] [spark] sarutak commented on pull request #32502: [SPARK-35372][BUILD] Increase stack size for Scala compilation in Maven build

2021-05-11 Thread GitBox
sarutak commented on pull request #32502: URL: https://github.com/apache/spark/pull/32502#issuecomment-838857505 I think the Jenkins failure is not related to this change. Merging to `master` to recover build. -- This is an automated message from the Apache Git Service. To respond to t

[GitHub] [spark] AmplabJenkins commented on pull request #32397: [SPARK-35084][CORE] Spark 3: supporting "--packages" in k8s cluster mode

2021-05-11 Thread GitBox
AmplabJenkins commented on pull request #32397: URL: https://github.com/apache/spark/pull/32397#issuecomment-838856366 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/138380/ -- This

[GitHub] [spark] AmplabJenkins commented on pull request #32411: [SPARK-28551][SQL]CTAS with LOCATION , should not allow to a non-empty directory.

2021-05-11 Thread GitBox
AmplabJenkins commented on pull request #32411: URL: https://github.com/apache/spark/pull/32411#issuecomment-838856363 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/42908/ -- T

[GitHub] [spark] AmplabJenkins commented on pull request #32505: [SPARK-35373][BUILD] Check Maven artifact checksum in build/mvn

2021-05-11 Thread GitBox
AmplabJenkins commented on pull request #32505: URL: https://github.com/apache/spark/pull/32505#issuecomment-838856364 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/42907/ -- T

[GitHub] [spark] AmplabJenkins commented on pull request #32389: [SPARK-35263] [TEST] Refactor ShuffleBlockFetcherIteratorSuite to reduce duplicated code

2021-05-11 Thread GitBox
AmplabJenkins commented on pull request #32389: URL: https://github.com/apache/spark/pull/32389#issuecomment-838856369 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/42909/ -- T

[GitHub] [spark] AmplabJenkins commented on pull request #32498: [SPARK-35368][SQL] Update histogram statistics for RANGE operator for stats estimation

2021-05-11 Thread GitBox
AmplabJenkins commented on pull request #32498: URL: https://github.com/apache/spark/pull/32498#issuecomment-838856365 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/42910/ -- T

[GitHub] [spark] SparkQA commented on pull request #32498: [SPARK-35368][SQL] Update histogram statistics for RANGE operator for stats estimation

2021-05-11 Thread GitBox
SparkQA commented on pull request #32498: URL: https://github.com/apache/spark/pull/32498#issuecomment-838852137 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries

[GitHub] [spark] SparkQA commented on pull request #32389: [SPARK-35263] [TEST] Refactor ShuffleBlockFetcherIteratorSuite to reduce duplicated code

2021-05-11 Thread GitBox
SparkQA commented on pull request #32389: URL: https://github.com/apache/spark/pull/32389#issuecomment-838850182 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/42909/ -- This is an automated message from the A

[GitHub] [spark] SparkQA commented on pull request #32411: [SPARK-28551][SQL]CTAS with LOCATION , should not allow to a non-empty directory.

2021-05-11 Thread GitBox
SparkQA commented on pull request #32411: URL: https://github.com/apache/spark/pull/32411#issuecomment-838844413 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries

[GitHub] [spark] SparkQA commented on pull request #32505: [SPARK-35373][BUILD] Check Maven artifact checksum in build/mvn

2021-05-11 Thread GitBox
SparkQA commented on pull request #32505: URL: https://github.com/apache/spark/pull/32505#issuecomment-838842848 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries

[GitHub] [spark] SparkQA commented on pull request #32389: [SPARK-35263] [TEST] Refactor ShuffleBlockFetcherIteratorSuite to reduce duplicated code

2021-05-11 Thread GitBox
SparkQA commented on pull request #32389: URL: https://github.com/apache/spark/pull/32389#issuecomment-838842568 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/42909/ -- This is an automated message from the Apache

[GitHub] [spark] SparkQA removed a comment on pull request #32397: [SPARK-35084][CORE] Spark 3: supporting "--packages" in k8s cluster mode

2021-05-11 Thread GitBox
SparkQA removed a comment on pull request #32397: URL: https://github.com/apache/spark/pull/32397#issuecomment-838564651 **[Test build #138380 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/138380/testReport)** for PR 32397 at commit [`76bdccc`](https://gi

[GitHub] [spark] sarutak opened a new pull request #32506: [SPARK-35374][SQL] Add string-to-number conversion support to JacksonParser

2021-05-11 Thread GitBox
sarutak opened a new pull request #32506: URL: https://github.com/apache/spark/pull/32506 ### What changes were proposed in this pull request? This PR proposes to add string-to-number conversion support to `JacksonParser`. In the current implementation, `spark.read.json` doesn't c

[GitHub] [spark] SparkQA commented on pull request #32397: [SPARK-35084][CORE] Spark 3: supporting "--packages" in k8s cluster mode

2021-05-11 Thread GitBox
SparkQA commented on pull request #32397: URL: https://github.com/apache/spark/pull/32397#issuecomment-838821428 **[Test build #138380 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/138380/testReport)** for PR 32397 at commit [`76bdccc`](https://github.co

[GitHub] [spark] peter-toth commented on pull request #32298: [WIP][SPARK-34079][SQL] Merge non-correlated scalar subqueries to multi-column scalar subqueries for better reuse

2021-05-11 Thread GitBox
peter-toth commented on pull request #32298: URL: https://github.com/apache/spark/pull/32298#issuecomment-838806407 I move this back to WIP for now: - to discuss if (1) in https://github.com/apache/spark/pull/32298#discussion_r627800694 is acceptable approach and - to fix a few issue

[GitHub] [spark] viirya commented on pull request #31986: [SPARK-34888][SS] Introduce UpdatingSessionIterator adjusting session window on elements

2021-05-11 Thread GitBox
viirya commented on pull request #31986: URL: https://github.com/apache/spark/pull/31986#issuecomment-838800572 Thanks for updating! I will take another look. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL ab

[GitHub] [spark] SparkQA commented on pull request #32498: [SPARK-35368][SQL] Update histogram statistics for RANGE operator for stats estimation

2021-05-11 Thread GitBox
SparkQA commented on pull request #32498: URL: https://github.com/apache/spark/pull/32498#issuecomment-838797638 **[Test build #138387 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/138387/testReport)** for PR 32498 at commit [`8280487`](https://github.com

[GitHub] [spark] shahidki31 commented on pull request #32498: [SPARK-35368][SQL] Update histogram statistics for RANGE operator for stats estimation

2021-05-11 Thread GitBox
shahidki31 commented on pull request #32498: URL: https://github.com/apache/spark/pull/32498#issuecomment-838796881 cc @wangyum @cloud-fan @HyukjinKwon Kindly review -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use th

[GitHub] [spark] szeiger commented on a change in pull request #32315: [SPARK-35206][TESTS][SQL] Extract common used get project path into a function in SparkFunctionSuite

2021-05-11 Thread GitBox
szeiger commented on a change in pull request #32315: URL: https://github.com/apache/spark/pull/32315#discussion_r630338398 ## File path: core/src/test/scala/org/apache/spark/SparkFunSuite.scala ## @@ -119,6 +121,26 @@ abstract class SparkFunSuite file } + /** + *

[GitHub] [spark] SparkQA commented on pull request #32389: [SPARK-35263] [TEST] Refactor ShuffleBlockFetcherIteratorSuite to reduce duplicated code

2021-05-11 Thread GitBox
SparkQA commented on pull request #32389: URL: https://github.com/apache/spark/pull/32389#issuecomment-838776322 **[Test build #138386 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/138386/testReport)** for PR 32389 at commit [`b2abb87`](https://github.com

[GitHub] [spark] SparkQA commented on pull request #32411: [SPARK-28551][SQL]CTAS with LOCATION , should not allow to a non-empty directory.

2021-05-11 Thread GitBox
SparkQA commented on pull request #32411: URL: https://github.com/apache/spark/pull/32411#issuecomment-838776197 **[Test build #138385 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/138385/testReport)** for PR 32411 at commit [`a37aba6`](https://github.com

[GitHub] [spark] SparkQA commented on pull request #32505: [SPARK-35373][BUILD] Check Maven artifact checksum in build/mvn

2021-05-11 Thread GitBox
SparkQA commented on pull request #32505: URL: https://github.com/apache/spark/pull/32505#issuecomment-838775701 **[Test build #138384 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/138384/testReport)** for PR 32505 at commit [`b0db41e`](https://github.com

[GitHub] [spark] AmplabJenkins removed a comment on pull request #32504: [Minor][SPARK-35013]Don't allow to set spark.driver.cores=0

2021-05-11 Thread GitBox
AmplabJenkins removed a comment on pull request #32504: URL: https://github.com/apache/spark/pull/32504#issuecomment-838773096 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/42906/

[GitHub] [spark] AmplabJenkins commented on pull request #32504: [Minor][SPARK-35013]Don't allow to set spark.driver.cores=0

2021-05-11 Thread GitBox
AmplabJenkins commented on pull request #32504: URL: https://github.com/apache/spark/pull/32504#issuecomment-838773096 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/42906/ -- T

[GitHub] [spark] srowen commented on a change in pull request #32505: [SPARK-35373][BUILD] Check Maven artifact checksum in build/mvn

2021-05-11 Thread GitBox
srowen commented on a change in pull request #32505: URL: https://github.com/apache/spark/pull/32505#discussion_r630332629 ## File path: build/mvn ## @@ -26,36 +26,67 @@ _COMPILE_JVM_OPTS="-Xmx2g -XX:ReservedCodeCacheSize=1g" # Installs any application tarball given a URL, t

[GitHub] [spark] cloud-fan commented on a change in pull request #32497: [SPARK-35366][SQL] Avoid using deprecated `buildForBatch` and `buildForStreaming`

2021-05-11 Thread GitBox
cloud-fan commented on a change in pull request #32497: URL: https://github.com/apache/spark/pull/32497#discussion_r630332390 ## File path: external/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/KafkaWrite.scala ## @@ -0,0 +1,39 @@ +/* + * Licensed to the Apache

[GitHub] [spark] srowen commented on a change in pull request #32505: [SPARK-35373][BUILD] Check Maven artifact checksum in build/mvn

2021-05-11 Thread GitBox
srowen commented on a change in pull request #32505: URL: https://github.com/apache/spark/pull/32505#discussion_r630330154 ## File path: build/mvn ## @@ -26,36 +26,67 @@ _COMPILE_JVM_OPTS="-Xmx2g -XX:ReservedCodeCacheSize=1g" # Installs any application tarball given a URL, t

[GitHub] [spark] srowen opened a new pull request #32505: [SPARK-35373][BUILD] Check Maven artifact checksum in build/mvn

2021-05-11 Thread GitBox
srowen opened a new pull request #32505: URL: https://github.com/apache/spark/pull/32505 ### What changes were proposed in this pull request? `./build/mvn` now downloads the .sha512 checksum of Maven artifacts it downloads, and checks the checksum after download. ### Why are t

[GitHub] [spark] SparkQA commented on pull request #32504: [Minor][SPARK-35013]Don't allow to set spark.driver.cores=0

2021-05-11 Thread GitBox
SparkQA commented on pull request #32504: URL: https://github.com/apache/spark/pull/32504#issuecomment-838767730 Kubernetes integration test status failure URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/42906/ -- This is an automated message from the A

[GitHub] [spark] SparkQA commented on pull request #32504: [Minor][SPARK-35013]Don't allow to set spark.driver.cores=0

2021-05-11 Thread GitBox
SparkQA commented on pull request #32504: URL: https://github.com/apache/spark/pull/32504#issuecomment-838752727 Kubernetes integration test starting URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/42906/ -- This is an automated message from the Apache

[GitHub] [spark] maryannxue commented on a change in pull request #32439: [SPARK-35298][SQL] Migrate to transformWithPruning for rules in Optimizer.scala

2021-05-11 Thread GitBox
maryannxue commented on a change in pull request #32439: URL: https://github.com/apache/spark/pull/32439#discussion_r630320937 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/Optimizer.scala ## @@ -745,7 +754,8 @@ object PushProjectionThroughU

[GitHub] [spark] otterc commented on a change in pull request #32389: [SPARK-35263] [TEST] Refactor ShuffleBlockFetcherIteratorSuite to reduce duplicated code

2021-05-11 Thread GitBox
otterc commented on a change in pull request #32389: URL: https://github.com/apache/spark/pull/32389#discussion_r630293317 ## File path: .idea/vcs.xml ## @@ -1,24 +1,16 @@ - Review comment: Nit: This seems to be picked up by error -- This is an automated message

[GitHub] [spark] vinodkc commented on a change in pull request #32411: [SPARK-28551][SQL]CTAS with LOCATION , should not allow to a non-empty directory.

2021-05-11 Thread GitBox
vinodkc commented on a change in pull request #32411: URL: https://github.com/apache/spark/pull/32411#discussion_r630301266 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala ## @@ -1604,6 +1604,15 @@ object SQLConf { .booleanConf

[GitHub] [spark] linhongliu-db commented on pull request #32497: [SPARK-35366][SQL] Avoid using deprecated `buildForBatch` and `buildForStreaming`

2021-05-11 Thread GitBox
linhongliu-db commented on pull request #32497: URL: https://github.com/apache/spark/pull/32497#issuecomment-838712951 cc @cloud-fan -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[GitHub] [spark] xkrogen commented on a change in pull request #32389: [SPARK-35263] [TEST] Refactor ShuffleBlockFetcherIteratorSuite to reduce duplicated code

2021-05-11 Thread GitBox
xkrogen commented on a change in pull request #32389: URL: https://github.com/apache/spark/pull/32389#discussion_r630300027 ## File path: .idea/vcs.xml ## @@ -1,24 +1,16 @@ - Review comment: Oops! Good catch @otterc, thank you! -- This is an automated message fro

[GitHub] [spark] vinodkc commented on a change in pull request #32411: [SPARK-28551][SQL]CTAS with LOCATION , should not allow to a non-empty directory.

2021-05-11 Thread GitBox
vinodkc commented on a change in pull request #32411: URL: https://github.com/apache/spark/pull/32411#discussion_r630278821 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/command/DataWritingCommand.scala ## @@ -96,4 +98,23 @@ object DataWritingCommand {

[GitHub] [spark] otterc commented on pull request #32389: [SPARK-35263] [TEST] Refactor ShuffleBlockFetcherIteratorSuite to reduce duplicated code

2021-05-11 Thread GitBox
otterc commented on pull request #32389: URL: https://github.com/apache/spark/pull/32389#issuecomment-838683440 @xkrogen @Ngone51 @mridulm Gentle ping folks to check if this PR can be merged. To me it looks like it is ready. I need to rework the tests in my [PR](https://github.com/apach

[GitHub] [spark] AmplabJenkins removed a comment on pull request #32503: [WIP] better error message

2021-05-11 Thread GitBox
AmplabJenkins removed a comment on pull request #32503: URL: https://github.com/apache/spark/pull/32503#issuecomment-838674983 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/42905/

[GitHub] [spark] AmplabJenkins commented on pull request #32503: [WIP] better error message

2021-05-11 Thread GitBox
AmplabJenkins commented on pull request #32503: URL: https://github.com/apache/spark/pull/32503#issuecomment-838674983 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/42905/ -- T

[GitHub] [spark] SparkQA commented on pull request #32503: [WIP] better error message

2021-05-11 Thread GitBox
SparkQA commented on pull request #32503: URL: https://github.com/apache/spark/pull/32503#issuecomment-838674855 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries

[GitHub] [spark] SparkQA commented on pull request #32504: [Minor][SPARK-35013]Don't allow to set spark.driver.cores=0

2021-05-11 Thread GitBox
SparkQA commented on pull request #32504: URL: https://github.com/apache/spark/pull/32504#issuecomment-838669907 **[Test build #138383 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/138383/testReport)** for PR 32504 at commit [`b615db6`](https://github.com

[GitHub] [spark] AmplabJenkins removed a comment on pull request #31756: [SPARK-34637] [SQL] Support DPP + AQE when the broadcast exchange can be reused

2021-05-11 Thread GitBox
AmplabJenkins removed a comment on pull request #31756: URL: https://github.com/apache/spark/pull/31756#issuecomment-838665138 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/138376/ -

[GitHub] [spark] AmplabJenkins removed a comment on pull request #32397: [SPARK-35084][CORE] Spark 3: supporting "--packages" in k8s cluster mode

2021-05-11 Thread GitBox
AmplabJenkins removed a comment on pull request #32397: URL: https://github.com/apache/spark/pull/32397#issuecomment-838665141 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/42903/

[GitHub] [spark] AmplabJenkins removed a comment on pull request #32499: [SPARK-29145][SQL][FOLLOWUP] Support sub-queries in join conditions

2021-05-11 Thread GitBox
AmplabJenkins removed a comment on pull request #32499: URL: https://github.com/apache/spark/pull/32499#issuecomment-838665143 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/42904/

[GitHub] [spark] AmplabJenkins commented on pull request #32397: [SPARK-35084][CORE] Spark 3: supporting "--packages" in k8s cluster mode

2021-05-11 Thread GitBox
AmplabJenkins commented on pull request #32397: URL: https://github.com/apache/spark/pull/32397#issuecomment-838665141 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/42903/ -- T

[GitHub] [spark] AmplabJenkins commented on pull request #31756: [SPARK-34637] [SQL] Support DPP + AQE when the broadcast exchange can be reused

2021-05-11 Thread GitBox
AmplabJenkins commented on pull request #31756: URL: https://github.com/apache/spark/pull/31756#issuecomment-838665138 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/138376/ -- This

[GitHub] [spark] AmplabJenkins commented on pull request #32499: [SPARK-29145][SQL][FOLLOWUP] Support sub-queries in join conditions

2021-05-11 Thread GitBox
AmplabJenkins commented on pull request #32499: URL: https://github.com/apache/spark/pull/32499#issuecomment-838665143 Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/42904/ -- T

[GitHub] [spark] huaxingao commented on pull request #32473: [SPARK-35345][SQL] Add Parquet tests to BloomFilterBenchmark

2021-05-11 Thread GitBox
huaxingao commented on pull request #32473: URL: https://github.com/apache/spark/pull/32473#issuecomment-838657326 @ggershinsky Hi Gidon, thank you very much for your explanation. It's very helpful. I will rerun the test per your suggestion. This test only uses toy data. For the real data

[GitHub] [spark] shahidki31 opened a new pull request #32504: [Minor][SPARK-35013]Don't allow to set spark.driver.cores=0

2021-05-11 Thread GitBox
shahidki31 opened a new pull request #32504: URL: https://github.com/apache/spark/pull/32504 ### What changes were proposed in this pull request? Currently Spark does not allow setting spark.driver.memory, spark.executor.cores, or spark.executor.memory to 0, but it does allow driver core

[GitHub] [spark] ocworld commented on pull request #32397: [SPARK-35084][CORE] Spark 3: supporting "--packages" in k8s cluster mode

2021-05-11 Thread GitBox
ocworld commented on pull request #32397: URL: https://github.com/apache/spark/pull/32397#issuecomment-838655508 @dongjoon-hyun I reviewed "BasicTestsSuite" and "DepsTestsSuite", and I think it is hard to add a test case to them, because the test would need to get spark.jars in the runnin
