[GitHub] [spark] SparkQA removed a comment on pull request #29102: [SPARK-32303][PYTHON][BUILD] Remove leftover from editable mode installation in PIP test

2020-07-14 Thread GitBox
SparkQA removed a comment on pull request #29102: URL: https://github.com/apache/spark/pull/29102#issuecomment-658106124 **[Test build #125825 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125825/testReport)** for PR 29102 at commit

[GitHub] [spark] SparkQA commented on pull request #29102: [SPARK-32303][PYTHON][BUILD] Remove leftover from editable mode installation in PIP test

2020-07-14 Thread GitBox
SparkQA commented on pull request #29102: URL: https://github.com/apache/spark/pull/29102#issuecomment-658200228 **[Test build #125825 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125825/testReport)** for PR 29102 at commit

[GitHub] [spark] cloud-fan commented on a change in pull request #27428: [SPARK-30276][SQL] Support Filter expression allows simultaneous use of DISTINCT

2020-07-14 Thread GitBox
cloud-fan commented on a change in pull request #27428: URL: https://github.com/apache/spark/pull/27428#discussion_r454391588 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/RewriteDistinctAggregates.scala ## @@ -102,23 +102,127 @@ import

[GitHub] [spark] AmplabJenkins commented on pull request #29085: [SPARK-32106][SQL]Implement SparkScriptTransformationExec in sql/core

2020-07-14 Thread GitBox
AmplabJenkins commented on pull request #29085: URL: https://github.com/apache/spark/pull/29085#issuecomment-658215008 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #29002: [SPARK-32175][CORE] Fix the order between initialization for ExecutorPlugin and starting heartbeat thread

2020-07-14 Thread GitBox
SparkQA commented on pull request #29002: URL: https://github.com/apache/spark/pull/29002#issuecomment-658239160 **[Test build #125831 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125831/testReport)** for PR 29002 at commit

[GitHub] [spark] stczwd commented on pull request #28280: [SPARK-31438][CORE][SQL] Support JobCleaned Status in SparkListener

2020-07-14 Thread GitBox
stczwd commented on pull request #28280: URL: https://github.com/apache/spark/pull/28280#issuecomment-658243169 kindly remind @rdblue @cloud-fan @xuanyuanking.

[GitHub] [spark] gaborgsomogyi commented on pull request #29024: [WIP][SPARK-32001][SQL]Create JDBC authentication provider developer API

2020-07-14 Thread GitBox
gaborgsomogyi commented on pull request #29024: URL: https://github.com/apache/spark/pull/29024#issuecomment-658258741 As I see the unit tests passed, removing the WIP.

[GitHub] [spark] AmplabJenkins commented on pull request #29100: [MINOR][R] Match collectAsArrowToR with non-streaming collectAsArrowToPython

2020-07-14 Thread GitBox
AmplabJenkins commented on pull request #29100: URL: https://github.com/apache/spark/pull/29100#issuecomment-658259487

[GitHub] [spark] ScrapCodes commented on pull request #28423: [SPARK-24266][k8s] Restart the watcher when we receive a version changed from k8s

2020-07-14 Thread GitBox
ScrapCodes commented on pull request #28423: URL: https://github.com/apache/spark/pull/28423#issuecomment-658153445 > > BTW, when do we receive a version changed from K8s? > > It happens when etcd compaction kicks in for example. On aws EKS I never saw this happening on EKS 1.14,

[GitHub] [spark] SparkQA commented on pull request #29002: [SPARK-32175][CORE] Fix the order between initialization for ExecutorPlugin and starting heartbeat thread

2020-07-14 Thread GitBox
SparkQA commented on pull request #29002: URL: https://github.com/apache/spark/pull/29002#issuecomment-658153415 **[Test build #125831 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125831/testReport)** for PR 29002 at commit

[GitHub] [spark] SparkQA commented on pull request #28939: [SPARK-32119][CORE] ExecutorPlugin doesn't work with Standalone Cluster

2020-07-14 Thread GitBox
SparkQA commented on pull request #28939: URL: https://github.com/apache/spark/pull/28939#issuecomment-658153432 **[Test build #125832 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125832/testReport)** for PR 28939 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28287: [SPARK-31418][SCHEDULER] Request more executors in case of dynamic allocation is enabled and a task becomes unschedulable due t

2020-07-14 Thread GitBox
AmplabJenkins removed a comment on pull request #28287: URL: https://github.com/apache/spark/pull/28287#issuecomment-658180547

[GitHub] [spark] SparkQA removed a comment on pull request #28917: [SPARK-31847][CORE][TESTS] DAGSchedulerSuite: Rewrite the test framework to support apply specified spark configurations.

2020-07-14 Thread GitBox
SparkQA removed a comment on pull request #28917: URL: https://github.com/apache/spark/pull/28917#issuecomment-658099896 **[Test build #125822 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125822/testReport)** for PR 28917 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #28917: [SPARK-31847][CORE][TESTS] DAGSchedulerSuite: Rewrite the test framework to support apply specified spark configurations.

2020-07-14 Thread GitBox
AmplabJenkins commented on pull request #28917: URL: https://github.com/apache/spark/pull/28917#issuecomment-658181969

[GitHub] [spark] ulysses-you commented on a change in pull request #28840: [SPARK-31999][SQL] Add REFRESH FUNCTION command

2020-07-14 Thread GitBox
ulysses-you commented on a change in pull request #28840: URL: https://github.com/apache/spark/pull/28840#discussion_r454365461 ## File path: sql/core/src/test/scala/org/apache/spark/sql/execution/command/DDLSuite.scala ## @@ -3030,6 +3030,47 @@ abstract class DDLSuite

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29102: [SPARK-32303][PYTHON][BUILD] Remove leftover from editable mode installation in PIP test

2020-07-14 Thread GitBox
AmplabJenkins removed a comment on pull request #29102: URL: https://github.com/apache/spark/pull/29102#issuecomment-658235055 Merged build finished. Test FAILed.

[GitHub] [spark] SparkQA commented on pull request #28939: [SPARK-32119][CORE] ExecutorPlugin doesn't work with Standalone Cluster

2020-07-14 Thread GitBox
SparkQA commented on pull request #28939: URL: https://github.com/apache/spark/pull/28939#issuecomment-658251544 **[Test build #125832 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125832/testReport)** for PR 28939 at commit

[GitHub] [spark] SparkQA commented on pull request #29088: [SPARK-32289][SQL] Some characters are garbled when opening csv files with Excel

2020-07-14 Thread GitBox
SparkQA commented on pull request #29088: URL: https://github.com/apache/spark/pull/29088#issuecomment-658160091 **[Test build #125833 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125833/testReport)** for PR 29088 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29098: [SPARK-32300][PYTHON][2.4] toPandas should work from a Spark DataFrame with no partitions

2020-07-14 Thread GitBox
AmplabJenkins removed a comment on pull request #29098: URL: https://github.com/apache/spark/pull/29098#issuecomment-658167831

[GitHub] [spark] AmplabJenkins commented on pull request #29098: [SPARK-32300][PYTHON][2.4] toPandas should work from a Spark DataFrame with no partitions

2020-07-14 Thread GitBox
AmplabJenkins commented on pull request #29098: URL: https://github.com/apache/spark/pull/29098#issuecomment-658167831

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29067: [SPARK-32274][SQL] Make SQL cache serialization pluggable

2020-07-14 Thread GitBox
AmplabJenkins removed a comment on pull request #29067: URL: https://github.com/apache/spark/pull/29067#issuecomment-658172413

[GitHub] [spark] SparkQA commented on pull request #29102: [SPARK-32303][PYTHON][BUILD] Remove leftover from editable mode installation in PIP test

2020-07-14 Thread GitBox
SparkQA commented on pull request #29102: URL: https://github.com/apache/spark/pull/29102#issuecomment-658179738 **[Test build #125837 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125837/testReport)** for PR 29102 at commit

[GitHub] [spark] SparkQA commented on pull request #28287: [SPARK-31418][SCHEDULER] Request more executors in case of dynamic allocation is enabled and a task becomes unschedulable due to spark's blac

2020-07-14 Thread GitBox
SparkQA commented on pull request #28287: URL: https://github.com/apache/spark/pull/28287#issuecomment-658179776 **[Test build #125838 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125838/testReport)** for PR 28287 at commit

[GitHub] [spark] tgravescs commented on pull request #28874: [SPARK-32036] Replace references to blacklist/whitelist language with more appropriate terminology, excluding the blacklisting feature.

2020-07-14 Thread GitBox
tgravescs commented on pull request #28874: URL: https://github.com/apache/spark/pull/28874#issuecomment-658179951 @xkrogen unfortunately it appears there is another conflict; could you update again?

[GitHub] [spark] beliefer commented on a change in pull request #27428: [SPARK-30276][SQL] Support Filter expression allows simultaneous use of DISTINCT

2020-07-14 Thread GitBox
beliefer commented on a change in pull request #27428: URL: https://github.com/apache/spark/pull/27428#discussion_r454386912 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/RewriteDistinctAggregates.scala ## @@ -102,23 +102,127 @@ import

[GitHub] [spark] AngersZhuuuu commented on pull request #29085: [SPARK-32106][SQL]Implement SparkScriptTransformationExec in sql/core

2020-07-14 Thread GitBox
AngersZhuuuu commented on pull request #29085: URL: https://github.com/apache/spark/pull/29085#issuecomment-658223626 > What's the behavior of hive if the script transformation doesn't specify a serde? Does Hive pick a default serde, or it well defines the behavior of non-serde? In

[GitHub] [spark] AmplabJenkins commented on pull request #28874: [SPARK-32036] Replace references to blacklist/whitelist language with more appropriate terminology, excluding the blacklisting feature.

2020-07-14 Thread GitBox
AmplabJenkins commented on pull request #28874: URL: https://github.com/apache/spark/pull/28874#issuecomment-658249130

[GitHub] [spark] xkrogen commented on pull request #28874: [SPARK-32036] Replace references to blacklist/whitelist language with more appropriate terminology, excluding the blacklisting feature.

2020-07-14 Thread GitBox
xkrogen commented on pull request #28874: URL: https://github.com/apache/spark/pull/28874#issuecomment-658248814 Playing a game of whack-a-mole here :) Thanks for the heads up @tgravescs . Pushed up another conflict resolution.

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29104: [SPARK-32290][SQL] NotInSubquery SingleColumn Optimize

2020-07-14 Thread GitBox
AmplabJenkins removed a comment on pull request #29104: URL: https://github.com/apache/spark/pull/29104#issuecomment-658256117 Can one of the admins verify this patch?

[GitHub] [spark] AmplabJenkins commented on pull request #29104: [SPARK-32290][SQL] NotInSubquery SingleColumn Optimize

2020-07-14 Thread GitBox
AmplabJenkins commented on pull request #29104: URL: https://github.com/apache/spark/pull/29104#issuecomment-658256805 Can one of the admins verify this patch?

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29098: [SPARK-32300][PYTHON][2.4] toPandas should work from a Spark DataFrame with no partitions

2020-07-14 Thread GitBox
AmplabJenkins removed a comment on pull request #29098: URL: https://github.com/apache/spark/pull/29098#issuecomment-658140318

[GitHub] [spark] AmplabJenkins commented on pull request #29098: [SPARK-32300][PYTHON][2.4] toPandas should work from a Spark DataFrame with no partitions

2020-07-14 Thread GitBox
AmplabJenkins commented on pull request #29098: URL: https://github.com/apache/spark/pull/29098#issuecomment-658140318

[GitHub] [spark] SparkQA removed a comment on pull request #29024: [WIP][SPARK-32001][SQL]Create JDBC authentication provider developer API

2020-07-14 Thread GitBox
SparkQA removed a comment on pull request #29024: URL: https://github.com/apache/spark/pull/29024#issuecomment-658069123 **[Test build #125820 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125820/testReport)** for PR 29024 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #29103: [SPARK-32305][BUILD]Automatically cleanup meta data generated by test cases not in target dir

2020-07-14 Thread GitBox
AmplabJenkins commented on pull request #29103: URL: https://github.com/apache/spark/pull/29103#issuecomment-658146644 Can one of the admins verify this patch?

[GitHub] [spark] SparkQA commented on pull request #29024: [WIP][SPARK-32001][SQL]Create JDBC authentication provider developer API

2020-07-14 Thread GitBox
SparkQA commented on pull request #29024: URL: https://github.com/apache/spark/pull/29024#issuecomment-658146504 **[Test build #125820 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125820/testReport)** for PR 29024 at commit

[GitHub] [spark] wangyum commented on a change in pull request #29088: [SPARK-32289][SQL] Some characters are garbled when opening csv files with Excel

2020-07-14 Thread GitBox
wangyum commented on a change in pull request #29088: URL: https://github.com/apache/spark/pull/29088#discussion_r454328505 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/csv/CSVOptions.scala ## @@ -135,6 +135,8 @@ class CSVOptions( val positiveInf

[GitHub] [spark] wangyum commented on a change in pull request #29088: [SPARK-32289][SQL] Some characters are garbled when opening csv files with Excel

2020-07-14 Thread GitBox
wangyum commented on a change in pull request #29088: URL: https://github.com/apache/spark/pull/29088#discussion_r454328241 ## File path: sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/csv/CSVSuite.scala ## @@ -2353,6 +2355,53 @@ abstract class CSVSuite

[GitHub] [spark] MrPowers commented on pull request #23877: [SPARK-26449][PYTHON] Add transform method to DataFrame API

2020-07-14 Thread GitBox
MrPowers commented on pull request #23877: URL: https://github.com/apache/spark/pull/23877#issuecomment-658165960 @Hellsen83 - I updated [the blog post](https://mungingdata.com/pyspark/chaining-dataframe-transformations/) to add your suggested convention of using `_` as the inner function

[GitHub] [spark] AmplabJenkins commented on pull request #29067: [SPARK-32274][SQL] Make SQL cache serialization pluggable

2020-07-14 Thread GitBox
AmplabJenkins commented on pull request #29067: URL: https://github.com/apache/spark/pull/29067#issuecomment-658172413

[GitHub] [spark] SparkQA commented on pull request #29090: [WIP][SPARK-32293] Fix inconsistency between Spark memory configs and JVM option

2020-07-14 Thread GitBox
SparkQA commented on pull request #29090: URL: https://github.com/apache/spark/pull/29090#issuecomment-658184244 **[Test build #125826 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125826/testReport)** for PR 29090 at commit

[GitHub] [spark] SparkQA commented on pull request #28412: [SPARK-31608][CORE][WEBUI] Add a new type of KVStore to make loading UI faster

2020-07-14 Thread GitBox
SparkQA commented on pull request #28412: URL: https://github.com/apache/spark/pull/28412#issuecomment-658211083 **[Test build #125841 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125841/testReport)** for PR 28412 at commit

[GitHub] [spark] SparkQA commented on pull request #29085: [SPARK-32106][SQL]Implement SparkScriptTransformationExec in sql/core

2020-07-14 Thread GitBox
SparkQA commented on pull request #29085: URL: https://github.com/apache/spark/pull/29085#issuecomment-658211049 **[Test build #125840 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125840/testReport)** for PR 29085 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #28412: [SPARK-31608][CORE][WEBUI] Add a new type of KVStore to make loading UI faster

2020-07-14 Thread GitBox
AmplabJenkins commented on pull request #28412: URL: https://github.com/apache/spark/pull/28412#issuecomment-658211580

[GitHub] [spark] AmplabJenkins commented on pull request #28917: [SPARK-31847][CORE][TESTS] DAGSchedulerSuite: Rewrite the test framework to support apply specified spark configurations.

2020-07-14 Thread GitBox
AmplabJenkins commented on pull request #28917: URL: https://github.com/apache/spark/pull/28917#issuecomment-658211690

[GitHub] [spark] cloud-fan commented on a change in pull request #29045: [SPARK-32234][SQL] Spark sql commands are failing on selecting the orc tables

2020-07-14 Thread GitBox
cloud-fan commented on a change in pull request #29045: URL: https://github.com/apache/spark/pull/29045#discussion_r454396181 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcUtils.scala ## @@ -116,47 +116,53 @@ object OrcUtils extends

[GitHub] [spark] beliefer commented on pull request #28917: [SPARK-31847][CORE][TESTS] DAGSchedulerSuite: Rewrite the test framework to support apply specified spark configurations.

2020-07-14 Thread GitBox
beliefer commented on pull request #28917: URL: https://github.com/apache/spark/pull/28917#issuecomment-658210689 retest this please

[GitHub] [spark] AmplabJenkins commented on pull request #29085: [SPARK-32106][SQL]Implement SparkScriptTransformationExec in sql/core

2020-07-14 Thread GitBox
AmplabJenkins commented on pull request #29085: URL: https://github.com/apache/spark/pull/29085#issuecomment-658211620

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29102: [SPARK-32303][PYTHON][BUILD] Remove leftover from editable mode installation in PIP test

2020-07-14 Thread GitBox
AmplabJenkins removed a comment on pull request #29102: URL: https://github.com/apache/spark/pull/29102#issuecomment-658235069 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] leanken opened a new pull request #29104: [SPARK-32290][SQL] NotInSubquery SingleColumn Optimize

2020-07-14 Thread GitBox
leanken opened a new pull request #29104: URL: https://github.com/apache/spark/pull/29104 ### What changes were proposed in this pull request? Normally, a NotInSubquery will be planned into BroadcastNestedLoopJoin, which is very time-consuming, for instance in TPCH Query 16.

[GitHub] [spark] SparkQA removed a comment on pull request #29098: [SPARK-32300][PYTHON][2.4] toPandas should work from a Spark DataFrame with no partitions

2020-07-14 Thread GitBox
SparkQA removed a comment on pull request #29098: URL: https://github.com/apache/spark/pull/29098#issuecomment-658053443 **[Test build #125815 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125815/testReport)** for PR 29098 at commit

[GitHub] [spark] SparkQA commented on pull request #29098: [SPARK-32300][PYTHON][2.4] toPandas should work from a Spark DataFrame with no partitions

2020-07-14 Thread GitBox
SparkQA commented on pull request #29098: URL: https://github.com/apache/spark/pull/29098#issuecomment-658139766 **[Test build #125815 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125815/testReport)** for PR 29098 at commit

[GitHub] [spark] gaborgsomogyi removed a comment on pull request #29024: [WIP][SPARK-32001][SQL]Create JDBC authentication provider developer API

2020-07-14 Thread GitBox
gaborgsomogyi removed a comment on pull request #29024: URL: https://github.com/apache/spark/pull/29024#issuecomment-658147329 retest this pleases

[GitHub] [spark] gaborgsomogyi commented on pull request #29024: [WIP][SPARK-32001][SQL]Create JDBC authentication provider developer API

2020-07-14 Thread GitBox
gaborgsomogyi commented on pull request #29024: URL: https://github.com/apache/spark/pull/29024#issuecomment-658149363 retest this please

[GitHub] [spark] cloud-fan commented on a change in pull request #28840: [SPARK-31999][SQL] Add REFRESH FUNCTION command

2020-07-14 Thread GitBox
cloud-fan commented on a change in pull request #28840: URL: https://github.com/apache/spark/pull/28840#discussion_r454336315 ## File path: sql/core/src/test/scala/org/apache/spark/sql/execution/command/DDLSuite.scala ## @@ -3030,6 +3030,47 @@ abstract class DDLSuite extends

[GitHub] [spark] cloud-fan commented on a change in pull request #28840: [SPARK-31999][SQL] Add REFRESH FUNCTION command

2020-07-14 Thread GitBox
cloud-fan commented on a change in pull request #28840: URL: https://github.com/apache/spark/pull/28840#discussion_r454336540 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/command/functions.scala ## @@ -236,6 +236,46 @@ case class ShowFunctionsCommand(

[GitHub] [spark] AmplabJenkins commented on pull request #28840: [SPARK-31999][SQL] Add REFRESH FUNCTION command

2020-07-14 Thread GitBox
AmplabJenkins commented on pull request #28840: URL: https://github.com/apache/spark/pull/28840#issuecomment-658164151

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28840: [SPARK-31999][SQL] Add REFRESH FUNCTION command

2020-07-14 Thread GitBox
AmplabJenkins removed a comment on pull request #28840: URL: https://github.com/apache/spark/pull/28840#issuecomment-658164151

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29102: [SPARK-32303][PYTHON][BUILD] Remove leftover from editable mode installation in PIP test

2020-07-14 Thread GitBox
AmplabJenkins removed a comment on pull request #29102: URL: https://github.com/apache/spark/pull/29102#issuecomment-658171151 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] attilapiros commented on pull request #29102: [SPARK-32303][PYTHON][BUILD] Remove leftover from editable mode installation in PIP test

2020-07-14 Thread GitBox
attilapiros commented on pull request #29102: URL: https://github.com/apache/spark/pull/29102#issuecomment-658179166 retest this please

[GitHub] [spark] AmplabJenkins commented on pull request #28840: [SPARK-31999][SQL] Add REFRESH FUNCTION command

2020-07-14 Thread GitBox
AmplabJenkins commented on pull request #28840: URL: https://github.com/apache/spark/pull/28840#issuecomment-658189140

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28840: [SPARK-31999][SQL] Add REFRESH FUNCTION command

2020-07-14 Thread GitBox
AmplabJenkins removed a comment on pull request #28840: URL: https://github.com/apache/spark/pull/28840#issuecomment-658189140

[GitHub] [spark] stijndehaes commented on pull request #28423: [SPARK-24266][k8s] Restart the watcher when we receive a version changed from k8s

2020-07-14 Thread GitBox
stijndehaes commented on pull request #28423: URL: https://github.com/apache/spark/pull/28423#issuecomment-658205704 > > > BTW, when do we receive a version changed from K8s? > > > > > > It happens when etcd compaction kicks in for example. On aws EKS I never saw this happening

[GitHub] [spark] koeninger commented on pull request #21038: [SPARK-22968][DStream] Throw an exception on partition revoking issue

2020-07-14 Thread GitBox
koeninger commented on pull request #21038: URL: https://github.com/apache/spark/pull/21038#issuecomment-658234795 Why can't you use a different group id?

[GitHub] [spark] srowen commented on a change in pull request #29088: [SPARK-32289][SQL] Some characters are garbled when opening csv files with Excel

2020-07-14 Thread GitBox
srowen commented on a change in pull request #29088: URL: https://github.com/apache/spark/pull/29088#discussion_r454436084 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/csv/CsvOutputWriter.scala ## @@ -39,6 +39,10 @@ class CsvOutputWriter(

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29002: [SPARK-32175][CORE] Fix the order between initialization for ExecutorPlugin and starting heartbeat thread

2020-07-14 Thread GitBox
AmplabJenkins removed a comment on pull request #29002: URL: https://github.com/apache/spark/pull/29002#issuecomment-658240283 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29002: [SPARK-32175][CORE] Fix the order between initialization for ExecutorPlugin and starting heartbeat thread

2020-07-14 Thread GitBox
AmplabJenkins removed a comment on pull request #29002: URL: https://github.com/apache/spark/pull/29002#issuecomment-658240267 Merged build finished. Test FAILed.

[GitHub] [spark] AmplabJenkins commented on pull request #29002: [SPARK-32175][CORE] Fix the order between initialization for ExecutorPlugin and starting heartbeat thread

2020-07-14 Thread GitBox
AmplabJenkins commented on pull request #29002: URL: https://github.com/apache/spark/pull/29002#issuecomment-658240267

[GitHub] [spark] attilapiros commented on pull request #29102: [SPARK-32303][PYTHON][BUILD] Remove leftover from editable mode installation in PIP test

2020-07-14 Thread GitBox
attilapiros commented on pull request #29102: URL: https://github.com/apache/spark/pull/29102#issuecomment-658248290 retest this please

[GitHub] [spark] cloud-fan commented on a change in pull request #29045: [SPARK-32234][SQL] Spark sql commands are failing on selecting the orc tables

2020-07-14 Thread GitBox
cloud-fan commented on a change in pull request #29045: URL: https://github.com/apache/spark/pull/29045#discussion_r454475026 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcFileFormat.scala ## @@ -160,12 +160,12 @@ class OrcFileFormat

[GitHub] [spark] cloud-fan commented on a change in pull request #29045: [SPARK-32234][SQL] Spark sql commands are failing on selecting the orc tables

2020-07-14 Thread GitBox
cloud-fan commented on a change in pull request #29045: URL: https://github.com/apache/spark/pull/29045#discussion_r454476253 ## File path: sql/hive/src/test/scala/org/apache/spark/sql/hive/orc/HiveOrcQuerySuite.scala ## @@ -288,4 +288,35 @@ class HiveOrcQuerySuite extends

[GitHub] [spark] AmplabJenkins commented on pull request #29105: [SPARK-25075] Fix some complilation error scala 2.13 part 1

2020-07-14 Thread GitBox
AmplabJenkins commented on pull request #29105: URL: https://github.com/apache/spark/pull/29105#issuecomment-658280775 Can one of the admins verify this patch?

[GitHub] [spark] cloud-fan commented on pull request #29085: [SPARK-32106][SQL]Implement SparkScriptTransformationExec in sql/core

2020-07-14 Thread GitBox
cloud-fan commented on pull request #29085: URL: https://github.com/apache/spark/pull/29085#issuecomment-658282891 so eventually we don't need to use `Cast` to convert catalyst value to string? There will always be a serde (default or user-specified).

[GitHub] [spark] AmplabJenkins commented on pull request #28708: [SPARK-20629][CORE][K8S] Copy shuffle data when nodes are being shutdown

2020-07-14 Thread GitBox
AmplabJenkins commented on pull request #28708: URL: https://github.com/apache/spark/pull/28708#issuecomment-658291045

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28708: [SPARK-20629][CORE][K8S] Copy shuffle data when nodes are being shutdown

2020-07-14 Thread GitBox
AmplabJenkins removed a comment on pull request #28708: URL: https://github.com/apache/spark/pull/28708#issuecomment-658185501

[GitHub] [spark] AmplabJenkins commented on pull request #29109: [SPARK-32311][PYSPARK] Remove duplicate import

2020-07-14 Thread GitBox
AmplabJenkins commented on pull request #29109: URL: https://github.com/apache/spark/pull/29109#issuecomment-658321561 Can one of the admins verify this patch?

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29110: [SPARK-32313][PYSPARK] Remove Python 2 artifacts

2020-07-14 Thread GitBox
AmplabJenkins removed a comment on pull request #29110: URL: https://github.com/apache/spark/pull/29110#issuecomment-658325327 Can one of the admins verify this patch?

[GitHub] [spark] AmplabJenkins commented on pull request #29089: [SPARK-32276][SQL] Remove redundant sorts before repartition nodes

2020-07-14 Thread GitBox
AmplabJenkins commented on pull request #29089: URL: https://github.com/apache/spark/pull/29089#issuecomment-658326302

[GitHub] [spark] aokolnychyi commented on a change in pull request #29089: [SPARK-32276][SQL] Remove redundant sorts before repartition nodes

2020-07-14 Thread GitBox
aokolnychyi commented on a change in pull request #29089: URL: https://github.com/apache/spark/pull/29089#discussion_r454541782 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/Optimizer.scala ## @@ -981,6 +982,10 @@ object EliminateSorts

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29089: [SPARK-32276][SQL] Remove redundant sorts before repartition nodes

2020-07-14 Thread GitBox
AmplabJenkins removed a comment on pull request #29089: URL: https://github.com/apache/spark/pull/29089#issuecomment-658326302 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] SaurabhChawla100 commented on a change in pull request #29045: [SPARK-32234][SQL] Spark sql commands are failing on selecting the orc tables

2020-07-14 Thread GitBox
SaurabhChawla100 commented on a change in pull request #29045: URL: https://github.com/apache/spark/pull/29045#discussion_r454558890 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/orc/OrcPartitionReaderFactory.scala ## @@ -74,7 +74,7 @@

[GitHub] [spark] AmplabJenkins commented on pull request #29045: [SPARK-32234][SQL] Spark sql commands are failing on selecting the orc tables

2020-07-14 Thread GitBox
AmplabJenkins commented on pull request #29045: URL: https://github.com/apache/spark/pull/29045#issuecomment-658351300 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #29045: [SPARK-32234][SQL] Spark sql commands are failing on selecting the orc tables

2020-07-14 Thread GitBox
SparkQA commented on pull request #29045: URL: https://github.com/apache/spark/pull/29045#issuecomment-658350754 **[Test build #125851 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125851/testReport)** for PR 29045 at commit

[GitHub] [spark] SaurabhChawla100 commented on a change in pull request #29045: [SPARK-32234][SQL] Spark sql commands are failing on selecting the orc tables

2020-07-14 Thread GitBox
SaurabhChawla100 commented on a change in pull request #29045: URL: https://github.com/apache/spark/pull/29045#discussion_r454570883 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcUtils.scala ## @@ -116,47 +116,53 @@ object OrcUtils

[GitHub] [spark] dongjoon-hyun closed pull request #29108: [SPARK-32309][PYSPARK] Import missing sys import

2020-07-14 Thread GitBox
dongjoon-hyun closed pull request #29108: URL: https://github.com/apache/spark/pull/29108 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[GitHub] [spark] aokolnychyi commented on a change in pull request #29089: [SPARK-32276][SQL] Remove redundant sorts before repartition nodes

2020-07-14 Thread GitBox
aokolnychyi commented on a change in pull request #29089: URL: https://github.com/apache/spark/pull/29089#discussion_r454601951 ## File path: sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/optimizer/EliminateSortsBeforeRepartitionSuite.scala ## @@ -0,0 +1,179 @@

[GitHub] [spark] dongjoon-hyun closed pull request #29109: [SPARK-32311][PYSPARK][TESTS] Remove duplicate import

2020-07-14 Thread GitBox
dongjoon-hyun closed pull request #29109: URL: https://github.com/apache/spark/pull/29109 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[GitHub] [spark] Fokko commented on pull request #29109: [SPARK-32311][PYSPARK][TESTS] Remove duplicate import

2020-07-14 Thread GitBox
Fokko commented on pull request #29109: URL: https://github.com/apache/spark/pull/29109#issuecomment-658377938 Thanks for picking this up so quickly @dongjoon-hyun This is an automated message from the Apache Git Service.

[GitHub] [spark] rdblue commented on a change in pull request #29089: [SPARK-32276][SQL] Remove redundant sorts before repartition nodes

2020-07-14 Thread GitBox
rdblue commented on a change in pull request #29089: URL: https://github.com/apache/spark/pull/29089#discussion_r454617890 ## File path: sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/optimizer/EliminateSortsBeforeRepartitionSuite.scala ## @@ -0,0 +1,179 @@ +/* + *

[GitHub] [spark] SparkQA commented on pull request #29089: [SPARK-32276][SQL] Remove redundant sorts before repartition nodes

2020-07-14 Thread GitBox
SparkQA commented on pull request #29089: URL: https://github.com/apache/spark/pull/29089#issuecomment-658397973 **[Test build #125847 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125847/testReport)** for PR 29089 at commit

[GitHub] [spark] Ngone51 opened a new pull request #29106: [SPARK-32307][SQL] ScalaUDF's canonicalized expression should exclude inputEncoders

2020-07-14 Thread GitBox
Ngone51 opened a new pull request #29106: URL: https://github.com/apache/spark/pull/29106 ### What changes were proposed in this pull request? Override `canonicalized` to empty the `inputEncoders` for the canonicalized `ScalaUDF`. ### Why are the changes

[GitHub] [spark] SaurabhChawla100 commented on a change in pull request #29045: [SPARK-32234][SQL] Spark sql commands are failing on selecting the orc tables

2020-07-14 Thread GitBox
SaurabhChawla100 commented on a change in pull request #29045: URL: https://github.com/apache/spark/pull/29045#discussion_r454491048 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcFileFormat.scala ## @@ -160,12 +160,12 @@ class

[GitHub] [spark] SparkQA commented on pull request #29107: [SPARK-32308][SQL] Move by-name resolution logic of unionByName from API code to analysis phase

2020-07-14 Thread GitBox
SparkQA commented on pull request #29107: URL: https://github.com/apache/spark/pull/29107#issuecomment-658292521 **[Test build #125845 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125845/testReport)** for PR 29107 at commit

[GitHub] [spark] SparkQA commented on pull request #29106: [SPARK-32307][SQL] ScalaUDF's canonicalized expression should exclude inputEncoders

2020-07-14 Thread GitBox
SparkQA commented on pull request #29106: URL: https://github.com/apache/spark/pull/29106#issuecomment-658292625 **[Test build #125846 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125846/testReport)** for PR 29106 at commit

[GitHub] [spark] SaurabhChawla100 commented on a change in pull request #29045: [SPARK-32234][SQL] Spark sql commands are failing on selecting the orc tables

2020-07-14 Thread GitBox
SaurabhChawla100 commented on a change in pull request #29045: URL: https://github.com/apache/spark/pull/29045#discussion_r454524703 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcUtils.scala ## @@ -116,47 +116,53 @@ object OrcUtils

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29108: [SPARK-32309][PYSPARK] Import missing sys import

2020-07-14 Thread GitBox
AmplabJenkins removed a comment on pull request #29108: URL: https://github.com/apache/spark/pull/29108#issuecomment-658317741 Can one of the admins verify this patch? This is an automated message from the Apache Git

[GitHub] [spark] AmplabJenkins commented on pull request #29108: [SPARK-32309][PYSPARK] Import missing sys import

2020-07-14 Thread GitBox
AmplabJenkins commented on pull request #29108: URL: https://github.com/apache/spark/pull/29108#issuecomment-658318431 Can one of the admins verify this patch? This is an automated message from the Apache Git Service. To

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29067: [SPARK-32274][SQL] Make SQL cache serialization pluggable

2020-07-14 Thread GitBox
AmplabJenkins removed a comment on pull request #29067: URL: https://github.com/apache/spark/pull/29067#issuecomment-658317544 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] AmplabJenkins commented on pull request #29110: [SPARK-32313][PYSPARK] Remove Python 2 artifacts

2020-07-14 Thread GitBox
AmplabJenkins commented on pull request #29110: URL: https://github.com/apache/spark/pull/29110#issuecomment-658326014 Can one of the admins verify this patch? This is an automated message from the Apache Git Service. To

[GitHub] [spark] SparkQA commented on pull request #29089: [SPARK-32276][SQL] Remove redundant sorts before repartition nodes

2020-07-14 Thread GitBox
SparkQA commented on pull request #29089: URL: https://github.com/apache/spark/pull/29089#issuecomment-658325499 **[Test build #125847 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125847/testReport)** for PR 29089 at commit
