[GitHub] [spark] HyukjinKwon removed a comment on pull request #29846: [SPARK-32971][K8S] Support dynamic PVC creation/deletion for K8s executors

2020-09-23 Thread GitBox
HyukjinKwon removed a comment on pull request #29846: URL: https://github.com/apache/spark/pull/29846#issuecomment-697308125 cc @holdenk, could you take a look for the test failure please? Two PRs are blocked by it. This is

[GitHub] [spark] HyukjinKwon commented on pull request #29846: [SPARK-32971][K8S] Support dynamic PVC creation/deletion for K8s executors

2020-09-23 Thread GitBox
HyukjinKwon commented on pull request #29846: URL: https://github.com/apache/spark/pull/29846#issuecomment-697308125 cc @holdenk, could you take a look for the test failure please? Two PRs are blocked by it. This is an

[GitHub] [spark] HyukjinKwon commented on pull request #29846: [SPARK-32971][K8S] Support dynamic PVC creation/deletion for K8s executors

2020-09-23 Thread GitBox
HyukjinKwon commented on pull request #29846: URL: https://github.com/apache/spark/pull/29846#issuecomment-697308228 cc @holdenk, could you take a look for the test failure please? K8S PRs are blocked by it. This is an

[GitHub] [spark] HyukjinKwon commented on pull request #29806: [SPARK-32187][PYTHON][DOCS] Doc on Python packaging

2020-09-23 Thread GitBox
HyukjinKwon commented on pull request #29806: URL: https://github.com/apache/spark/pull/29806#issuecomment-697314379 Looks pretty good. I will push some changes into your branches by myself. This is an automated message from

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29850: [SPARK-32974][ML] FeatureHasher transform optimization

2020-09-23 Thread GitBox
AmplabJenkins removed a comment on pull request #29850: URL: https://github.com/apache/spark/pull/29850#issuecomment-697313771 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] SparkQA removed a comment on pull request #26935: [SPARK-30294][SS] Explicitly defines read-only StateStore and optimize for HDFSBackedStateStore

2020-09-23 Thread GitBox
SparkQA removed a comment on pull request #26935: URL: https://github.com/apache/spark/pull/26935#issuecomment-697178580 **[Test build #129014 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129014/testReport)** for PR 26935 at commit

[GitHub] [spark] SparkQA commented on pull request #29851: [WIP][SPARK-22674][PYTHON] Removed namedtuple hack and made cloudpickle the default serializer

2020-09-23 Thread GitBox
SparkQA commented on pull request #29851: URL: https://github.com/apache/spark/pull/29851#issuecomment-697320500 **[Test build #129029 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129029/testReport)** for PR 29851 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #25840: [SPARK-29166][SQL] Add parameters to limit the number of dynamic partitions for data source table

2020-09-23 Thread GitBox
AmplabJenkins removed a comment on pull request #25840: URL: https://github.com/apache/spark/pull/25840#issuecomment-697335325 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29756: [SPARK-32885][SS] Add DataStreamReader.table API

2020-09-23 Thread GitBox
AmplabJenkins removed a comment on pull request #29756: URL: https://github.com/apache/spark/pull/29756#issuecomment-697344170 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29756: [SPARK-32885][SS] Add DataStreamReader.table API

2020-09-23 Thread GitBox
AmplabJenkins commented on pull request #29756: URL: https://github.com/apache/spark/pull/29756#issuecomment-697344170 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #25290: [SPARK-28551][SQL]Add a Checker Rule when CTAS SQL with LOCATION

2020-09-23 Thread GitBox
SparkQA commented on pull request #25290: URL: https://github.com/apache/spark/pull/25290#issuecomment-697361751 **[Test build #129031 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129031/testReport)** for PR 25290 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #29798: [SPARK-32931][SQL] Unevaluable Expressions are not Foldable

2020-09-23 Thread GitBox
AmplabJenkins commented on pull request #29798: URL: https://github.com/apache/spark/pull/29798#issuecomment-697470258 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29798: [SPARK-32931][SQL] Unevaluable Expressions are not Foldable

2020-09-23 Thread GitBox
AmplabJenkins removed a comment on pull request #29798: URL: https://github.com/apache/spark/pull/29798#issuecomment-697470258 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] dongjoon-hyun commented on pull request #29846: [SPARK-32971][K8S] Support dynamic PVC creation/deletion for K8s executors

2020-09-23 Thread GitBox
dongjoon-hyun commented on pull request #29846: URL: https://github.com/apache/spark/pull/29846#issuecomment-697539175 cc @holdenk , @dbtsai , @viirya , @sunchao This is an automated message from the Apache Git Service. To

[GitHub] [spark] holdenk commented on pull request #29846: [SPARK-32971][K8S] Support dynamic PVC creation/deletion for K8s executors

2020-09-23 Thread GitBox
holdenk commented on pull request #29846: URL: https://github.com/apache/spark/pull/29846#issuecomment-697539116 I agree this PR isn't touching anything in the decommissioning logic. That being said, I'll spend some time today on the decommissioning integration tests.

[GitHub] [spark] AmplabJenkins removed a comment on pull request #25840: [SPARK-29166][SQL] Add parameters to limit the number of dynamic partitions for data source table

2020-09-23 Thread GitBox
AmplabJenkins removed a comment on pull request #25840: URL: https://github.com/apache/spark/pull/25840#issuecomment-697535821 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] SparkQA commented on pull request #29806: [SPARK-32187][PYTHON][DOCS] Doc on Python packaging

2020-09-23 Thread GitBox
SparkQA commented on pull request #29806: URL: https://github.com/apache/spark/pull/29806#issuecomment-697572196 **[Test build #129036 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129036/testReport)** for PR 29806 at commit

[GitHub] [spark] bogdanghit commented on a change in pull request #29834: [SPARK-32963][SQL] empty string should be consistent for schema name in SparkGetSchemasOperation

2020-09-23 Thread GitBox
bogdanghit commented on a change in pull request #29834: URL: https://github.com/apache/spark/pull/29834#discussion_r493706395 ## File path: sql/hive-thriftserver/src/main/scala/org/apache/spark/sql/hive/thriftserver/SparkGetSchemasOperation.scala ## @@ -77,7 +77,8 @@

[GitHub] [spark] SparkQA commented on pull request #29054: [SPARK-32243][SQL]HiveSessionCatalog call super.makeFunctionExpression should show error message

2020-09-23 Thread GitBox
SparkQA commented on pull request #29054: URL: https://github.com/apache/spark/pull/29054#issuecomment-697309722 **[Test build #129022 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129022/testReport)** for PR 29054 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #29851: [WIP][SPARK-22674][PYTHON] Removed namedtuple hack and made cloudpickle the default serializer

2020-09-23 Thread GitBox
AmplabJenkins commented on pull request #29851: URL: https://github.com/apache/spark/pull/29851#issuecomment-697310359 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA removed a comment on pull request #29054: [SPARK-32243][SQL]HiveSessionCatalog call super.makeFunctionExpression should show error message

2020-09-23 Thread GitBox
SparkQA removed a comment on pull request #29054: URL: https://github.com/apache/spark/pull/29054#issuecomment-697252568 **[Test build #129022 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129022/testReport)** for PR 29054 at commit

[GitHub] [spark] SparkQA commented on pull request #29851: [WIP][SPARK-22674][PYTHON] Removed namedtuple hack and made cloudpickle the default serializer

2020-09-23 Thread GitBox
SparkQA commented on pull request #29851: URL: https://github.com/apache/spark/pull/29851#issuecomment-697310201 **[Test build #129027 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129027/testReport)** for PR 29851 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #29850: [SPARK-32974][ML] FeatureHasher transform optimization

2020-09-23 Thread GitBox
SparkQA removed a comment on pull request #29850: URL: https://github.com/apache/spark/pull/29850#issuecomment-697283027 **[Test build #129024 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129024/testReport)** for PR 29850 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #29851: [WIP][SPARK-22674][PYTHON] Removed namedtuple hack and made cloudpickle the default serializer

2020-09-23 Thread GitBox
AmplabJenkins commented on pull request #29851: URL: https://github.com/apache/spark/pull/29851#issuecomment-697314005 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins commented on pull request #29850: [SPARK-32974][ML] FeatureHasher transform optimization

2020-09-23 Thread GitBox
AmplabJenkins commented on pull request #29850: URL: https://github.com/apache/spark/pull/29850#issuecomment-697313771 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #26935: [SPARK-30294][SS] Explicitly defines read-only StateStore and optimize for HDFSBackedStateStore

2020-09-23 Thread GitBox
SparkQA commented on pull request #26935: URL: https://github.com/apache/spark/pull/26935#issuecomment-697314129 **[Test build #129014 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129014/testReport)** for PR 26935 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29851: [WIP][SPARK-22674][PYTHON] Removed namedtuple hack and made cloudpickle the default serializer

2020-09-23 Thread GitBox
AmplabJenkins removed a comment on pull request #29851: URL: https://github.com/apache/spark/pull/29851#issuecomment-697314005 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29850: [SPARK-32974][ML] FeatureHasher transform optimization

2020-09-23 Thread GitBox
AmplabJenkins removed a comment on pull request #29850: URL: https://github.com/apache/spark/pull/29850#issuecomment-697327197 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29850: [SPARK-32974][ML] FeatureHasher transform optimization

2020-09-23 Thread GitBox
AmplabJenkins commented on pull request #29850: URL: https://github.com/apache/spark/pull/29850#issuecomment-697327197 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AngersZhuuuu commented on pull request #25290: [SPARK-28551][SQL]Add a Checker Rule when CTAS SQL with LOCATION

2020-09-23 Thread GitBox
AngersZh commented on pull request #25290: URL: https://github.com/apache/spark/pull/25290#issuecomment-697360429 retest this please This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [spark] SparkQA commented on pull request #29286: [WIP][SPARK-21708][BUILD] Migrate build to sbt 1.x

2020-09-23 Thread GitBox
SparkQA commented on pull request #29286: URL: https://github.com/apache/spark/pull/29286#issuecomment-697549208 **[Test build #129035 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129035/testReport)** for PR 29286 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29852: [SPARK-21481][ML][FOLLOWUP][Trivial] HashingTF use util.collection.OpenHashMap instead of mutable.HashMap

2020-09-23 Thread GitBox
AmplabJenkins removed a comment on pull request #29852: URL: https://github.com/apache/spark/pull/29852#issuecomment-697305023 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins removed a comment on pull request #26935: [SPARK-30294][SS] Explicitly defines read-only StateStore and optimize for HDFSBackedStateStore

2020-09-23 Thread GitBox
AmplabJenkins removed a comment on pull request #26935: URL: https://github.com/apache/spark/pull/26935#issuecomment-697315495 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #26935: [SPARK-30294][SS] Explicitly defines read-only StateStore and optimize for HDFSBackedStateStore

2020-09-23 Thread GitBox
AmplabJenkins commented on pull request #26935: URL: https://github.com/apache/spark/pull/26935#issuecomment-697315495 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins commented on pull request #29851: [WIP][SPARK-22674][PYTHON] Removed namedtuple hack and made cloudpickle the default serializer

2020-09-23 Thread GitBox
AmplabJenkins commented on pull request #29851: URL: https://github.com/apache/spark/pull/29851#issuecomment-697320718 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA removed a comment on pull request #29851: [WIP][SPARK-22674][PYTHON] Removed namedtuple hack and made cloudpickle the default serializer

2020-09-23 Thread GitBox
SparkQA removed a comment on pull request #29851: URL: https://github.com/apache/spark/pull/29851#issuecomment-697313456 **[Test build #129029 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129029/testReport)** for PR 29851 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29851: [WIP][SPARK-22674][PYTHON] Removed namedtuple hack and made cloudpickle the default serializer

2020-09-23 Thread GitBox
AmplabJenkins removed a comment on pull request #29851: URL: https://github.com/apache/spark/pull/29851#issuecomment-697320718 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29851: [WIP][SPARK-22674][PYTHON] Removed namedtuple hack and made cloudpickle the default serializer

2020-09-23 Thread GitBox
AmplabJenkins removed a comment on pull request #29851: URL: https://github.com/apache/spark/pull/29851#issuecomment-697320723 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] SparkQA commented on pull request #29850: [SPARK-32974][ML] FeatureHasher transform optimization

2020-09-23 Thread GitBox
SparkQA commented on pull request #29850: URL: https://github.com/apache/spark/pull/29850#issuecomment-697326628 **[Test build #129025 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129025/testReport)** for PR 29850 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #29850: [SPARK-32974][ML] FeatureHasher transform optimization

2020-09-23 Thread GitBox
SparkQA removed a comment on pull request #29850: URL: https://github.com/apache/spark/pull/29850#issuecomment-697292058 **[Test build #129025 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129025/testReport)** for PR 29850 at commit

[GitHub] [spark] SparkQA commented on pull request #25840: [SPARK-29166][SQL] Add parameters to limit the number of dynamic partitions for data source table

2020-09-23 Thread GitBox
SparkQA commented on pull request #25840: URL: https://github.com/apache/spark/pull/25840#issuecomment-697334484 **[Test build #129030 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129030/testReport)** for PR 25840 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #29852: [SPARK-21481][ML][FOLLOWUP][Trivial] HashingTF use util.collection.OpenHashMap instead of mutable.HashMap

2020-09-23 Thread GitBox
AmplabJenkins commented on pull request #29852: URL: https://github.com/apache/spark/pull/29852#issuecomment-697348502 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #29852: [SPARK-21481][ML][FOLLOWUP][Trivial] HashingTF use util.collection.OpenHashMap instead of mutable.HashMap

2020-09-23 Thread GitBox
SparkQA commented on pull request #29852: URL: https://github.com/apache/spark/pull/29852#issuecomment-697347990 **[Test build #129028 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129028/testReport)** for PR 29852 at commit

[GitHub] [spark] SparkQA commented on pull request #29798: [SPARK-32931][SQL] Unevaluable Expressions are not Foldable

2020-09-23 Thread GitBox
SparkQA commented on pull request #29798: URL: https://github.com/apache/spark/pull/29798#issuecomment-697467645 **[Test build #129034 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129034/testReport)** for PR 29798 at commit

[GitHub] [spark] viirya commented on a change in pull request #29828: [SPARK-32948][SQL] Optimize to_json and from_json expression chain

2020-09-23 Thread GitBox
viirya commented on a change in pull request #29828: URL: https://github.com/apache/spark/pull/29828#discussion_r493724659 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/OptimizeJsonExprs.scala ## @@ -0,0 +1,35 @@ +/* + * Licensed to the

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29851: [WIP][SPARK-22674][PYTHON] Removed namedtuple hack and made cloudpickle the default serializer

2020-09-23 Thread GitBox
AmplabJenkins removed a comment on pull request #29851: URL: https://github.com/apache/spark/pull/29851#issuecomment-697310359 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To

[GitHub] [spark] AmplabJenkins commented on pull request #29054: [SPARK-32243][SQL]HiveSessionCatalog call super.makeFunctionExpression should show error message

2020-09-23 Thread GitBox
AmplabJenkins commented on pull request #29054: URL: https://github.com/apache/spark/pull/29054#issuecomment-697310722 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA removed a comment on pull request #29851: [WIP][SPARK-22674][PYTHON] Removed namedtuple hack and made cloudpickle the default serializer

2020-09-23 Thread GitBox
SparkQA removed a comment on pull request #29851: URL: https://github.com/apache/spark/pull/29851#issuecomment-697304553 **[Test build #129027 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129027/testReport)** for PR 29851 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #25840: [SPARK-29166][SQL] Add parameters to limit the number of dynamic partitions for data source table

2020-09-23 Thread GitBox
AmplabJenkins commented on pull request #25840: URL: https://github.com/apache/spark/pull/25840#issuecomment-697335325 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29852: [SPARK-21481][ML][FOLLOWUP][Trivial] HashingTF use util.collection.OpenHashMap instead of mutable.HashMap

2020-09-23 Thread GitBox
AmplabJenkins removed a comment on pull request #29852: URL: https://github.com/apache/spark/pull/29852#issuecomment-697348502 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] SparkQA removed a comment on pull request #29852: [SPARK-21481][ML][FOLLOWUP][Trivial] HashingTF use util.collection.OpenHashMap instead of mutable.HashMap

2020-09-23 Thread GitBox
SparkQA removed a comment on pull request #29852: URL: https://github.com/apache/spark/pull/29852#issuecomment-697307454 **[Test build #129028 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129028/testReport)** for PR 29852 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #25290: [SPARK-28551][SQL]Add a Checker Rule when CTAS SQL with LOCATION

2020-09-23 Thread GitBox
AmplabJenkins commented on pull request #25290: URL: https://github.com/apache/spark/pull/25290#issuecomment-697362539 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29286: [WIP][SPARK-21708][BUILD] Migrate build to sbt 1.x

2020-09-23 Thread GitBox
AmplabJenkins removed a comment on pull request #29286: URL: https://github.com/apache/spark/pull/29286#issuecomment-697551065 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29286: [WIP][SPARK-21708][BUILD] Migrate build to sbt 1.x

2020-09-23 Thread GitBox
AmplabJenkins commented on pull request #29286: URL: https://github.com/apache/spark/pull/29286#issuecomment-697551065 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins commented on pull request #29806: [SPARK-32187][PYTHON][DOCS] Doc on Python packaging

2020-09-23 Thread GitBox
AmplabJenkins commented on pull request #29806: URL: https://github.com/apache/spark/pull/29806#issuecomment-697574410 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] HyukjinKwon commented on pull request #29806: [SPARK-32187][PYTHON][DOCS] Doc on Python packaging

2020-09-23 Thread GitBox
HyukjinKwon commented on pull request #29806: URL: https://github.com/apache/spark/pull/29806#issuecomment-697574364 Thanks @fhoering for proofreading :-) This is an automated message from the Apache Git Service. To respond

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29806: [SPARK-32187][PYTHON][DOCS] Doc on Python packaging

2020-09-23 Thread GitBox
AmplabJenkins removed a comment on pull request #29806: URL: https://github.com/apache/spark/pull/29806#issuecomment-697574410 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] HyukjinKwon commented on pull request #29846: [SPARK-32971][K8S] Support dynamic PVC creation/deletion for K8s executors

2020-09-23 Thread GitBox
HyukjinKwon commented on pull request #29846: URL: https://github.com/apache/spark/pull/29846#issuecomment-697592931 Thanks @holdenk and @dongjoon-hyun. This is an automated message from the Apache Git Service. To respond to

[GitHub] [spark] SparkQA removed a comment on pull request #29806: [SPARK-32187][PYTHON][DOCS] Doc on Python packaging

2020-09-23 Thread GitBox
SparkQA removed a comment on pull request #29806: URL: https://github.com/apache/spark/pull/29806#issuecomment-697572196 **[Test build #129036 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129036/testReport)** for PR 29806 at commit

[GitHub] [spark] SparkQA commented on pull request #29806: [SPARK-32187][PYTHON][DOCS] Doc on Python packaging

2020-09-23 Thread GitBox
SparkQA commented on pull request #29806: URL: https://github.com/apache/spark/pull/29806#issuecomment-697620145 **[Test build #129036 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129036/testReport)** for PR 29806 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #29806: [SPARK-32187][PYTHON][DOCS] Doc on Python packaging

2020-09-23 Thread GitBox
AmplabJenkins commented on pull request #29806: URL: https://github.com/apache/spark/pull/29806#issuecomment-697621801 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #29852: [SPARK-21481][ML][FOLLOWUP][Trivial] HashingTF use util.collection.OpenHashMap instead of mutable.HashMap

2020-09-23 Thread GitBox
SparkQA commented on pull request #29852: URL: https://github.com/apache/spark/pull/29852#issuecomment-697307454 **[Test build #129028 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129028/testReport)** for PR 29852 at commit

[GitHub] [spark] SparkQA commented on pull request #29850: [SPARK-32974][ML] FeatureHasher transform optimization

2020-09-23 Thread GitBox
SparkQA commented on pull request #29850: URL: https://github.com/apache/spark/pull/29850#issuecomment-697313341 **[Test build #129024 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129024/testReport)** for PR 29850 at commit

[GitHub] [spark] SparkQA commented on pull request #29851: [WIP][SPARK-22674][PYTHON] Removed namedtuple hack and made cloudpickle the default serializer

2020-09-23 Thread GitBox
SparkQA commented on pull request #29851: URL: https://github.com/apache/spark/pull/29851#issuecomment-697313456 **[Test build #129029 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129029/testReport)** for PR 29851 at commit

[GitHub] [spark] SparkQA commented on pull request #29848: [MINOR][SQL][3.0] Improve examples for `percentile_approx()`

2020-09-23 Thread GitBox
SparkQA commented on pull request #29848: URL: https://github.com/apache/spark/pull/29848#issuecomment-697321959 **[Test build #129016 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129016/testReport)** for PR 29848 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #29848: [MINOR][SQL][3.0] Improve examples for `percentile_approx()`

2020-09-23 Thread GitBox
SparkQA removed a comment on pull request #29848: URL: https://github.com/apache/spark/pull/29848#issuecomment-697184829 **[Test build #129016 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129016/testReport)** for PR 29848 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #25290: [SPARK-28551][SQL]Add a Checker Rule when CTAS SQL with LOCATION

2020-09-23 Thread GitBox
AmplabJenkins removed a comment on pull request #25290: URL: https://github.com/apache/spark/pull/25290#issuecomment-697362539 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] srowen commented on pull request #29286: [WIP][SPARK-21708][BUILD] Migrate build to sbt 1.x

2020-09-23 Thread GitBox
srowen commented on pull request #29286: URL: https://github.com/apache/spark/pull/29286#issuecomment-697545434 Jenkins retest this please This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [spark] fhoering commented on pull request #29806: [SPARK-32187][PYTHON][DOCS] Doc on Python packaging

2020-09-23 Thread GitBox
fhoering commented on pull request #29806: URL: https://github.com/apache/spark/pull/29806#issuecomment-697567806 OK. I re-pushed some minor changes. This is an automated message from the Apache Git Service. To respond to

[GitHub] [spark] HyukjinKwon commented on pull request #29833: [SPARK-32886][SPARK-31882][WEBUI][2.4] fix 'undefined' link in event timeline view

2020-09-23 Thread GitBox
HyukjinKwon commented on pull request #29833: URL: https://github.com/apache/spark/pull/29833#issuecomment-697306644 `StreamingLogisticRegressionWithSGDTests.test_training_and_prediction` is a known flaky test. I am pretty confident we can ignore it here.

[GitHub] [spark] HyukjinKwon commented on pull request #29833: [SPARK-32886][SPARK-31882][WEBUI][2.4] fix 'undefined' link in event timeline view

2020-09-23 Thread GitBox
HyukjinKwon commented on pull request #29833: URL: https://github.com/apache/spark/pull/29833#issuecomment-697306383 I think we can just use GitHub Actions build. GitHub Actions build was ported back to replace Jenkins build in branch-2.4. It is flaky on few specific machines in

[GitHub] [spark] AmplabJenkins commented on pull request #29850: [SPARK-32974][ML] FeatureHasher transform optimization

2020-09-23 Thread GitBox
AmplabJenkins commented on pull request #29850: URL: https://github.com/apache/spark/pull/29850#issuecomment-697311920 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29850: [SPARK-32974][ML] FeatureHasher transform optimization

2020-09-23 Thread GitBox
AmplabJenkins removed a comment on pull request #29850: URL: https://github.com/apache/spark/pull/29850#issuecomment-697311920 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] zhengruifeng commented on pull request #29850: [SPARK-32974][ML] FeatureHasher transform optimization

2020-09-23 Thread GitBox
zhengruifeng commented on pull request #29850: URL: https://github.com/apache/spark/pull/29850#issuecomment-697312270 friendly ping @srowen @huaxingao @HyukjinKwon @MLnick This is an automated message from the Apache Git

[GitHub] [spark] SparkQA removed a comment on pull request #29756: [SPARK-32885][SS] Add DataStreamReader.table API

2020-09-23 Thread GitBox
SparkQA removed a comment on pull request #29756: URL: https://github.com/apache/spark/pull/29756#issuecomment-697201587 **[Test build #129020 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129020/testReport)** for PR 29756 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29824: [SPARK-32954][YARN][TEST][test-hadoop2.7][test-maven] Add jakarta.servlet-api test dependency to yarn module to avoid UTs badca

2020-09-23 Thread GitBox
AmplabJenkins removed a comment on pull request #29824: URL: https://github.com/apache/spark/pull/29824#issuecomment-697379263 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29824: [SPARK-32954][YARN][TEST][test-hadoop2.7][test-maven] Add jakarta.servlet-api test dependency to yarn module to avoid UTs badcase

2020-09-23 Thread GitBox
AmplabJenkins commented on pull request #29824: URL: https://github.com/apache/spark/pull/29824#issuecomment-697379263 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29591: [SPARK-32714][PYTHON] Initial pyspark-stubs port.

2020-09-23 Thread GitBox
AmplabJenkins removed a comment on pull request #29591: URL: https://github.com/apache/spark/pull/29591#issuecomment-697390632 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29591: [SPARK-32714][PYTHON] Initial pyspark-stubs port.

2020-09-23 Thread GitBox
AmplabJenkins commented on pull request #29591: URL: https://github.com/apache/spark/pull/29591#issuecomment-697390632 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #25840: [SPARK-29166][SQL] Add parameters to limit the number of dynamic partitions for data source table

2020-09-23 Thread GitBox
SparkQA commented on pull request #25840: URL: https://github.com/apache/spark/pull/25840#issuecomment-697533242 **[Test build #129030 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129030/testReport)** for PR 25840 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29848: [MINOR][SQL][3.0] Improve examples for `percentile_approx()`

2020-09-23 Thread GitBox
AmplabJenkins removed a comment on pull request #29848: URL: https://github.com/apache/spark/pull/29848#issuecomment-697323375 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29848: [MINOR][SQL][3.0] Improve examples for `percentile_approx()`

2020-09-23 Thread GitBox
AmplabJenkins commented on pull request #29848: URL: https://github.com/apache/spark/pull/29848#issuecomment-697323375 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #29756: [SPARK-32885][SS] Add DataStreamReader.table API

2020-09-23 Thread GitBox
SparkQA commented on pull request #29756: URL: https://github.com/apache/spark/pull/29756#issuecomment-697342961 **[Test build #129020 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129020/testReport)** for PR 29756 at commit

[GitHub] [spark] steveloughran commented on a change in pull request #29843: [WIP][SPARK-29250] Upgrade to Hadoop 3.2.1

2020-09-23 Thread GitBox
steveloughran commented on a change in pull request #29843: URL: https://github.com/apache/spark/pull/29843#discussion_r493564469 ## File path: resource-managers/yarn/src/main/scala/org/apache/spark/deploy/yarn/ApplicationMaster.scala ## @@ -308,7 +307,7 @@ private[spark]

[GitHub] [spark] steveloughran commented on pull request #29843: [WIP][SPARK-29250] Upgrade to Hadoop 3.2.1

2020-09-23 Thread GitBox
steveloughran commented on pull request #29843: URL: https://github.com/apache/spark/pull/29843#issuecomment-697352852 +expect a 3.2.2 before end of year. If there are regressions, that would be the place to get them addressed

[GitHub] [spark] SparkQA commented on pull request #29824: [SPARK-32954][YARN][TEST][test-hadoop2.7][test-maven] Add jakarta.servlet-api test dependency to yarn module to avoid UTs badcase

2020-09-23 Thread GitBox
SparkQA commented on pull request #29824: URL: https://github.com/apache/spark/pull/29824#issuecomment-697376701 **[Test build #129032 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129032/testReport)** for PR 29824 at commit

[GitHub] [spark] srowen commented on pull request #29824: [SPARK-32954][YARN][TEST][test-hadoop2.7][test-maven] Add jakarta.servlet-api test dependency to yarn module to avoid UTs badcase

2020-09-23 Thread GitBox
srowen commented on pull request #29824: URL: https://github.com/apache/spark/pull/29824#issuecomment-697376204 Jenkins retest this please This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [spark] SparkQA commented on pull request #29591: [SPARK-32714][PYTHON] Initial pyspark-stubs port.

2020-09-23 Thread GitBox
SparkQA commented on pull request #29591: URL: https://github.com/apache/spark/pull/29591#issuecomment-697399954 **[Test build #129033 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129033/testReport)** for PR 29591 at commit

[GitHub] [spark] dongjoon-hyun commented on pull request #29846: [SPARK-32971][K8S] Support dynamic PVC creation/deletion for K8s executors

2020-09-23 Thread GitBox
dongjoon-hyun commented on pull request #29846: URL: https://github.com/apache/spark/pull/29846#issuecomment-697540293 Thanks, @holdenk . This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29806: [SPARK-32187][PYTHON][DOCS] Doc on Python packaging

2020-09-23 Thread GitBox
AmplabJenkins removed a comment on pull request #29806: URL: https://github.com/apache/spark/pull/29806#issuecomment-697621801 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] SparkQA commented on pull request #29850: [SPARK-32974][ML] FeatureHasher transform optimization

2020-09-23 Thread GitBox
SparkQA commented on pull request #29850: URL: https://github.com/apache/spark/pull/29850#issuecomment-697311447 **[Test build #129023 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129023/testReport)** for PR 29850 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29851: [WIP][SPARK-22674][PYTHON] Removed namedtuple hack and made cloudpickle the default serializer

2020-09-23 Thread GitBox
AmplabJenkins removed a comment on pull request #29851: URL: https://github.com/apache/spark/pull/29851#issuecomment-697310367 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29054: [SPARK-32243][SQL]HiveSessionCatalog call super.makeFunctionExpression should show error message

2020-09-23 Thread GitBox
AmplabJenkins removed a comment on pull request #29054: URL: https://github.com/apache/spark/pull/29054#issuecomment-697310722 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] SparkQA removed a comment on pull request #29850: [SPARK-32974][ML] FeatureHasher transform optimization

2020-09-23 Thread GitBox
SparkQA removed a comment on pull request #29850: URL: https://github.com/apache/spark/pull/29850#issuecomment-697279840 **[Test build #129023 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129023/testReport)** for PR 29850 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #25840: [SPARK-29166][SQL] Add parameters to limit the number of dynamic partitions for data source table

2020-09-23 Thread GitBox
SparkQA removed a comment on pull request #25840: URL: https://github.com/apache/spark/pull/25840#issuecomment-697334484 **[Test build #129030 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129030/testReport)** for PR 25840 at commit

[GitHub] [spark] dongjoon-hyun commented on pull request #29846: [SPARK-32971][K8S] Support dynamic PVC creation/deletion for K8s executors

2020-09-23 Thread GitBox
dongjoon-hyun commented on pull request #29846: URL: https://github.com/apache/spark/pull/29846#issuecomment-697533646 The decomission failure is irrelevant to this one. ```suggestion KubernetesSuite: - Run SparkPi with no resources - Run SparkPi with a very long application

[GitHub] [spark] AmplabJenkins commented on pull request #25840: [SPARK-29166][SQL] Add parameters to limit the number of dynamic partitions for data source table

2020-09-23 Thread GitBox
AmplabJenkins commented on pull request #25840: URL: https://github.com/apache/spark/pull/25840#issuecomment-697535821 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] RussellSpitzer commented on pull request #29853: [SPARK-SQL][SPARK-32977] Fix JavaDoc on Default Save Mode

2020-09-23 Thread GitBox
RussellSpitzer commented on pull request #29853: URL: https://github.com/apache/spark/pull/29853#issuecomment-697647382 @brkyvz - Could you please review this re: our conversation a while back :) This is an automated

[GitHub] [spark] holdenk commented on pull request #29591: [SPARK-32714][PYTHON] Initial pyspark-stubs port.

2020-09-23 Thread GitBox
holdenk commented on pull request #29591: URL: https://github.com/apache/spark/pull/29591#issuecomment-697823681 LGTM pending passing both GHA and Jenkins. This is an automated message from the Apache Git Service. To respond

[GitHub] [spark] cchighman commented on pull request #28841: [SPARK-31962][SQL] Provide modifiedAfter and modifiedBefore options when filtering from a batch-based file data source

2020-09-23 Thread GitBox
cchighman commented on pull request #28841: URL: https://github.com/apache/spark/pull/28841#issuecomment-697913023 @gengliangwang @cloud-fan @zsxwing @maropu @HyukjinKwon @HeartSaVioR @Dooyoung-Hwang Gentle ping. I summarized key feedback above from the review on this PR to help

  1   2   3   4   5   6   7   8   >