[GitHub] [spark] izchen commented on a change in pull request #29028: [SPARK-32212][CORE]RDD.takeOrdered can choose to merge intermediate r…

2020-07-10 Thread GitBox
izchen commented on a change in pull request #29028: URL: https://github.com/apache/spark/pull/29028#discussion_r452858008 ## File path: core/src/main/scala/org/apache/spark/rdd/RDD.scala ## @@ -1509,22 +1509,26 @@ abstract class RDD[T: ClassTag]( * @return an array of top

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28898: [SPARK-32059][SQL] Allow nested schema pruning thru window/sort/filter plans

2020-07-10 Thread GitBox
AmplabJenkins removed a comment on pull request #28898: URL: https://github.com/apache/spark/pull/28898#issuecomment-656696370 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] SparkQA commented on pull request #29064: [SPARK-32272][SQL] Add and extend SQL standard command SET TIME ZONE

2020-07-10 Thread GitBox
SparkQA commented on pull request #29064: URL: https://github.com/apache/spark/pull/29064#issuecomment-656704567 **[Test build #125602 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125602/testReport)** for PR 29064 at commit

[GitHub] [spark] Ngone51 commented on pull request #28924: [SPARK-32091][CORE] Ignore timeout error when remove blocks on the lost executor

2020-07-10 Thread GitBox
Ngone51 commented on pull request #28924: URL: https://github.com/apache/spark/pull/28924#issuecomment-656721945 thanks all!!

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29002: [SPARK-32175][CORE] Fix the order between initialization for ExecutorPlugin and starting heartbeat thread

2020-07-10 Thread GitBox
AmplabJenkins removed a comment on pull request #29002: URL: https://github.com/apache/spark/pull/29002#issuecomment-656725013

[GitHub] [spark] SparkQA removed a comment on pull request #28998: [SPARK-32173][SQL] Deduplicate code in FromUTCTimestamp and ToUTCTimestamp

2020-07-10 Thread GitBox
SparkQA removed a comment on pull request #28998: URL: https://github.com/apache/spark/pull/28998#issuecomment-656522077 **[Test build #125569 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125569/testReport)** for PR 28998 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #29002: [SPARK-32175][CORE] Fix the order between initialization for ExecutorPlugin and starting heartbeat thread

2020-07-10 Thread GitBox
AmplabJenkins commented on pull request #29002: URL: https://github.com/apache/spark/pull/29002#issuecomment-656725013

[GitHub] [spark] AmplabJenkins commented on pull request #27428: [SPARK-30276][SQL] Support Filter expression allows simultaneous use of DISTINCT

2020-07-10 Thread GitBox
AmplabJenkins commented on pull request #27428: URL: https://github.com/apache/spark/pull/27428#issuecomment-656746877

[GitHub] [spark] AmplabJenkins removed a comment on pull request #27428: [SPARK-30276][SQL] Support Filter expression allows simultaneous use of DISTINCT

2020-07-10 Thread GitBox
AmplabJenkins removed a comment on pull request #27428: URL: https://github.com/apache/spark/pull/27428#issuecomment-656746877 Merged build finished. Test FAILed.

[GitHub] [spark] HyukjinKwon closed pull request #29059: [SPARK-32256][SQL][test-hadoop2.7] Force to initialize Hadoop VersionInfo in HiveExternalCatalog

2020-07-10 Thread GitBox
HyukjinKwon closed pull request #29059: URL: https://github.com/apache/spark/pull/29059

[GitHub] [spark] HyukjinKwon commented on pull request #29059: [SPARK-32256][SQL][test-hadoop2.7] Force to initialize Hadoop VersionInfo in HiveExternalCatalog

2020-07-10 Thread GitBox
HyukjinKwon commented on pull request #29059: URL: https://github.com/apache/spark/pull/29059#issuecomment-656644231 Merged to master and branch-3.0.

[GitHub] [spark] SparkQA commented on pull request #28924: [SPARK-32091][CORE] Ignore timeout error when remove blocks on the lost executor

2020-07-10 Thread GitBox
SparkQA commented on pull request #28924: URL: https://github.com/apache/spark/pull/28924#issuecomment-656644069 **[Test build #125578 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125578/testReport)** for PR 28924 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #28924: [SPARK-32091][CORE] Ignore timeout error when remove blocks on the lost executor

2020-07-10 Thread GitBox
SparkQA removed a comment on pull request #28924: URL: https://github.com/apache/spark/pull/28924#issuecomment-656533667 **[Test build #125578 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125578/testReport)** for PR 28924 at commit

[GitHub] [spark] cloud-fan commented on a change in pull request #27366: [SPARK-30648][SQL] Support filters pushdown in JSON datasource

2020-07-10 Thread GitBox
cloud-fan commented on a change in pull request #27366: URL: https://github.com/apache/spark/pull/27366#discussion_r452835950 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/json/JsonFilters.scala ## @@ -0,0 +1,131 @@ +/* + * Licensed to the Apache

[GitHub] [spark] SparkQA commented on pull request #29053: [SPARK-32241][SQL] Remove empty children of union

2020-07-10 Thread GitBox
SparkQA commented on pull request #29053: URL: https://github.com/apache/spark/pull/29053#issuecomment-656675794 **[Test build #125610 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125610/testReport)** for PR 29053 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29053: [SPARK-32241][SQL] Remove empty children of union

2020-07-10 Thread GitBox
AmplabJenkins removed a comment on pull request #29053: URL: https://github.com/apache/spark/pull/29053#issuecomment-656228109 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] peter-toth commented on a change in pull request #29053: [SPARK-32241][SQL] Remove empty children of union

2020-07-10 Thread GitBox
peter-toth commented on a change in pull request #29053: URL: https://github.com/apache/spark/pull/29053#discussion_r452848538 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/PropagateEmptyRelation.scala ## @@ -50,8 +50,25 @@ object

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28412: [SPARK-31608][CORE][WEBUI] Add a new type of KVStore to make loading UI faster

2020-07-10 Thread GitBox
AmplabJenkins removed a comment on pull request #28412: URL: https://github.com/apache/spark/pull/28412#issuecomment-656689382 Merged build finished. Test FAILed.

[GitHub] [spark] AmplabJenkins commented on pull request #28412: [SPARK-31608][CORE][WEBUI] Add a new type of KVStore to make loading UI faster

2020-07-10 Thread GitBox
AmplabJenkins commented on pull request #28412: URL: https://github.com/apache/spark/pull/28412#issuecomment-656689382

[GitHub] [spark] SparkQA commented on pull request #28898: [SPARK-32059][SQL] Allow nested schema pruning thru window/sort/filter plans

2020-07-10 Thread GitBox
SparkQA commented on pull request #28898: URL: https://github.com/apache/spark/pull/28898#issuecomment-656695838 **[Test build #125617 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125617/testReport)** for PR 28898 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #27366: [SPARK-30648][SQL] Support filters pushdown in JSON datasource

2020-07-10 Thread GitBox
AmplabJenkins removed a comment on pull request #27366: URL: https://github.com/apache/spark/pull/27366#issuecomment-656699666

[GitHub] [spark] guykhazma commented on pull request #28826: [SPARK-31988][SQL] Schema pruning may discard attribute metadata

2020-07-10 Thread GitBox
guykhazma commented on pull request #28826: URL: https://github.com/apache/spark/pull/28826#issuecomment-656699933 @viirya @maropu any comments?

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28746: [SPARK-31922][CORE] Fix "RpcEnv already stopped" error when exit spark-shell with local-cluster mode

2020-07-10 Thread GitBox
AmplabJenkins removed a comment on pull request #28746: URL: https://github.com/apache/spark/pull/28746#issuecomment-656228120 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] SparkQA commented on pull request #28746: [SPARK-31922][CORE] Fix "RpcEnv already stopped" error when exit spark-shell with local-cluster mode

2020-07-10 Thread GitBox
SparkQA commented on pull request #28746: URL: https://github.com/apache/spark/pull/28746#issuecomment-656709416 **[Test build #125619 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125619/testReport)** for PR 28746 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #29053: [SPARK-32241][SQL] Remove empty children of union

2020-07-10 Thread GitBox
SparkQA removed a comment on pull request #29053: URL: https://github.com/apache/spark/pull/29053#issuecomment-656675794 **[Test build #125610 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125610/testReport)** for PR 29053 at commit

[GitHub] [spark] SparkQA commented on pull request #29053: [SPARK-32241][SQL] Remove empty children of union

2020-07-10 Thread GitBox
SparkQA commented on pull request #29053: URL: https://github.com/apache/spark/pull/29053#issuecomment-656718527 **[Test build #125610 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125610/testReport)** for PR 29053 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #29035: [SPARK-32220][SQL]SHUFFLE_REPLICATE_NL Hint should not change Non-Cartesian Product join result

2020-07-10 Thread GitBox
AmplabJenkins commented on pull request #29035: URL: https://github.com/apache/spark/pull/29035#issuecomment-656733347

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29035: [SPARK-32220][SQL]SHUFFLE_REPLICATE_NL Hint should not change Non-Cartesian Product join result

2020-07-10 Thread GitBox
AmplabJenkins removed a comment on pull request #29035: URL: https://github.com/apache/spark/pull/29035#issuecomment-656733347 Merged build finished. Test FAILed.

[GitHub] [spark] dongjoon-hyun commented on pull request #29055: [SPARK-32251][SQL][DOCS][TESTS] Fix SQL keyword document

2020-07-10 Thread GitBox
dongjoon-hyun commented on pull request #29055: URL: https://github.com/apache/spark/pull/29055#issuecomment-656733578 Retest this please

[GitHub] [spark] peter-toth commented on a change in pull request #29053: [SPARK-32241][SQL] Remove empty children of union

2020-07-10 Thread GitBox
peter-toth commented on a change in pull request #29053: URL: https://github.com/apache/spark/pull/29053#discussion_r452913572 ## File path: sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/optimizer/PropagateEmptyRelationSuite.scala ## @@ -67,6 +68,28 @@ class

[GitHub] [spark] AmplabJenkins commented on pull request #28746: [SPARK-31922][CORE] Fix "RpcEnv already stopped" error when exit spark-shell with local-cluster mode

2020-07-10 Thread GitBox
AmplabJenkins commented on pull request #28746: URL: https://github.com/apache/spark/pull/28746#issuecomment-656742134

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28746: [SPARK-31922][CORE] Fix "RpcEnv already stopped" error when exit spark-shell with local-cluster mode

2020-07-10 Thread GitBox
AmplabJenkins removed a comment on pull request #28746: URL: https://github.com/apache/spark/pull/28746#issuecomment-656742134

[GitHub] [spark] SparkQA commented on pull request #28746: [SPARK-31922][CORE] Fix "RpcEnv already stopped" error when exit spark-shell with local-cluster mode

2020-07-10 Thread GitBox
SparkQA commented on pull request #28746: URL: https://github.com/apache/spark/pull/28746#issuecomment-656741686 **[Test build #125624 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125624/testReport)** for PR 28746 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28926: [SPARK-32133][SQL] Forbid time field steps for date start/end in Sequence

2020-07-10 Thread GitBox
AmplabJenkins removed a comment on pull request #28926: URL: https://github.com/apache/spark/pull/28926#issuecomment-656749076 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29054: [SPARK-32243][SQL]HiveSessionCatalog call super.makeFunctionExpression should show error message

2020-07-10 Thread GitBox
AmplabJenkins removed a comment on pull request #29054: URL: https://github.com/apache/spark/pull/29054#issuecomment-656751570 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29054: [SPARK-32243][SQL]HiveSessionCatalog call super.makeFunctionExpression should show error message

2020-07-10 Thread GitBox
AmplabJenkins removed a comment on pull request #29054: URL: https://github.com/apache/spark/pull/29054#issuecomment-656770267 Merged build finished. Test FAILed.

[GitHub] [spark] SparkQA commented on pull request #29057: [SPARK-32245][INFRA] Run Spark tests in Github Actions

2020-07-10 Thread GitBox
SparkQA commented on pull request #29057: URL: https://github.com/apache/spark/pull/29057#issuecomment-656770513 **[Test build #125628 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125628/testReport)** for PR 29057 at commit

[GitHub] [spark] SparkQA commented on pull request #28969: [SPARK-32150][BUILD] Upgrade to ZStd 1.4.5-4

2020-07-10 Thread GitBox
SparkQA commented on pull request #28969: URL: https://github.com/apache/spark/pull/28969#issuecomment-656790188 **[Test build #125607 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125607/testReport)** for PR 28969 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #28969: [SPARK-32150][BUILD] Upgrade to ZStd 1.4.5-4

2020-07-10 Thread GitBox
AmplabJenkins commented on pull request #28969: URL: https://github.com/apache/spark/pull/28969#issuecomment-656792361

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28969: [SPARK-32150][BUILD] Upgrade to ZStd 1.4.5-4

2020-07-10 Thread GitBox
AmplabJenkins removed a comment on pull request #28969: URL: https://github.com/apache/spark/pull/28969#issuecomment-656792361 Merged build finished. Test FAILed.

[GitHub] [spark] SparkQA commented on pull request #28969: [SPARK-32150][BUILD] Upgrade to ZStd 1.4.5-4

2020-07-10 Thread GitBox
SparkQA commented on pull request #28969: URL: https://github.com/apache/spark/pull/28969#issuecomment-656796734 **[Test build #125629 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125629/testReport)** for PR 28969 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #28904: [SPARK-30462][SS] Streamline the logic on file stream source and sink metadata log to avoid memory issue

2020-07-10 Thread GitBox
SparkQA removed a comment on pull request #28904: URL: https://github.com/apache/spark/pull/28904#issuecomment-656625881 **[Test build #125603 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125603/testReport)** for PR 28904 at commit

[GitHub] [spark] SparkQA commented on pull request #28904: [SPARK-30462][SS] Streamline the logic on file stream source and sink metadata log to avoid memory issue

2020-07-10 Thread GitBox
SparkQA commented on pull request #28904: URL: https://github.com/apache/spark/pull/28904#issuecomment-656801641 **[Test build #125603 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125603/testReport)** for PR 28904 at commit

[GitHub] [spark] SparkQA commented on pull request #28977: [WIP] Add all hive.execution suite in the parallel test group

2020-07-10 Thread GitBox
SparkQA commented on pull request #28977: URL: https://github.com/apache/spark/pull/28977#issuecomment-656806883 **[Test build #125608 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125608/testReport)** for PR 28977 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #28287: [SPARK-31418][SCHEDULER] Request more executors in case of dynamic allocation is enabled and a task becomes unschedulable due to spark'

2020-07-10 Thread GitBox
AmplabJenkins commented on pull request #28287: URL: https://github.com/apache/spark/pull/28287#issuecomment-656806574

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28287: [SPARK-31418][SCHEDULER] Request more executors in case of dynamic allocation is enabled and a task becomes unschedulable due t

2020-07-10 Thread GitBox
AmplabJenkins removed a comment on pull request #28287: URL: https://github.com/apache/spark/pull/28287#issuecomment-656806574 Merged build finished. Test FAILed.

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29063: [SPARK-32270][SQL] Use TextFileFormat in CSV's schema inference with a different encoding

2020-07-10 Thread GitBox
AmplabJenkins removed a comment on pull request #29063: URL: https://github.com/apache/spark/pull/29063#issuecomment-656806021 Merged build finished. Test FAILed.

[GitHub] [spark] dongjoon-hyun commented on pull request #29050: [SPARK-32238][SQL] Use Utils.getSimpleName to avoid hitting Malformed class name in ScalaUDF

2020-07-10 Thread GitBox
dongjoon-hyun commented on pull request #29050: URL: https://github.com/apache/spark/pull/29050#issuecomment-656811295 Retest this please.

[GitHub] [spark] dongjoon-hyun closed pull request #28926: [SPARK-32133][SQL] Forbid time field steps for date start/end in Sequence

2020-07-10 Thread GitBox
dongjoon-hyun closed pull request #28926: URL: https://github.com/apache/spark/pull/28926

[GitHub] [spark] SparkQA removed a comment on pull request #28412: [SPARK-31608][CORE][WEBUI] Add a new type of KVStore to make loading UI faster

2020-07-10 Thread GitBox
SparkQA removed a comment on pull request #28412: URL: https://github.com/apache/spark/pull/28412#issuecomment-656693014 **[Test build #125616 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125616/testReport)** for PR 28412 at commit

[GitHub] [spark] srowen commented on pull request #28850: [SPARK-32015][Core]Remote inheritable thread local variables after spark context is stopped

2020-07-10 Thread GitBox
srowen commented on pull request #28850: URL: https://github.com/apache/spark/pull/28850#issuecomment-656820906 I dunno, this just feels pretty hacky and like it will break things. Is there no other way to avoid holding the reference with soft references, better cleanup? I think this is

[GitHub] [spark] AmplabJenkins removed a comment on pull request #27988: [SPARK-31226][CORE][TEST] SizeBasedCoalesce logic will lose partition

2020-07-10 Thread GitBox
AmplabJenkins removed a comment on pull request #27988: URL: https://github.com/apache/spark/pull/27988#issuecomment-656820309 Merged build finished. Test FAILed.

[GitHub] [spark] AmplabJenkins commented on pull request #27988: [SPARK-31226][CORE][TEST] SizeBasedCoalesce logic will lose partition

2020-07-10 Thread GitBox
AmplabJenkins commented on pull request #27988: URL: https://github.com/apache/spark/pull/27988#issuecomment-656820309

[GitHub] [spark] AmplabJenkins removed a comment on pull request #27988: [SPARK-31226][CORE][TEST] SizeBasedCoalesce logic will lose partition

2020-07-10 Thread GitBox
AmplabJenkins removed a comment on pull request #27988: URL: https://github.com/apache/spark/pull/27988#issuecomment-656820311 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] viirya commented on a change in pull request #28957: [SPARK-32138] Drop Python 2.7, 3.4 and 3.5

2020-07-10 Thread GitBox
viirya commented on a change in pull request #28957: URL: https://github.com/apache/spark/pull/28957#discussion_r453018479 ## File path: dev/run-tests.py ## @@ -649,7 +649,7 @@ def main(): # if "DOCS" in changed_modules and test_env == "amplab_jenkins": #

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28746: [SPARK-31922][CORE] Fix "RpcEnv already stopped" error when exit spark-shell with local-cluster mode

2020-07-10 Thread GitBox
AmplabJenkins removed a comment on pull request #28746: URL: https://github.com/apache/spark/pull/28746#issuecomment-656849147 Merged build finished. Test FAILed.

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28746: [SPARK-31922][CORE] Fix "RpcEnv already stopped" error when exit spark-shell with local-cluster mode

2020-07-10 Thread GitBox
AmplabJenkins removed a comment on pull request #28746: URL: https://github.com/apache/spark/pull/28746#issuecomment-656849154 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29069: [SPARK-31831][SQL][TESTS] Use constructor for mock in HiveSessionImplSuite

2020-07-10 Thread GitBox
AmplabJenkins removed a comment on pull request #29069: URL: https://github.com/apache/spark/pull/29069#issuecomment-656848455 Can one of the admins verify this patch?

[GitHub] [spark] AmplabJenkins commented on pull request #28746: [SPARK-31922][CORE] Fix "RpcEnv already stopped" error when exit spark-shell with local-cluster mode

2020-07-10 Thread GitBox
AmplabJenkins commented on pull request #28746: URL: https://github.com/apache/spark/pull/28746#issuecomment-656849147

[GitHub] [spark] AmplabJenkins commented on pull request #29069: [SPARK-31831][SQL][TESTS] Use constructor for mock in HiveSessionImplSuite

2020-07-10 Thread GitBox
AmplabJenkins commented on pull request #29069: URL: https://github.com/apache/spark/pull/29069#issuecomment-656848901 Can one of the admins verify this patch?

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29068: [SPARK-27892][ML]Saving/loading stages in PipelineModel are parallel

2020-07-10 Thread GitBox
AmplabJenkins removed a comment on pull request #29068: URL: https://github.com/apache/spark/pull/29068#issuecomment-656845987 Can one of the admins verify this patch?

[GitHub] [spark] viirya commented on pull request #28957: [SPARK-32138] Drop Python 2.7, 3.4 and 3.5

2020-07-10 Thread GitBox
viirya commented on pull request #28957: URL: https://github.com/apache/spark/pull/28957#issuecomment-656856638 Looks good overall.

[GitHub] [spark] AmplabJenkins commented on pull request #28931: [SPARK-32103][YARN] Handle IPv6 host/port split in YarnRMClient

2020-07-10 Thread GitBox
AmplabJenkins commented on pull request #28931: URL: https://github.com/apache/spark/pull/28931#issuecomment-656856757

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29067: [SPARK-32274] Make SQL cache serialization pluggable

2020-07-10 Thread GitBox
AmplabJenkins removed a comment on pull request #29067: URL: https://github.com/apache/spark/pull/29067#issuecomment-656862071 Merged build finished. Test FAILed.

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29067: [SPARK-32274] Make SQL cache serialization pluggable

2020-07-10 Thread GitBox
AmplabJenkins removed a comment on pull request #29067: URL: https://github.com/apache/spark/pull/29067#issuecomment-656862075 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] SparkQA removed a comment on pull request #29067: [SPARK-32274] Make SQL cache serialization pluggable

2020-07-10 Thread GitBox
SparkQA removed a comment on pull request #29067: URL: https://github.com/apache/spark/pull/29067#issuecomment-656818843 **[Test build #125633 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125633/testReport)** for PR 29067 at commit

[GitHub] [spark] viirya commented on a change in pull request #28996: [SPARK-29358][SQL] Make unionByName optionally fill missing columns with nulls

2020-07-10 Thread GitBox
viirya commented on a change in pull request #28996: URL: https://github.com/apache/spark/pull/28996#discussion_r453054212 ## File path: sql/core/src/main/scala/org/apache/spark/sql/Dataset.scala ## @@ -2030,7 +2030,25 @@ class Dataset[T] private[sql]( * @group typedrel

[GitHub] [spark] dongjoon-hyun commented on pull request #29044: [SPARK-32227] Fix regression bug in load-spark-env.cmd with Spark 3.0.0

2020-07-10 Thread GitBox
dongjoon-hyun commented on pull request #29044: URL: https://github.com/apache/spark/pull/29044#issuecomment-656758003 It's directly relevant to this PR because your patch is changing `environment` variable. - Please see this for the detail (https://github.com/cdarlint/winutils) -

[GitHub] [spark] dongjoon-hyun commented on pull request #29044: [WIP][SPARK-32227] Fix regression bug in load-spark-env.cmd with Spark 3.0.0

2020-07-10 Thread GitBox
dongjoon-hyun commented on pull request #29044: URL: https://github.com/apache/spark/pull/29044#issuecomment-656758336 Please remove `[WIP]` from the title when AppVeyor passes on Windows. Thanks.

[GitHub] [spark] AmplabJenkins commented on pull request #29057: [SPARK-32245][INFRA] Run Spark tests in Github Actions

2020-07-10 Thread GitBox
AmplabJenkins commented on pull request #29057: URL: https://github.com/apache/spark/pull/29057#issuecomment-656772419

[GitHub] [spark] SparkQA removed a comment on pull request #28969: [SPARK-32150][BUILD] Upgrade to ZStd 1.4.5-4

2020-07-10 Thread GitBox
SparkQA removed a comment on pull request #28969: URL: https://github.com/apache/spark/pull/28969#issuecomment-656650491 **[Test build #125607 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125607/testReport)** for PR 28969 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #28967: [SPARK-32149][SHUFFLE] Improve file path name normalisation at block resolution within the external shuffle service

2020-07-10 Thread GitBox
SparkQA removed a comment on pull request #28967: URL: https://github.com/apache/spark/pull/28967#issuecomment-656644537 **[Test build #125606 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125606/testReport)** for PR 28967 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29045: [SPARK-32234][SQL] Spark sql commands are failing on selecting the orc tables

2020-07-10 Thread GitBox
AmplabJenkins removed a comment on pull request #29045: URL: https://github.com/apache/spark/pull/29045#issuecomment-656802466 Merged build finished. Test FAILed.

[GitHub] [spark] AmplabJenkins commented on pull request #29045: [SPARK-32234][SQL] Spark sql commands are failing on selecting the orc tables

2020-07-10 Thread GitBox
AmplabJenkins commented on pull request #29045: URL: https://github.com/apache/spark/pull/29045#issuecomment-656802466

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28977: [WIP] Add all hive.execution suite in the parallel test group

2020-07-10 Thread GitBox
AmplabJenkins removed a comment on pull request #28977: URL: https://github.com/apache/spark/pull/28977#issuecomment-656808736 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] SparkQA commented on pull request #29053: [SPARK-32241][SQL] Remove empty children of union

2020-07-10 Thread GitBox
SparkQA commented on pull request #29053: URL: https://github.com/apache/spark/pull/29053#issuecomment-656809867 **[Test build #125630 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125630/testReport)** for PR 29053 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #29053: [SPARK-32241][SQL] Remove empty children of union

2020-07-10 Thread GitBox
AmplabJenkins commented on pull request #29053: URL: https://github.com/apache/spark/pull/29053#issuecomment-656809763

[GitHub] [spark] SparkQA removed a comment on pull request #27988: [SPARK-31226][CORE][TEST] SizeBasedCoalesce logic will lose partition

2020-07-10 Thread GitBox
SparkQA removed a comment on pull request #27988: URL: https://github.com/apache/spark/pull/27988#issuecomment-656682406 **[Test build #125615 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125615/testReport)** for PR 27988 at commit

[GitHub] [spark] SparkQA commented on pull request #27988: [SPARK-31226][CORE][TEST] SizeBasedCoalesce logic will lose partition

2020-07-10 Thread GitBox
SparkQA commented on pull request #27988: URL: https://github.com/apache/spark/pull/27988#issuecomment-656818779 **[Test build #125615 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125615/testReport)** for PR 27988 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #29067: [SPARK-32274] Make SQL cache serialization pluggable

2020-07-10 Thread GitBox
AmplabJenkins commented on pull request #29067: URL: https://github.com/apache/spark/pull/29067#issuecomment-656819397

[GitHub] [spark] tgravescs commented on pull request #28412: [SPARK-31608][CORE][WEBUI] Add a new type of KVStore to make loading UI faster

2020-07-10 Thread GitBox
tgravescs commented on pull request #28412: URL: https://github.com/apache/spark/pull/28412#issuecomment-656819037 test this please

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29020: [SPARK-23431][CORE] Expose stage level peak executor metrics via REST API

2020-07-10 Thread GitBox
AmplabJenkins removed a comment on pull request #29020: URL: https://github.com/apache/spark/pull/29020#issuecomment-656854737

[GitHub] [spark] AmplabJenkins commented on pull request #29020: [SPARK-23431][CORE] Expose stage level peak executor metrics via REST API

2020-07-10 Thread GitBox
AmplabJenkins commented on pull request #29020: URL: https://github.com/apache/spark/pull/29020#issuecomment-656854737

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29062: [SPARK-32237][SQL] Resolve hint in CTE

2020-07-10 Thread GitBox
AmplabJenkins removed a comment on pull request #29062: URL: https://github.com/apache/spark/pull/29062#issuecomment-656758782 Merged build finished. Test FAILed.

[GitHub] [spark] AmplabJenkins commented on pull request #29062: [SPARK-32237][SQL] Resolve hint in CTE

2020-07-10 Thread GitBox
AmplabJenkins commented on pull request #29062: URL: https://github.com/apache/spark/pull/29062#issuecomment-656758782

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28904: [SPARK-30462][SS] Streamline the logic on file stream source and sink metadata log to avoid memory issue

2020-07-10 Thread GitBox
AmplabJenkins removed a comment on pull request #28904: URL: https://github.com/apache/spark/pull/28904#issuecomment-656803694 Merged build finished. Test FAILed.

[GitHub] [spark] AmplabJenkins commented on pull request #28904: [SPARK-30462][SS] Streamline the logic on file stream source and sink metadata log to avoid memory issue

2020-07-10 Thread GitBox
AmplabJenkins commented on pull request #28904: URL: https://github.com/apache/spark/pull/28904#issuecomment-656803694

[GitHub] [spark] peter-toth commented on a change in pull request #29053: [SPARK-32241][SQL] Remove empty children of union

2020-07-10 Thread GitBox
peter-toth commented on a change in pull request #29053: URL: https://github.com/apache/spark/pull/29053#discussion_r452992807 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/Optimizer.scala ## @@ -161,7 +161,8 @@ abstract class

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29050: [SPARK-32238][SQL] Use Utils.getSimpleName to avoid hitting Malformed class name in ScalaUDF

2020-07-10 Thread GitBox
AmplabJenkins removed a comment on pull request #29050: URL: https://github.com/apache/spark/pull/29050#issuecomment-656780136 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] AmplabJenkins commented on pull request #28412: [SPARK-31608][CORE][WEBUI] Add a new type of KVStore to make loading UI faster

2020-07-10 Thread GitBox
AmplabJenkins commented on pull request #28412: URL: https://github.com/apache/spark/pull/28412#issuecomment-656811985

[GitHub] [spark] SparkQA commented on pull request #28972: [SPARK-30794][CORE] Stage Level scheduling: Add ability to set off heap memory

2020-07-10 Thread GitBox
SparkQA commented on pull request #28972: URL: https://github.com/apache/spark/pull/28972#issuecomment-656812121 **[Test build #125611 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125611/testReport)** for PR 28972 at commit

[GitHub] [spark] viirya commented on a change in pull request #28996: [SPARK-29358][SQL] Make unionByName optionally fill missing columns with nulls

2020-07-10 Thread GitBox
viirya commented on a change in pull request #28996: URL: https://github.com/apache/spark/pull/28996#discussion_r453000366 ## File path: sql/core/src/main/scala/org/apache/spark/sql/Dataset.scala ## @@ -2030,7 +2030,25 @@ class Dataset[T] private[sql]( * @group typedrel

[GitHub] [spark] SparkQA commented on pull request #29066: [WIP][SPARK-23889] DataSourceV2: required sorting and clustering for writes

2020-07-10 Thread GitBox
SparkQA commented on pull request #29066: URL: https://github.com/apache/spark/pull/29066#issuecomment-656815748 **[Test build #125632 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125632/testReport)** for PR 29066 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29066: [WIP][SPARK-23889] DataSourceV2: required sorting and clustering for writes

2020-07-10 Thread GitBox
AmplabJenkins removed a comment on pull request #29066: URL: https://github.com/apache/spark/pull/29066#issuecomment-656822736

[GitHub] [spark] Moovlin opened a new pull request #29068: [SPARK-27892][ML]Saving/loading stages in PipelineModel are parallel

2020-07-10 Thread GitBox
Moovlin opened a new pull request #29068: URL: https://github.com/apache/spark/pull/29068 This was a dead simple change that I lightly tested to determine if there was actually a performance increase. Turns out, yes there is (at least locally). ### What changes were

[GitHub] [spark] SparkQA commented on pull request #29024: [WIP][SPARK-32001][SQL]Create JDBC authentication provider developer API

2020-07-10 Thread GitBox
SparkQA commented on pull request #29024: URL: https://github.com/apache/spark/pull/29024#issuecomment-656846044 **[Test build #125621 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125621/testReport)** for PR 29024 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #29068: [SPARK-27892][ML]Saving/loading stages in PipelineModel are parallel

2020-07-10 Thread GitBox
AmplabJenkins commented on pull request #29068: URL: https://github.com/apache/spark/pull/29068#issuecomment-656845987 Can one of the admins verify this patch?

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28931: [SPARK-32103][YARN] Handle IPv6 host/port split in YarnRMClient

2020-07-10 Thread GitBox
AmplabJenkins removed a comment on pull request #28931: URL: https://github.com/apache/spark/pull/28931#issuecomment-656856757

[GitHub] [spark] marmbrus commented on a change in pull request #28996: [SPARK-29358][SQL] Make unionByName optionally fill missing columns with nulls

2020-07-10 Thread GitBox
marmbrus commented on a change in pull request #28996: URL: https://github.com/apache/spark/pull/28996#discussion_r453046549 ## File path: sql/core/src/main/scala/org/apache/spark/sql/Dataset.scala ## @@ -2030,7 +2030,25 @@ class Dataset[T] private[sql]( * @group typedrel

[GitHub] [spark] SparkQA commented on pull request #29057: [SPARK-32245][INFRA] Run Spark tests in Github Actions

2020-07-10 Thread GitBox
SparkQA commented on pull request #29057: URL: https://github.com/apache/spark/pull/29057#issuecomment-656769009 **[Test build #125605 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125605/testReport)** for PR 29057 at commit
