date:20190320

[GitHub] [spark] AmplabJenkins commented on issue #24123: [SPARK-27184][CORE] Replace 'spark.jars' & 'spark.files' with the variables of this config object

2019-03-20 Thread GitBox

AmplabJenkins commented on issue #24123: [SPARK-27184][CORE] Replace 
'spark.jars' & 'spark.files' with the variables of this config object
URL: https://github.com/apache/spark/pull/24123#issuecomment-475120643
 
 
   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/103749/
   Test FAILed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins commented on issue #24123: [SPARK-27184][CORE] Replace 'spark.jars' & 'spark.files' with the variables of this config object

2019-03-20 Thread GitBox

AmplabJenkins commented on issue #24123: [SPARK-27184][CORE] Replace 
'spark.jars' & 'spark.files' with the variables of this config object
URL: https://github.com/apache/spark/pull/24123#issuecomment-475120642
 
 
   Merged build finished. Test FAILed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] SparkQA removed a comment on issue #24123: [SPARK-27184][CORE] Replace 'spark.jars' & 'spark.files' with the variables of this config object

2019-03-20 Thread GitBox

SparkQA removed a comment on issue #24123: [SPARK-27184][CORE] Replace 
'spark.jars' & 'spark.files' with the variables of this config object
URL: https://github.com/apache/spark/pull/24123#issuecomment-475087603
 
 
   **[Test build #103749 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/103749/testReport)**
 for PR 24123 at commit 
[`7a88f79`](https://github.com/apache/spark/commit/7a88f7982bb388913c5cf93a9f47a2bf1a96e745).


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] SparkQA commented on issue #24123: [SPARK-27184][CORE] Replace 'spark.jars' & 'spark.files' with the variables of this config object

2019-03-20 Thread GitBox

SparkQA commented on issue #24123: [SPARK-27184][CORE] Replace 'spark.jars' & 
'spark.files' with the variables of this config object
URL: https://github.com/apache/spark/pull/24123#issuecomment-475120444
 
 
   **[Test build #103749 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/103749/testReport)**
 for PR 24123 at commit 
[`7a88f79`](https://github.com/apache/spark/commit/7a88f7982bb388913c5cf93a9f47a2bf1a96e745).
* This patch **fails Spark unit tests**.
* This patch merges cleanly.
* This patch adds no public classes.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins commented on issue #24154: [SPARK-27210][SS] Cleanup incomplete output files in ManifestFileCommitProtocol if task is aborted

2019-03-20 Thread GitBox

AmplabJenkins commented on issue #24154: [SPARK-27210][SS] Cleanup incomplete 
output files in ManifestFileCommitProtocol if task is aborted
URL: https://github.com/apache/spark/pull/24154#issuecomment-475119556
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/103751/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins removed a comment on issue #24154: [SPARK-27210][SS] Cleanup incomplete output files in ManifestFileCommitProtocol if task is aborted

2019-03-20 Thread GitBox

AmplabJenkins removed a comment on issue #24154: [SPARK-27210][SS] Cleanup 
incomplete output files in ManifestFileCommitProtocol if task is aborted
URL: https://github.com/apache/spark/pull/24154#issuecomment-475119556
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/103751/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins commented on issue #24154: [SPARK-27210][SS] Cleanup incomplete output files in ManifestFileCommitProtocol if task is aborted

2019-03-20 Thread GitBox

AmplabJenkins commented on issue #24154: [SPARK-27210][SS] Cleanup incomplete 
output files in ManifestFileCommitProtocol if task is aborted
URL: https://github.com/apache/spark/pull/24154#issuecomment-475119555
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins removed a comment on issue #24154: [SPARK-27210][SS] Cleanup incomplete output files in ManifestFileCommitProtocol if task is aborted

2019-03-20 Thread GitBox

AmplabJenkins removed a comment on issue #24154: [SPARK-27210][SS] Cleanup 
incomplete output files in ManifestFileCommitProtocol if task is aborted
URL: https://github.com/apache/spark/pull/24154#issuecomment-475119555
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] SparkQA removed a comment on issue #24154: [SPARK-27210][SS] Cleanup incomplete output files in ManifestFileCommitProtocol if task is aborted

2019-03-20 Thread GitBox

SparkQA removed a comment on issue #24154: [SPARK-27210][SS] Cleanup incomplete 
output files in ManifestFileCommitProtocol if task is aborted
URL: https://github.com/apache/spark/pull/24154#issuecomment-475089697
 
 
   **[Test build #103751 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/103751/testReport)**
 for PR 24154 at commit 
[`8e12846`](https://github.com/apache/spark/commit/8e1284613244fc4d248b5189a5886d2304ae2457).


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] SparkQA commented on issue #24154: [SPARK-27210][SS] Cleanup incomplete output files in ManifestFileCommitProtocol if task is aborted

2019-03-20 Thread GitBox

SparkQA commented on issue #24154: [SPARK-27210][SS] Cleanup incomplete output 
files in ManifestFileCommitProtocol if task is aborted
URL: https://github.com/apache/spark/pull/24154#issuecomment-475119308
 
 
   **[Test build #103751 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/103751/testReport)**
 for PR 24154 at commit 
[`8e12846`](https://github.com/apache/spark/commit/8e1284613244fc4d248b5189a5886d2304ae2457).
* This patch passes all tests.
* This patch merges cleanly.
* This patch adds no public classes.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins removed a comment on issue #24158: [SPARK-26847][SQL] Pruning nested serializers from object serializers: MapType support

2019-03-20 Thread GitBox

AmplabJenkins removed a comment on issue #24158: [SPARK-26847][SQL] Pruning 
nested serializers from object serializers: MapType support
URL: https://github.com/apache/spark/pull/24158#issuecomment-475118887
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/103746/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins commented on issue #24158: [SPARK-26847][SQL] Pruning nested serializers from object serializers: MapType support

2019-03-20 Thread GitBox

AmplabJenkins commented on issue #24158: [SPARK-26847][SQL] Pruning nested 
serializers from object serializers: MapType support
URL: https://github.com/apache/spark/pull/24158#issuecomment-475118887
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/103746/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins removed a comment on issue #24158: [SPARK-26847][SQL] Pruning nested serializers from object serializers: MapType support

2019-03-20 Thread GitBox

AmplabJenkins removed a comment on issue #24158: [SPARK-26847][SQL] Pruning 
nested serializers from object serializers: MapType support
URL: https://github.com/apache/spark/pull/24158#issuecomment-475118884
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins commented on issue #24158: [SPARK-26847][SQL] Pruning nested serializers from object serializers: MapType support

2019-03-20 Thread GitBox

AmplabJenkins commented on issue #24158: [SPARK-26847][SQL] Pruning nested 
serializers from object serializers: MapType support
URL: https://github.com/apache/spark/pull/24158#issuecomment-475118884
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] viirya commented on issue #24158: [SPARK-26847][SQL] Pruning nested serializers from object serializers: MapType support

2019-03-20 Thread GitBox

viirya commented on issue #24158: [SPARK-26847][SQL] Pruning nested serializers 
from object serializers: MapType support
URL: https://github.com/apache/spark/pull/24158#issuecomment-475118671
 
 
   cc @cloud-fan @dongjoon-hyun @dbtsai 


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] SparkQA removed a comment on issue #24158: [SPARK-26847][SQL] Pruning nested serializers from object serializers: MapType support

2019-03-20 Thread GitBox

SparkQA removed a comment on issue #24158: [SPARK-26847][SQL] Pruning nested 
serializers from object serializers: MapType support
URL: https://github.com/apache/spark/pull/24158#issuecomment-475084764
 
 
   **[Test build #103746 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/103746/testReport)**
 for PR 24158 at commit 
[`d388952`](https://github.com/apache/spark/commit/d3889529c306a2e386ee8ab875b3814594cb4a9b).


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] SparkQA commented on issue #24158: [SPARK-26847][SQL] Pruning nested serializers from object serializers: MapType support

2019-03-20 Thread GitBox

SparkQA commented on issue #24158: [SPARK-26847][SQL] Pruning nested 
serializers from object serializers: MapType support
URL: https://github.com/apache/spark/pull/24158#issuecomment-475118448
 
 
   **[Test build #103746 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/103746/testReport)**
 for PR 24158 at commit 
[`d388952`](https://github.com/apache/spark/commit/d3889529c306a2e386ee8ab875b3814594cb4a9b).
* This patch passes all tests.
* This patch merges cleanly.
* This patch adds no public classes.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins commented on issue #23848: [SPARK-26946][SQL] Identifiers for multi-catalog

2019-03-20 Thread GitBox

AmplabJenkins commented on issue #23848: [SPARK-26946][SQL] Identifiers for 
multi-catalog
URL: https://github.com/apache/spark/pull/23848#issuecomment-475117458
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins removed a comment on issue #23848: [SPARK-26946][SQL] Identifiers for multi-catalog

2019-03-20 Thread GitBox

AmplabJenkins removed a comment on issue #23848: [SPARK-26946][SQL] Identifiers 
for multi-catalog
URL: https://github.com/apache/spark/pull/23848#issuecomment-475117458
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins commented on issue #23848: [SPARK-26946][SQL] Identifiers for multi-catalog

2019-03-20 Thread GitBox

AmplabJenkins commented on issue #23848: [SPARK-26946][SQL] Identifiers for 
multi-catalog
URL: https://github.com/apache/spark/pull/23848#issuecomment-475117459
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/103750/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins removed a comment on issue #23848: [SPARK-26946][SQL] Identifiers for multi-catalog

2019-03-20 Thread GitBox

AmplabJenkins removed a comment on issue #23848: [SPARK-26946][SQL] Identifiers 
for multi-catalog
URL: https://github.com/apache/spark/pull/23848#issuecomment-475117459
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/103750/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] SparkQA commented on issue #23848: [SPARK-26946][SQL] Identifiers for multi-catalog

2019-03-20 Thread GitBox

SparkQA commented on issue #23848: [SPARK-26946][SQL] Identifiers for 
multi-catalog
URL: https://github.com/apache/spark/pull/23848#issuecomment-475117213
 
 
   **[Test build #103750 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/103750/testReport)**
 for PR 23848 at commit 
[`3bb4485`](https://github.com/apache/spark/commit/3bb448503f35e9e035480401261702bcc3114e7a).
* This patch passes all tests.
* This patch merges cleanly.
* This patch adds no public classes.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] SparkQA removed a comment on issue #23848: [SPARK-26946][SQL] Identifiers for multi-catalog

2019-03-20 Thread GitBox

SparkQA removed a comment on issue #23848: [SPARK-26946][SQL] Identifiers for 
multi-catalog
URL: https://github.com/apache/spark/pull/23848#issuecomment-475087599
 
 
   **[Test build #103750 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/103750/testReport)**
 for PR 23848 at commit 
[`3bb4485`](https://github.com/apache/spark/commit/3bb448503f35e9e035480401261702bcc3114e7a).


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] LantaoJin commented on a change in pull request #23951: [SPARK-13704][CORE][YARN] Re-implement RackResolver to reduce resolving time

2019-03-20 Thread GitBox

LantaoJin commented on a change in pull request #23951: 
[SPARK-13704][CORE][YARN] Re-implement RackResolver to reduce resolving time
URL: https://github.com/apache/spark/pull/23951#discussion_r267624353
 
 

 ##
 File path: core/src/main/scala/org/apache/spark/scheduler/TaskSetManager.scala
 ##
 @@ -192,17 +192,17 @@ private[spark] class TaskSetManager(
 val (_, duration) = Utils.timeTakenMs {
   val hostToIndices = new HashMap[String, ArrayBuffer[Int]]()
   for (i <- (0 until numTasks).reverse) {
-addPendingTask(i, Option(hostToIndices))
+addPendingTask(i, Some(hostToIndices))
   }
-  // Convert preferred locations to racks in one invocation and zip with 
the origin indices.
-  // We de-duping the hosts to reduce this invocation further.
-  
sched.getRacksForHosts(hostToIndices.keySet.toList).zip(hostToIndices.values) 
foreach {
-case (Some(rack), indices) =>
-  pendingTasksForRack.getOrElseUpdate(rack, new ArrayBuffer) ++= 
indices
-case _ =>
+  // Resolve the rack for each host. This can be somehow slow, so de-dupe 
the list of hosts,
 
 Review comment:
   will delete it


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] LantaoJin commented on a change in pull request #23951: [SPARK-13704][CORE][YARN] Re-implement RackResolver to reduce resolving time

2019-03-20 Thread GitBox

LantaoJin commented on a change in pull request #23951: 
[SPARK-13704][CORE][YARN] Re-implement RackResolver to reduce resolving time
URL: https://github.com/apache/spark/pull/23951#discussion_r267624324
 
 

 ##
 File path: 
core/src/main/scala/org/apache/spark/scheduler/TaskSchedulerImpl.scala
 ##
 @@ -375,9 +375,10 @@ private[spark] class TaskSchedulerImpl(
 executorIdToRunningTaskIds(o.executorId) = HashSet[Long]()
 newExecAvail = true
   }
-  for (rack <- getRackForHost(o.host)) {
-hostsByRack.getOrElseUpdate(rack, new HashSet[String]()) += o.host
-  }
+}
+val hosts = offers.map(_.host)
 
 Review comment:
   ok


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] mccheah commented on a change in pull request #24029: [SPARK-27108][SQL] Add parsed SQL plans for create, CTAS.

2019-03-20 Thread GitBox

mccheah commented on a change in pull request #24029: [SPARK-27108][SQL] Add 
parsed SQL plans for create, CTAS.
URL: https://github.com/apache/spark/pull/24029#discussion_r267623814
 
 

 ##
 File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/sql/CreateTable.scala
 ##
 @@ -0,0 +1,66 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements.  See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License.  You may obtain a copy of the License at
+ *
+ *http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.spark.sql.catalyst.plans.logical.sql
+
+import org.apache.spark.sql.catalyst.TableIdentifier
+import org.apache.spark.sql.catalyst.catalog.BucketSpec
+import org.apache.spark.sql.catalyst.expressions.Attribute
+import org.apache.spark.sql.catalyst.plans.logical.LogicalPlan
+import org.apache.spark.sql.types.StructType
+
+/**
+ * A CREATE TABLE command, as parsed from SQL.
+ *
+ * This is a metadata-only command and is not used to write data to the 
created table.
+ */
+case class CreateTable(
 
 Review comment:
   Just so I understand correctly, `DDLResolution` returns an instance of 
`org.apache.spark.sql.execution.datasources.CreateTable`, which is still a 
subtype of `LogicalPlan`. It's a little confusing at least to me to have two 
`LogicalPlan` types be named the same thing, but they represent different steps 
of the query planning, and are in different packages.
   
   What if we called this variant of `CreateTable`, say, making this 
`ParsedCreateTable` or perhaps `CreateTableStatement`?
   
   Also, when we start partitioning between V1 sources and V2 sources in the 
Analyzer, are we going to return the same type of `LogicalPlan` back - a 
`org.apache.spark.sql.execution.datasources.CreateTable` regardless of it's 
loading V1 or V2? Or should we instead be naming 
`org.apache.spark.sql.execution.datasources.CreateTableV1`, anticipating the 
Analyzer to return instances of `CreateTableV2` plans down the road?


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] LantaoJin edited a comment on issue #24157: [WIP][SPARK-27216][CORE] Kryo serialization with RoaringBitmap

2019-03-20 Thread GitBox

LantaoJin edited a comment on issue #24157: [WIP][SPARK-27216][CORE] Kryo 
serialization with RoaringBitmap
URL: https://github.com/apache/spark/pull/24157#issuecomment-475107799
 
 
   @srowen @squito I've added another UT which is the minimized dataset from 
our product issue.
   In this UT, I roughly comment one line in `ShuffleBlockFetcherIterator` to 
avoid job fail.
   ```scala
   if (buf.size == 0) {
   // throwFetchFailedException(blockId, address, new IOException(msg))
   }
   ```
   After that, the testing `fail zero-size blocks` in 
`ShuffleBlockFetcherIteratorSuite` will fail. This was introduced by #21219. So 
in Spark2.3.x, this UT doesn't need this hard code commenting.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] mccheah commented on a change in pull request #24029: [SPARK-27108][SQL] Add parsed SQL plans for create, CTAS.

2019-03-20 Thread GitBox

mccheah commented on a change in pull request #24029: [SPARK-27108][SQL] Add 
parsed SQL plans for create, CTAS.
URL: https://github.com/apache/spark/pull/24029#discussion_r267621858
 
 

 ##
 File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/sql/ParsedLogicalPlan.scala
 ##
 @@ -0,0 +1,38 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements.  See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License.  You may obtain a copy of the License at
+ *
+ *http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.spark.sql.catalyst.plans.logical.sql
+
+import org.apache.spark.sql.catalyst.plans.logical.LogicalPlan
+
+/**
+ * A logical plan node that contains exactly what was parsed from SQL.
+ *
+ * This is used to hold information parsed from SQL when there are multiple 
implementations of a
+ * query or command. For example, CREATE TABLE may be implemented by different 
nodes for v1 and v2.
+ * Instead of parsing directly to a v1 CreateTable that keeps metadata in 
CatalogTable, and then
+ * converting that v1 metadata to the v2 equivalent, the sql [[CreateTable]] 
plan is produced by
+ * the parser and converted once into both implementations.
+ *
+ * Parsed logical plans are not resolved because they must be converted to 
concrete logical plans.
+ *
+ * Parsed logical plans are located in Catalyst so that as much SQL parsing 
logic as possible is be
+ * kept in a [[org.apache.spark.sql.catalyst.parser.AbstractSqlParser]].
+ */
+private[sql] abstract class ParsedLogicalPlan extends LogicalPlan {
 
 Review comment:
   +1 - @rdblue these should only be inputs to the analyzer, not outputs. Would 
be helpful to write specific JavaDoc on this.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] LantaoJin commented on issue #24157: [WIP][SPARK-27216][CORE] Kryo serialization with RoaringBitmap

2019-03-20 Thread GitBox

LantaoJin commented on issue #24157: [WIP][SPARK-27216][CORE] Kryo 
serialization with RoaringBitmap
URL: https://github.com/apache/spark/pull/24157#issuecomment-475107799
 
 
   @srowen @squito I've added another UT which is the minimized dataset from 
our product issue.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins removed a comment on issue #24164: [SPARK-27225][SQL] Implement join strategy hints

2019-03-20 Thread GitBox

AmplabJenkins removed a comment on issue #24164: [SPARK-27225][SQL] Implement 
join strategy hints
URL: https://github.com/apache/spark/pull/24164#issuecomment-475104056
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins removed a comment on issue #24164: [SPARK-27225][SQL] Implement join strategy hints

2019-03-20 Thread GitBox

AmplabJenkins removed a comment on issue #24164: [SPARK-27225][SQL] Implement 
join strategy hints
URL: https://github.com/apache/spark/pull/24164#issuecomment-475104060
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution-unified/9132/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] SparkQA commented on issue #24164: [SPARK-27225][SQL] Implement join strategy hints

2019-03-20 Thread GitBox

SparkQA commented on issue #24164: [SPARK-27225][SQL] Implement join strategy 
hints
URL: https://github.com/apache/spark/pull/24164#issuecomment-475104295
 
 
   **[Test build #103754 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/103754/testReport)**
 for PR 24164 at commit 
[`1426294`](https://github.com/apache/spark/commit/14262941d80f5f5b0075ee294769af3e81c2d7b6).


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] BryanCutler commented on issue #24095: [SPARK-27163][PYTHON] Cleanup and consolidate Pandas UDF functionality

2019-03-20 Thread GitBox

BryanCutler commented on issue #24095: [SPARK-27163][PYTHON] Cleanup and 
consolidate Pandas UDF functionality
URL: https://github.com/apache/spark/pull/24095#issuecomment-475104137
 
 
   Apologies, I moved things around again for item (2) because I didn't really 
like having an option in `ArrowStreamPandasSerializer` to send the 
`START_ARROW_STREAM` either.
   
   Now, I have `_create_batch(...)` as a method in 
`ArrowStreamPandasSerializer` (where it belongs I think), and then have a 
subclass used for Pandas UDFs that overrides `dump_stream` that can send 
`START_ARROW_STREAM`.
   
   I think it's clearer this way because it's easier to see what serializer is 
used where, and I also tried to improve the docs. Let me know what you think 
when you get the chance to take another look @HyukjinKwon @ueshin . Thanks!


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins commented on issue #24164: [SPARK-27225][SQL] Implement join strategy hints

2019-03-20 Thread GitBox

AmplabJenkins commented on issue #24164: [SPARK-27225][SQL] Implement join 
strategy hints
URL: https://github.com/apache/spark/pull/24164#issuecomment-475104060
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution-unified/9132/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins commented on issue #24164: [SPARK-27225][SQL] Implement join strategy hints

2019-03-20 Thread GitBox

AmplabJenkins commented on issue #24164: [SPARK-27225][SQL] Implement join 
strategy hints
URL: https://github.com/apache/spark/pull/24164#issuecomment-475104056
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] HyukjinKwon commented on issue #24153: [MINOR][EXAMPLES] Add missing return keyword streaming word count example

2019-03-20 Thread GitBox

HyukjinKwon commented on issue #24153: [MINOR][EXAMPLES] Add missing return 
keyword streaming word count example
URL: https://github.com/apache/spark/pull/24153#issuecomment-475104007
 
 
   Yea, the change looked good. What I wanted to make sure was to keep the PR 
description format to make PR well-formed as guided in 
https://spark.apache.org/contributing.html


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] maryannxue opened a new pull request #24164: [SPARK-27225][SQL] Implement join strategy hints

2019-03-20 Thread GitBox

maryannxue opened a new pull request #24164: [SPARK-27225][SQL] Implement join 
strategy hints
URL: https://github.com/apache/spark/pull/24164
 
 
   ## What changes were proposed in this pull request?
   
   This PR extends the existing BROADCAST join hint (for both broadcast-hash 
join and broadcast-nested-loop join) by implementing other join strategy hints 
corresponding to the rest of Spark's existing join strategies: shuffle-hash, 
sort-merge, cartesian-product. The hint names: SHUFFLE_MERGE, SHUFFLE_HASH, 
SHUFFLE_REPLICATE_NL are partly different from the code names in order to make 
them clearer to users and reflect the actual algorithms better.
   
   The hinted strategy will be used for the join with which it is associated if 
it is applicable/doable.
   
   Conflict resolving rules in case of multiple hints:
   1. Conflicts within either side of the join: take the first strategy hint 
specified in the query, or the top hint node in Dataset. For example, in 
"select /*+ merge(t1) */ /*+ broadcast(t1) */ k1, v2 from t1 join t2 on t1.k1 = 
t2.k2", take "merge(t1)"; in 
```df1.hint("merge").hint("shuffle_hash").join(df2)```, take "shuffle_hash". 
This is a general hint conflict resolving strategy, not specific to join 
strategy hint. 
   2. Conflicts between two sides of the join:
 a) In case of different strategy hints, hints are prioritized as 
```BROADCAST``` over ```SHUFFLE_MERGE``` over ```SHUFFLE_HASH``` over 
```SHUFFLE_REPLICATE_NL```.
 b) In case of same strategy hints but conflicts in build side, choose the 
build side based on join type and size.
   
   ## How was this patch tested?
   
   Added new UTs.
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] dongjoon-hyun commented on a change in pull request #24047: [SPARK-25196][SQL] Extends the analyze column command for cached tables

2019-03-20 Thread GitBox

dongjoon-hyun commented on a change in pull request #24047: [SPARK-25196][SQL] 
Extends the analyze column command for cached tables 
URL: https://github.com/apache/spark/pull/24047#discussion_r267616522
 
 

 ##
 File path: 
sql/core/src/main/scala/org/apache/spark/sql/execution/command/AnalyzeColumnCommand.scala
 ##
 @@ -39,32 +40,39 @@ case class AnalyzeColumnCommand(
 require(columnNames.isDefined ^ allColumns, "Parameter `columnNames` or 
`allColumns` are " +
   "mutually exclusive. Only one of them should be specified.")
 val sessionState = sparkSession.sessionState
-val db = 
tableIdent.database.getOrElse(sessionState.catalog.getCurrentDatabase)
-val tableIdentWithDB = TableIdentifier(tableIdent.table, Some(db))
-val tableMeta = sessionState.catalog.getTableMetadata(tableIdentWithDB)
-if (tableMeta.tableType == CatalogTableType.VIEW) {
-  throw new AnalysisException("ANALYZE TABLE is not supported on views.")
-}
-val sizeInBytes = CommandUtils.calculateTotalSize(sparkSession, tableMeta)
-val relation = sparkSession.table(tableIdent).logicalPlan
-val columnsToAnalyze = getColumnsToAnalyze(tableIdent, relation, 
columnNames, allColumns)
-
-// Compute stats for the computed list of columns.
-val (rowCount, newColStats) =
-  CommandUtils.computeColumnStats(sparkSession, relation, columnsToAnalyze)
 
-// We also update table-level stats in order to keep them consistent with 
column-level stats.
-val statistics = CatalogStatistics(
-  sizeInBytes = sizeInBytes,
-  rowCount = Some(rowCount),
-  // Newly computed column stats should override the existing ones.
-  colStats = tableMeta.stats.map(_.colStats).getOrElse(Map.empty) ++ 
newColStats)
-
-sessionState.catalog.alterTableStats(tableIdentWithDB, Some(statistics))
+tableIdent.database match {
+  case Some(db) if db == 
sparkSession.sharedState.globalTempViewManager.database =>
+val plan = 
sessionState.catalog.getGlobalTempView(tableIdent.identifier).getOrElse {
+  throw new NoSuchTableException(db = db, table = 
tableIdent.identifier)
+}
+analyzeColumnInTempView(plan, sparkSession)
+  case Some(_) =>
+analyzeColumnInCatalog(sparkSession)
+  case None =>
+sessionState.catalog.getTempView(tableIdent.identifier) match {
+  case Some(tempView) => analyzeColumnInTempView(tempView, 
sparkSession)
+  case _ => analyzeColumnInCatalog(sparkSession)
+}
+}
 
 Seq.empty[Row]
   }
 
+  private def analyzeColumnInTempView(plan: LogicalPlan, sparkSession: 
SparkSession): Unit = {
+val cacheManager = sparkSession.sharedState.cacheManager
+cacheManager.lookupCachedData(plan) match {
+  case Some(cachedData) =>
+val columnsToAnalyze = getColumnsToAnalyze(
+  tableIdent, cachedData.plan, columnNames, allColumns)
+cacheManager.analyzeColumnCacheQuery(sparkSession, cachedData, 
columnsToAnalyze)
+  case _ =>
+val catalog = sparkSession.sessionState.catalog
+val db = tableIdent.database.getOrElse(catalog.getCurrentDatabase)
+throw new NoSuchTableException(db = db, table = tableIdent.identifier)
 
 Review comment:
   Got it~


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] dongjoon-hyun edited a comment on issue #23803: [SPARK-26894][SQL] Handle Alias as well in AggregateEstimation to propagate child stats

2019-03-20 Thread GitBox

dongjoon-hyun edited a comment on issue #23803: [SPARK-26894][SQL] Handle Alias 
as well in AggregateEstimation to propagate child stats
URL: https://github.com/apache/spark/pull/23803#issuecomment-475101858
 
 
   Sure, @maropu . For the first contributor, you need to add him into `Apache 
Spark Contributor` group.
   
   1. Visit Project Configuration Role Page
   - 
https://issues.apache.org/jira/plugins/servlet/project-config/SPARK/roles
   
   2. Click `Add users to a role` button (at the right-top of the page).
   
   3. Fill the the pop-up box.
   - The JIRA user ID or email
   - Choose `Contributor`.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] dongjoon-hyun edited a comment on issue #23803: [SPARK-26894][SQL] Handle Alias as well in AggregateEstimation to propagate child stats

2019-03-20 Thread GitBox

dongjoon-hyun edited a comment on issue #23803: [SPARK-26894][SQL] Handle Alias 
as well in AggregateEstimation to propagate child stats
URL: https://github.com/apache/spark/pull/23803#issuecomment-475101858
 
 
   Sure, @maropu . For the first contributor, you need to add him into `Apache 
Spark Contributor` group.
   
   1. Visit Project Configuration Role Page
   - 
https://issues.apache.org/jira/plugins/servlet/project-config/SPARK/roles
   
   2. Click `Add users to a role` button (right-top of the page).
   
   3. Fill the the pop-up box.
   - The JIRA user ID or email
   - Choose `Contributor`.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] dongjoon-hyun commented on issue #23803: [SPARK-26894][SQL] Handle Alias as well in AggregateEstimation to propagate child stats

2019-03-20 Thread GitBox

dongjoon-hyun commented on issue #23803: [SPARK-26894][SQL] Handle Alias as 
well in AggregateEstimation to propagate child stats
URL: https://github.com/apache/spark/pull/23803#issuecomment-475101858
 
 
   Sure, @maropu . For the first contributor, you need to him into `Apache 
Spark Contributor` group.
   
   1. Visit Project Configuration Role Page
   - 
https://issues.apache.org/jira/plugins/servlet/project-config/SPARK/roles
   
   2. Click `Add users to a role` button.
   
   3. Fill the the pop-up box.
   - The JIRA user ID or email
   - Choose `Contributor`.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] dongjoon-hyun edited a comment on issue #23803: [SPARK-26894][SQL] Handle Alias as well in AggregateEstimation to propagate child stats

2019-03-20 Thread GitBox

dongjoon-hyun edited a comment on issue #23803: [SPARK-26894][SQL] Handle Alias 
as well in AggregateEstimation to propagate child stats
URL: https://github.com/apache/spark/pull/23803#issuecomment-475101858
 
 
   Sure, @maropu . For the first contributor, you need to add him into `Apache 
Spark Contributor` group.
   
   1. Visit Project Configuration Role Page
   - 
https://issues.apache.org/jira/plugins/servlet/project-config/SPARK/roles
   
   2. Click `Add users to a role` button.
   
   3. Fill the the pop-up box.
   - The JIRA user ID or email
   - Choose `Contributor`.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] beliefer edited a comment on issue #23841: [SPARK-26936][SQL] Fix bug of insert overwrite local dir can not create temporary path in local staging directory

2019-03-20 Thread GitBox

beliefer edited a comment on issue #23841: [SPARK-26936][SQL] Fix bug of insert 
overwrite local dir can not create temporary path in local staging directory
URL: https://github.com/apache/spark/pull/23841#issuecomment-475097724
 
 
   > Why did you remove the tests?
   
   Sorry, I can't find the reason why the feature 'insert overwrite local 
directory' has an inconsistent behavior in different environment. I have moved 
the conversation of create no exists local path with `insert overwrite local 
dir` to PR https://github.com/apache/spark/pull/23950. The current PR only used 
for `Mkdirs failed to create file` in yarn-client mode.
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] LantaoJin commented on a change in pull request #24157: [WIP][SPARK-27216][CORE] Kryo serialization with RoaringBitmap

2019-03-20 Thread GitBox

LantaoJin commented on a change in pull request #24157: 
[WIP][SPARK-27216][CORE] Kryo serialization with RoaringBitmap
URL: https://github.com/apache/spark/pull/24157#discussion_r267614390
 
 

 ##
 File path: 
core/src/test/scala/org/apache/spark/serializer/KryoSerializerSuite.scala
 ##
 @@ -565,6 +565,20 @@ class KryoSerializerAutoResetDisabledSuite extends 
SparkFunSuite with SharedSpar
 assert(serInstance.deserialize[Any](serObj) === (obj))
 assert(serInstance.deserialize[Any](byteBuffer) === (obj))
   }
+
+  test("SPARK-27216: kryo serialization with RoaringBitmap") {
+val bitmap = new RoaringBitmap
+bitmap.add(1787)
+
+val safeSer = new KryoSerializer(conf).newInstance()
+val bitmap2 : RoaringBitmap = 
safeSer.deserialize(safeSer.serialize(bitmap))
+assert(bitmap2.equals(bitmap))
+
+conf.set("spark.kryo.unsafe", "true")
 
 Review comment:
   It can be move to a totally new Suite. I will update it.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] beliefer commented on issue #23841: [SPARK-26936][SQL] Fix bug of insert overwrite local dir can not create temporary path in local staging directory

2019-03-20 Thread GitBox

beliefer commented on issue #23841: [SPARK-26936][SQL] Fix bug of insert 
overwrite local dir can not create temporary path in local staging directory
URL: https://github.com/apache/spark/pull/23841#issuecomment-475097724
 
 
   > Why did you remove the tests?
   Sorry, I can't find the reason why the feature 'insert overwrite local 
directory' has an inconsistent behavior in different environment. I have moved 
the conversation of create no exists local path with `insert overwrite local 
dir` to PR https://github.com/apache/spark/pull/23950. The current PR only used 
for `Mkdirs failed to create file` in yarn-client mode.
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] SparkQA commented on issue #24161: [SPARK-27222][SQL] Support Instant and LocalDate in Literal.apply

2019-03-20 Thread GitBox

SparkQA commented on issue #24161: [SPARK-27222][SQL] Support Instant and 
LocalDate in Literal.apply
URL: https://github.com/apache/spark/pull/24161#issuecomment-475097032
 
 
   **[Test build #103753 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/103753/testReport)**
 for PR 24161 at commit 
[`c51cd91`](https://github.com/apache/spark/commit/c51cd91551e4c547eebc148d486e83f7f86ac58c).


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins removed a comment on issue #24161: [SPARK-27222][SQL] Support Instant and LocalDate in Literal.apply

2019-03-20 Thread GitBox

AmplabJenkins removed a comment on issue #24161: [SPARK-27222][SQL] Support 
Instant and LocalDate in Literal.apply
URL: https://github.com/apache/spark/pull/24161#issuecomment-475096866
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution-unified/9131/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins removed a comment on issue #24161: [SPARK-27222][SQL] Support Instant and LocalDate in Literal.apply

2019-03-20 Thread GitBox

AmplabJenkins removed a comment on issue #24161: [SPARK-27222][SQL] Support 
Instant and LocalDate in Literal.apply
URL: https://github.com/apache/spark/pull/24161#issuecomment-475096863
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins commented on issue #24161: [SPARK-27222][SQL] Support Instant and LocalDate in Literal.apply

2019-03-20 Thread GitBox

AmplabJenkins commented on issue #24161: [SPARK-27222][SQL] Support Instant and 
LocalDate in Literal.apply
URL: https://github.com/apache/spark/pull/24161#issuecomment-475096866
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution-unified/9131/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins commented on issue #24161: [SPARK-27222][SQL] Support Instant and LocalDate in Literal.apply

2019-03-20 Thread GitBox

AmplabJenkins commented on issue #24161: [SPARK-27222][SQL] Support Instant and 
LocalDate in Literal.apply
URL: https://github.com/apache/spark/pull/24161#issuecomment-475096863
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins removed a comment on issue #24160: [SPARK-27219][core] Treat timeouts as fatal in SASL fallback path.

2019-03-20 Thread GitBox

AmplabJenkins removed a comment on issue #24160: [SPARK-27219][core] Treat 
timeouts as fatal in SASL fallback path.
URL: https://github.com/apache/spark/pull/24160#issuecomment-475096644
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/103740/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins commented on issue #24160: [SPARK-27219][core] Treat timeouts as fatal in SASL fallback path.

2019-03-20 Thread GitBox

AmplabJenkins commented on issue #24160: [SPARK-27219][core] Treat timeouts as 
fatal in SASL fallback path.
URL: https://github.com/apache/spark/pull/24160#issuecomment-475096644
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/103740/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins commented on issue #24160: [SPARK-27219][core] Treat timeouts as fatal in SASL fallback path.

2019-03-20 Thread GitBox

AmplabJenkins commented on issue #24160: [SPARK-27219][core] Treat timeouts as 
fatal in SASL fallback path.
URL: https://github.com/apache/spark/pull/24160#issuecomment-475096635
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins removed a comment on issue #24160: [SPARK-27219][core] Treat timeouts as fatal in SASL fallback path.

2019-03-20 Thread GitBox

AmplabJenkins removed a comment on issue #24160: [SPARK-27219][core] Treat 
timeouts as fatal in SASL fallback path.
URL: https://github.com/apache/spark/pull/24160#issuecomment-475096635
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] SparkQA removed a comment on issue #24160: [SPARK-27219][core] Treat timeouts as fatal in SASL fallback path.

2019-03-20 Thread GitBox

SparkQA removed a comment on issue #24160: [SPARK-27219][core] Treat timeouts 
as fatal in SASL fallback path.
URL: https://github.com/apache/spark/pull/24160#issuecomment-475045854
 
 
   **[Test build #103740 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/103740/testReport)**
 for PR 24160 at commit 
[`102e08b`](https://github.com/apache/spark/commit/102e08b620caa8582c140d5efe1744859f4c33e6).


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] SparkQA commented on issue #24160: [SPARK-27219][core] Treat timeouts as fatal in SASL fallback path.

2019-03-20 Thread GitBox

SparkQA commented on issue #24160: [SPARK-27219][core] Treat timeouts as fatal 
in SASL fallback path.
URL: https://github.com/apache/spark/pull/24160#issuecomment-475096294
 
 
   **[Test build #103740 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/103740/testReport)**
 for PR 24160 at commit 
[`102e08b`](https://github.com/apache/spark/commit/102e08b620caa8582c140d5efe1744859f4c33e6).
* This patch passes all tests.
* This patch merges cleanly.
* This patch adds no public classes.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins removed a comment on issue #23912: [SPARK-21029][SS] StreamingQuery should be stopped when the SparkSession is stopped

2019-03-20 Thread GitBox

AmplabJenkins removed a comment on issue #23912: [SPARK-21029][SS] 
StreamingQuery should be stopped when the SparkSession is stopped
URL: https://github.com/apache/spark/pull/23912#issuecomment-475096017
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/103744/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins removed a comment on issue #23912: [SPARK-21029][SS] StreamingQuery should be stopped when the SparkSession is stopped

2019-03-20 Thread GitBox

AmplabJenkins removed a comment on issue #23912: [SPARK-21029][SS] 
StreamingQuery should be stopped when the SparkSession is stopped
URL: https://github.com/apache/spark/pull/23912#issuecomment-475096012
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins commented on issue #23912: [SPARK-21029][SS] StreamingQuery should be stopped when the SparkSession is stopped

2019-03-20 Thread GitBox

AmplabJenkins commented on issue #23912: [SPARK-21029][SS] StreamingQuery 
should be stopped when the SparkSession is stopped
URL: https://github.com/apache/spark/pull/23912#issuecomment-475096017
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/103744/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins commented on issue #23912: [SPARK-21029][SS] StreamingQuery should be stopped when the SparkSession is stopped

2019-03-20 Thread GitBox

AmplabJenkins commented on issue #23912: [SPARK-21029][SS] StreamingQuery 
should be stopped when the SparkSession is stopped
URL: https://github.com/apache/spark/pull/23912#issuecomment-475096012
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] SparkQA removed a comment on issue #23912: [SPARK-21029][SS] StreamingQuery should be stopped when the SparkSession is stopped

2019-03-20 Thread GitBox

SparkQA removed a comment on issue #23912: [SPARK-21029][SS] StreamingQuery 
should be stopped when the SparkSession is stopped
URL: https://github.com/apache/spark/pull/23912#issuecomment-475045932
 
 
   **[Test build #103744 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/103744/testReport)**
 for PR 23912 at commit 
[`60c3d24`](https://github.com/apache/spark/commit/60c3d24dce6c96432007e6f312136864f8c481b2).


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] SparkQA commented on issue #23912: [SPARK-21029][SS] StreamingQuery should be stopped when the SparkSession is stopped

2019-03-20 Thread GitBox

SparkQA commented on issue #23912: [SPARK-21029][SS] StreamingQuery should be 
stopped when the SparkSession is stopped
URL: https://github.com/apache/spark/pull/23912#issuecomment-475095765
 
 
   **[Test build #103744 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/103744/testReport)**
 for PR 23912 at commit 
[`60c3d24`](https://github.com/apache/spark/commit/60c3d24dce6c96432007e6f312136864f8c481b2).
* This patch passes all tests.
* This patch merges cleanly.
* This patch adds no public classes.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins removed a comment on issue #24150: [SPARK-27209][SQL] Split parsing of SELECT and INSERT into two top-level rules in the grammar file.

2019-03-20 Thread GitBox

AmplabJenkins removed a comment on issue #24150: [SPARK-27209][SQL] Split 
parsing of SELECT and INSERT into two top-level rules in the grammar file.
URL: https://github.com/apache/spark/pull/24150#issuecomment-475095602
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/103742/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins removed a comment on issue #24150: [SPARK-27209][SQL] Split parsing of SELECT and INSERT into two top-level rules in the grammar file.

2019-03-20 Thread GitBox

AmplabJenkins removed a comment on issue #24150: [SPARK-27209][SQL] Split 
parsing of SELECT and INSERT into two top-level rules in the grammar file.
URL: https://github.com/apache/spark/pull/24150#issuecomment-475095600
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins commented on issue #24150: [SPARK-27209][SQL] Split parsing of SELECT and INSERT into two top-level rules in the grammar file.

2019-03-20 Thread GitBox

AmplabJenkins commented on issue #24150: [SPARK-27209][SQL] Split parsing of 
SELECT and INSERT into two top-level rules in the grammar file.
URL: https://github.com/apache/spark/pull/24150#issuecomment-475095600
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins commented on issue #24150: [SPARK-27209][SQL] Split parsing of SELECT and INSERT into two top-level rules in the grammar file.

2019-03-20 Thread GitBox

AmplabJenkins commented on issue #24150: [SPARK-27209][SQL] Split parsing of 
SELECT and INSERT into two top-level rules in the grammar file.
URL: https://github.com/apache/spark/pull/24150#issuecomment-475095602
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/103742/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] SparkQA removed a comment on issue #24150: [SPARK-27209][SQL] Split parsing of SELECT and INSERT into two top-level rules in the grammar file.

2019-03-20 Thread GitBox

SparkQA removed a comment on issue #24150: [SPARK-27209][SQL] Split parsing of 
SELECT and INSERT into two top-level rules in the grammar file.
URL: https://github.com/apache/spark/pull/24150#issuecomment-475045862
 
 
   **[Test build #103742 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/103742/testReport)**
 for PR 24150 at commit 
[`e651e14`](https://github.com/apache/spark/commit/e651e148fec330c0aaef1005128f8b497286f570).


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] SparkQA commented on issue #24150: [SPARK-27209][SQL] Split parsing of SELECT and INSERT into two top-level rules in the grammar file.

2019-03-20 Thread GitBox

SparkQA commented on issue #24150: [SPARK-27209][SQL] Split parsing of SELECT 
and INSERT into two top-level rules in the grammar file.
URL: https://github.com/apache/spark/pull/24150#issuecomment-475095265
 
 
   **[Test build #103742 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/103742/testReport)**
 for PR 24150 at commit 
[`e651e14`](https://github.com/apache/spark/commit/e651e148fec330c0aaef1005128f8b497286f570).
* This patch passes all tests.
* This patch merges cleanly.
* This patch adds no public classes.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] maropu commented on issue #23803: [SPARK-26894][SQL] Handle Alias as well in AggregateEstimation to propagate child stats

2019-03-20 Thread GitBox

maropu commented on issue #23803: [SPARK-26894][SQL] Handle Alias as well in 
AggregateEstimation to propagate child stats
URL: https://github.com/apache/spark/pull/23803#issuecomment-475094662
 
 
   @dongjoon-hyun I couldn't assign @venkata91 in the jira and I'm not sure how 
to add a new account in contributor lists there... any info?


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] jiangruocheng edited a comment on issue #24153: [MINOR][EXAMPLES] Add missing return keyword streaming word count example

2019-03-20 Thread GitBox

jiangruocheng edited a comment on issue #24153: [MINOR][EXAMPLES] Add missing 
return keyword streaming word count example
URL: https://github.com/apache/spark/pull/24153#issuecomment-475094300
 
 
   > Please keep the PR description format. How did you test?
   I got it. 
   I just ran this example code and found the filter couldn't work.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] jiangruocheng commented on issue #24153: [MINOR][EXAMPLES] Add missing return keyword streaming word count example

2019-03-20 Thread GitBox

jiangruocheng commented on issue #24153: [MINOR][EXAMPLES] Add missing return 
keyword streaming word count example
URL: https://github.com/apache/spark/pull/24153#issuecomment-475094300
 
 
   > Please keep the PR description format. How did you test?
   I got it. 
   I just runned this example code and found the filter couldn't work.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] maropu commented on issue #23803: [SPARK-26894][SQL] Handle Alias as well in AggregateEstimation to propagate child stats

2019-03-20 Thread GitBox

maropu commented on issue #23803: [SPARK-26894][SQL] Handle Alias as well in 
AggregateEstimation to propagate child stats
URL: https://github.com/apache/spark/pull/23803#issuecomment-475094090
 
 
   Thanks! Merged to master.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] srowen commented on issue #24127: [SPARK-27186][CORE] optimize sortshuffle write

2019-03-20 Thread GitBox

srowen commented on issue #24127: [SPARK-27186][CORE] optimize sortshuffle write
URL: https://github.com/apache/spark/pull/24127#issuecomment-475094168
 
 
   I see, it's not a special Iterator implementation. Is it possible to 
refactor this code block, which looks like appears elsewhere, into any 
reasonable common function?


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] maropu closed pull request #23803: [SPARK-26894][SQL] Handle Alias as well in AggregateEstimation to propagate child stats

2019-03-20 Thread GitBox

maropu closed pull request #23803: [SPARK-26894][SQL] Handle Alias as well in 
AggregateEstimation to propagate child stats
URL: https://github.com/apache/spark/pull/23803
 
 
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] SparkQA commented on issue #24138: [MINOR][SQL]Put the grammar of database together, because this is good for maintenance and readability.

2019-03-20 Thread GitBox

SparkQA commented on issue #24138: [MINOR][SQL]Put the grammar of database 
together, because this is good for maintenance and readability.
URL: https://github.com/apache/spark/pull/24138#issuecomment-475093686
 
 
   **[Test build #4649 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/4649/testReport)**
 for PR 24138 at commit 
[`1a784a5`](https://github.com/apache/spark/commit/1a784a52a4a23d8be0664cd1c5ecfc7b3f6a0520).


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins removed a comment on issue #24161: [SPARK-27222][SQL] Support Instant and LocalDate in Literal.apply

2019-03-20 Thread GitBox

AmplabJenkins removed a comment on issue #24161: [SPARK-27222][SQL] Support 
Instant and LocalDate in Literal.apply
URL: https://github.com/apache/spark/pull/24161#issuecomment-475093302
 
 
   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/103745/
   Test FAILed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins removed a comment on issue #24161: [SPARK-27222][SQL] Support Instant and LocalDate in Literal.apply

2019-03-20 Thread GitBox

AmplabJenkins removed a comment on issue #24161: [SPARK-27222][SQL] Support 
Instant and LocalDate in Literal.apply
URL: https://github.com/apache/spark/pull/24161#issuecomment-475093300
 
 
   Merged build finished. Test FAILed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins commented on issue #24161: [SPARK-27222][SQL] Support Instant and LocalDate in Literal.apply

2019-03-20 Thread GitBox

AmplabJenkins commented on issue #24161: [SPARK-27222][SQL] Support Instant and 
LocalDate in Literal.apply
URL: https://github.com/apache/spark/pull/24161#issuecomment-475093302
 
 
   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/103745/
   Test FAILed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] HyukjinKwon closed pull request #24159: [SPARK-27221][SQL]Improve the assert error message in TreeNode.parseToJson

2019-03-20 Thread GitBox

HyukjinKwon closed pull request #24159: [SPARK-27221][SQL]Improve the assert 
error message in TreeNode.parseToJson
URL: https://github.com/apache/spark/pull/24159
 
 
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] SparkQA commented on issue #24161: [SPARK-27222][SQL] Support Instant and LocalDate in Literal.apply

2019-03-20 Thread GitBox

SparkQA commented on issue #24161: [SPARK-27222][SQL] Support Instant and 
LocalDate in Literal.apply
URL: https://github.com/apache/spark/pull/24161#issuecomment-475093198
 
 
   **[Test build #103745 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/103745/testReport)**
 for PR 24161 at commit 
[`80b2e30`](https://github.com/apache/spark/commit/80b2e309dbfcf796a09f7b08d5e68862014fa266).
* This patch **fails Spark unit tests**.
* This patch merges cleanly.
* This patch adds no public classes.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins removed a comment on issue #24161: [SPARK-27222][SQL] Support Instant and LocalDate in Literal.apply

2019-03-20 Thread GitBox

AmplabJenkins removed a comment on issue #24161: [SPARK-27222][SQL] Support 
Instant and LocalDate in Literal.apply
URL: https://github.com/apache/spark/pull/24161#issuecomment-475074846
 
 
   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/103739/
   Test FAILed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] maropu commented on issue #23841: [SPARK-26936][SQL] Fix bug of insert overwrite local dir can not create temporary path in local staging directory

2019-03-20 Thread GitBox

maropu commented on issue #23841: [SPARK-26936][SQL] Fix bug of insert 
overwrite local dir can not create temporary path in local staging directory
URL: https://github.com/apache/spark/pull/23841#issuecomment-475093193
 
 
   Why did you remove the tests?


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins commented on issue #24161: [SPARK-27222][SQL] Support Instant and LocalDate in Literal.apply

2019-03-20 Thread GitBox

AmplabJenkins commented on issue #24161: [SPARK-27222][SQL] Support Instant and 
LocalDate in Literal.apply
URL: https://github.com/apache/spark/pull/24161#issuecomment-475093300
 
 
   Merged build finished. Test FAILed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] SparkQA removed a comment on issue #24161: [SPARK-27222][SQL] Support Instant and LocalDate in Literal.apply

2019-03-20 Thread GitBox

SparkQA removed a comment on issue #24161: [SPARK-27222][SQL] Support Instant 
and LocalDate in Literal.apply
URL: https://github.com/apache/spark/pull/24161#issuecomment-475074766
 
 
   **[Test build #103745 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/103745/testReport)**
 for PR 24161 at commit 
[`80b2e30`](https://github.com/apache/spark/commit/80b2e309dbfcf796a09f7b08d5e68862014fa266).


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] HyukjinKwon closed pull request #24162: [SPARK-27223][SQL] Remove private methods that skip conversion when passing user schemas for constructing a DataFrame

2019-03-20 Thread GitBox

HyukjinKwon closed pull request #24162: [SPARK-27223][SQL] Remove private 
methods that skip conversion when passing user schemas for constructing a 
DataFrame
URL: https://github.com/apache/spark/pull/24162
 
 
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] HyukjinKwon commented on issue #24159: [SPARK-27221][SQL]Improve the assert error message in TreeNode.parseToJson

2019-03-20 Thread GitBox

HyukjinKwon commented on issue #24159: [SPARK-27221][SQL]Improve the assert 
error message in TreeNode.parseToJson
URL: https://github.com/apache/spark/pull/24159#issuecomment-475092946
 
 
   Merged to master.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins commented on issue #24159: [SPARK-27221][SQL]Improve the assert error message in TreeNode.parseToJson

2019-03-20 Thread GitBox

AmplabJenkins commented on issue #24159: [SPARK-27221][SQL]Improve the assert 
error message in TreeNode.parseToJson
URL: https://github.com/apache/spark/pull/24159#issuecomment-475092826
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/103741/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins removed a comment on issue #24159: [SPARK-27221][SQL]Improve the assert error message in TreeNode.parseToJson

2019-03-20 Thread GitBox

AmplabJenkins removed a comment on issue #24159: [SPARK-27221][SQL]Improve the 
assert error message in TreeNode.parseToJson
URL: https://github.com/apache/spark/pull/24159#issuecomment-475092826
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/103741/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins removed a comment on issue #24159: [SPARK-27221][SQL]Improve the assert error message in TreeNode.parseToJson

2019-03-20 Thread GitBox

AmplabJenkins removed a comment on issue #24159: [SPARK-27221][SQL]Improve the 
assert error message in TreeNode.parseToJson
URL: https://github.com/apache/spark/pull/24159#issuecomment-475092823
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins commented on issue #24159: [SPARK-27221][SQL]Improve the assert error message in TreeNode.parseToJson

2019-03-20 Thread GitBox

AmplabJenkins commented on issue #24159: [SPARK-27221][SQL]Improve the assert 
error message in TreeNode.parseToJson
URL: https://github.com/apache/spark/pull/24159#issuecomment-475092823
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] maropu commented on issue #24047: [SPARK-25196][SQL] Extends the analyze column command for cached tables

2019-03-20 Thread GitBox

maropu commented on issue #24047: [SPARK-25196][SQL] Extends the analyze column 
command for cached tables 
URL: https://github.com/apache/spark/pull/24047#issuecomment-475092746
 
 
   Thanks for your active reviews, @dongjoon-hyun !


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] HyukjinKwon commented on issue #24162: [SPARK-27223][SQL] Remove private methods that skip conversion when passing user schemas for constructing a DataFrame

2019-03-20 Thread GitBox

HyukjinKwon commented on issue #24162: [SPARK-27223][SQL] Remove private 
methods that skip conversion when passing user schemas for constructing a 
DataFrame
URL: https://github.com/apache/spark/pull/24162#issuecomment-475092658
 
 
   Merged to master.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] SparkQA removed a comment on issue #24159: [SPARK-27221][SQL]Improve the assert error message in TreeNode.parseToJson

2019-03-20 Thread GitBox

SparkQA removed a comment on issue #24159: [SPARK-27221][SQL]Improve the assert 
error message in TreeNode.parseToJson
URL: https://github.com/apache/spark/pull/24159#issuecomment-475045867
 
 
   **[Test build #103741 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/103741/testReport)**
 for PR 24159 at commit 
[`ed3eea4`](https://github.com/apache/spark/commit/ed3eea4672b809da91904d2ffa4f48da5981dbe4).


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] maropu commented on a change in pull request #24047: [SPARK-25196][SQL] Extends the analyze column command for cached tables

2019-03-20 Thread GitBox

maropu commented on a change in pull request #24047: [SPARK-25196][SQL] Extends 
the analyze column command for cached tables 
URL: https://github.com/apache/spark/pull/24047#discussion_r267608769
 
 

 ##
 File path: 
sql/core/src/main/scala/org/apache/spark/sql/execution/command/AnalyzeColumnCommand.scala
 ##
 @@ -39,32 +40,39 @@ case class AnalyzeColumnCommand(
 require(columnNames.isDefined ^ allColumns, "Parameter `columnNames` or 
`allColumns` are " +
   "mutually exclusive. Only one of them should be specified.")
 val sessionState = sparkSession.sessionState
-val db = 
tableIdent.database.getOrElse(sessionState.catalog.getCurrentDatabase)
-val tableIdentWithDB = TableIdentifier(tableIdent.table, Some(db))
-val tableMeta = sessionState.catalog.getTableMetadata(tableIdentWithDB)
-if (tableMeta.tableType == CatalogTableType.VIEW) {
-  throw new AnalysisException("ANALYZE TABLE is not supported on views.")
-}
-val sizeInBytes = CommandUtils.calculateTotalSize(sparkSession, tableMeta)
-val relation = sparkSession.table(tableIdent).logicalPlan
-val columnsToAnalyze = getColumnsToAnalyze(tableIdent, relation, 
columnNames, allColumns)
-
-// Compute stats for the computed list of columns.
-val (rowCount, newColStats) =
-  CommandUtils.computeColumnStats(sparkSession, relation, columnsToAnalyze)
 
-// We also update table-level stats in order to keep them consistent with 
column-level stats.
-val statistics = CatalogStatistics(
-  sizeInBytes = sizeInBytes,
-  rowCount = Some(rowCount),
-  // Newly computed column stats should override the existing ones.
-  colStats = tableMeta.stats.map(_.colStats).getOrElse(Map.empty) ++ 
newColStats)
-
-sessionState.catalog.alterTableStats(tableIdentWithDB, Some(statistics))
+tableIdent.database match {
+  case Some(db) if db == 
sparkSession.sharedState.globalTempViewManager.database =>
+val plan = 
sessionState.catalog.getGlobalTempView(tableIdent.identifier).getOrElse {
+  throw new NoSuchTableException(db = db, table = 
tableIdent.identifier)
+}
+analyzeColumnInTempView(plan, sparkSession)
+  case Some(_) =>
+analyzeColumnInCatalog(sparkSession)
+  case None =>
+sessionState.catalog.getTempView(tableIdent.identifier) match {
+  case Some(tempView) => analyzeColumnInTempView(tempView, 
sparkSession)
+  case _ => analyzeColumnInCatalog(sparkSession)
+}
+}
 
 Seq.empty[Row]
   }
 
+  private def analyzeColumnInTempView(plan: LogicalPlan, sparkSession: 
SparkSession): Unit = {
+val cacheManager = sparkSession.sharedState.cacheManager
+cacheManager.lookupCachedData(plan) match {
+  case Some(cachedData) =>
+val columnsToAnalyze = getColumnsToAnalyze(
+  tableIdent, cachedData.plan, columnNames, allColumns)
+cacheManager.analyzeColumnCacheQuery(sparkSession, cachedData, 
columnsToAnalyze)
+  case _ =>
+val catalog = sparkSession.sessionState.catalog
+val db = tableIdent.database.getOrElse(catalog.getCurrentDatabase)
+throw new NoSuchTableException(db = db, table = tableIdent.identifier)
 
 Review comment:
   Since the existing tests expect this exception, I kept as it is:
   
https://github.com/apache/spark/blob/d6ee2f331db461c1f7a25e0ef17901f53d8b707e/sql/core/src/test/scala/org/apache/spark/sql/execution/SQLViewSuite.scala#L163
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] SparkQA commented on issue #24159: [SPARK-27221][SQL]Improve the assert error message in TreeNode.parseToJson

2019-03-20 Thread GitBox

SparkQA commented on issue #24159: [SPARK-27221][SQL]Improve the assert error 
message in TreeNode.parseToJson
URL: https://github.com/apache/spark/pull/24159#issuecomment-475092527
 
 
   **[Test build #103741 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/103741/testReport)**
 for PR 24159 at commit 
[`ed3eea4`](https://github.com/apache/spark/commit/ed3eea4672b809da91904d2ffa4f48da5981dbe4).
* This patch passes all tests.
* This patch merges cleanly.
* This patch adds no public classes.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] wangjiaochun commented on issue #24127: [SPARK-27186][CORE] optimize sortshuffle write

2019-03-20 Thread GitBox

wangjiaochun commented on issue #24127: [SPARK-27186][CORE] optimize 
sortshuffle write
URL: https://github.com/apache/spark/pull/24127#issuecomment-475091856
 
 
   The underlying Iterator do nothing.
   so I think it's worth revising. first, ExternalSorter-instantiated is 
expensive operation, such as there is need to 
   estimate the size of all the collection of the Instances. Also create tmp 
file is needless.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] SparkQA commented on issue #24047: [SPARK-25196][SQL] Extends the analyze column command for cached tables

2019-03-20 Thread GitBox

SparkQA commented on issue #24047: [SPARK-25196][SQL] Extends the analyze 
column command for cached tables 
URL: https://github.com/apache/spark/pull/24047#issuecomment-475091728
 
 
   **[Test build #103752 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/103752/testReport)**
 for PR 24047 at commit 
[`c6f0fb5`](https://github.com/apache/spark/commit/c6f0fb5e7d0a9b5ede73ec058894c65ee238).


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins removed a comment on issue #24047: [SPARK-25196][SQL] Extends the analyze column command for cached tables

2019-03-20 Thread GitBox

AmplabJenkins removed a comment on issue #24047: [SPARK-25196][SQL] Extends the 
analyze column command for cached tables 
URL: https://github.com/apache/spark/pull/24047#issuecomment-475091455
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution-unified/9130/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] AmplabJenkins commented on issue #24047: [SPARK-25196][SQL] Extends the analyze column command for cached tables

2019-03-20 Thread GitBox

AmplabJenkins commented on issue #24047: [SPARK-25196][SQL] Extends the analyze 
column command for cached tables 
URL: https://github.com/apache/spark/pull/24047#issuecomment-475091453
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] beliefer edited a comment on issue #24138: [MINOR][SQL]Put the grammar of database together, because this is good for maintenance and readability.

2019-03-20 Thread GitBox

beliefer edited a comment on issue #24138: [MINOR][SQL]Put the grammar of 
database together, because this is good for maintenance and readability.
URL: https://github.com/apache/spark/pull/24138#issuecomment-475090967
 
 
   @srowen The test build not found. How to retrigger jenkins to build?
   The url 
'https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/4637/testReport'
 is 404.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

1 2 3 4 5 6 7 8 9 >

1 - 100 of 845 matches

Mail list logo