[GitHub] spark pull request: [SPARK-14669] [SQL] Fix some SQL metrics in co...
Github user ericl commented on the pull request: https://github.com/apache/spark/pull/12425#issuecomment-210741272 I slightly prefer to have `dataSize` in the following stage so all the relevant metrics are together, but having it in Exchange seems ok too. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14669] [SQL] Fix some SQL metrics in co...
Github user ericl commented on a diff in the pull request: https://github.com/apache/spark/pull/12425#discussion_r59963085 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/exchange/ShuffleExchange.scala --- @@ -78,8 +82,15 @@ case class ShuffleExchange( * the returned ShuffleDependency will be the input of shuffle. */ private[sql] def prepareShuffleDependency(): ShuffleDependency[Int, InternalRow, InternalRow] = { -ShuffleExchange.prepareShuffleDependency( - child.execute(), child.output, newPartitioning, serializer) +val dataSize = longMetric("dataSize") +val rdd = child.execute().mapPartitionsInternal { iter => + val localDataSize = dataSize.localValue + iter.map { row => +localDataSize.add(row.asInstanceOf[UnsafeRow].getSizeInBytes) --- End diff -- Isn't this iteration over each row a significant added overhead? Seems it would be better to count the data size in bulk instead where the sort is done. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14487][SQL] User Defined Type registrat...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12259#issuecomment-210740750 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14487][SQL] User Defined Type registrat...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12259#issuecomment-210740751 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/55990/ Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14487][SQL] User Defined Type registrat...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12259#issuecomment-210740663 **[Test build #55990 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/55990/consoleFull)** for PR 12259 at commit [`5ecfd63`](https://github.com/apache/spark/commit/5ecfd63352b0bf0312cf7990c331ec88eb7fa05e). * This patch passes all tests. * This patch merges cleanly. * This patch adds no public classes. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14609][SQL] Native support for LOAD DAT...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12412#issuecomment-210738816 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/55992/ Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14609][SQL] Native support for LOAD DAT...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12412#issuecomment-210738814 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14609][SQL] Native support for LOAD DAT...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12412#issuecomment-210738808 **[Test build #55992 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/55992/consoleFull)** for PR 12412 at commit [`6a97a9b`](https://github.com/apache/spark/commit/6a97a9baf1f0a67e60e92a17b52b401f98509b7c). * This patch **fails Spark unit tests**. * This patch merges cleanly. * This patch adds no public classes. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14669] [SQL] Fix some SQL metrics in co...
Github user davies commented on the pull request: https://github.com/apache/spark/pull/12425#issuecomment-210736839 @ericl Exchange has `dataSize`, should that be enough? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14609][SQL] Native support for LOAD DAT...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12412#issuecomment-210736620 **[Test build #55992 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/55992/consoleFull)** for PR 12412 at commit [`6a97a9b`](https://github.com/apache/spark/commit/6a97a9baf1f0a67e60e92a17b52b401f98509b7c). --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14670][SQL] Allow updating SQLMetrics o...
Github user davies commented on a diff in the pull request: https://github.com/apache/spark/pull/12427#discussion_r59962714 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/SQLExecution.scala --- @@ -52,8 +52,14 @@ private[sql] object SQLExecution { try { body } finally { + val driverAccumUpdates = queryExecution.executedPlan +.collect { case plan: SparkPlan => plan } +.flatMap(_.metrics.values.toSeq) +.map { a => a.toInfo(Some(a.localValue), None) } --- End diff -- Should we filter out the metrics that have not be updated (zero value)? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14609][SQL] Native support for LOAD DAT...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12412#issuecomment-210734125 **[Test build #55991 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/55991/consoleFull)** for PR 12412 at commit [`e43055e`](https://github.com/apache/spark/commit/e43055e40d75273ac89ed4b53a6232886c7f3412). --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14580][SPARK-14655][SQL] Hive IfCoercio...
Github user rxin commented on the pull request: https://github.com/apache/spark/pull/12340#issuecomment-210729011 Yea I'd say we just support boolean type here for now. If people run into issue with this we can add other things in the future. I'd much rather have a simpler implementation for esoteric features. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14487][SQL] User Defined Type registrat...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12259#issuecomment-210727168 **[Test build #55990 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/55990/consoleFull)** for PR 12259 at commit [`5ecfd63`](https://github.com/apache/spark/commit/5ecfd63352b0bf0312cf7990c331ec88eb7fa05e). --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14580][SPARK-14655][SQL] Hive IfCoercio...
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/12340#discussion_r59961811 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/misc.scala --- @@ -487,6 +487,43 @@ case class PrintToStderr(child: Expression) extends UnaryExpression { } /** + * A function throws an exception if 'condition' is not true. + */ +@ExpressionDescription( + usage = "_FUNC_(condition) - Throw an exception if 'condition' is not true (false, 0, null, '').") +case class AssertTrue(child: Expression) extends UnaryExpression { + + override def nullable: Boolean = true + + def dataType: DataType = NullType --- End diff -- override? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14647][SQL] Group SQLContext/HiveContex...
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/12405#discussion_r59961810 --- Diff: sql/hive/src/main/scala/org/apache/spark/sql/hive/HiveSharedState.scala --- @@ -0,0 +1,52 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under one or more + * contributor license agreements. See the NOTICE file distributed with + * this work for additional information regarding copyright ownership. + * The ASF licenses this file to You under the Apache License, Version 2.0 + * (the "License"); you may not use this file except in compliance with + * the License. You may obtain a copy of the License at + * + *http://www.apache.org/licenses/LICENSE-2.0 + * + * Unless required by applicable law or agreed to in writing, software + * distributed under the License is distributed on an "AS IS" BASIS, + * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. + * See the License for the specific language governing permissions and + * limitations under the License. + */ + +package org.apache.spark.sql.hive + +import org.apache.spark.SparkContext +import org.apache.spark.sql.hive.client.{HiveClient, HiveClientImpl} +import org.apache.spark.sql.internal.SharedState + + +/** + * A class that holds all state shared across sessions in a given [[HiveContext]]. + */ +private[hive] class HiveSharedState(override val sparkContext: SparkContext) + extends SharedState(sparkContext) { + + // TODO: just share the IsolatedClientLoader instead of the client instances themselves + + /** + * A Hive client used for execution. + */ + val executionHive: HiveClientImpl = { +HiveContext.newClientForExecution(sparkContext.conf, sparkContext.hadoopConfiguration) + } + + /** + * A Hive client used to interact with the metastore. + */ + lazy val metadataHive: HiveClient = { --- End diff -- Let's explain why we need to make it a lazy val at here. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14487][SQL] User Defined Type registrat...
Github user viirya commented on the pull request: https://github.com/apache/spark/pull/12259#issuecomment-210726962 retest this please. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14487][SQL] User Defined Type registrat...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12259#issuecomment-210726369 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14487][SQL] User Defined Type registrat...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12259#issuecomment-210726370 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/55988/ Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14577][SQL] Add spark.sql.codegen.maxCa...
Github user rxin commented on the pull request: https://github.com/apache/spark/pull/12353#issuecomment-210726342 @dongjoon-hyun what's your idea on how to update this? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14487][SQL] User Defined Type registrat...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12259#issuecomment-210726294 **[Test build #55988 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/55988/consoleFull)** for PR 12259 at commit [`5ecfd63`](https://github.com/apache/spark/commit/5ecfd63352b0bf0312cf7990c331ec88eb7fa05e). * This patch **fails PySpark unit tests**. * This patch merges cleanly. * This patch adds no public classes. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [WIP, DO-NOT-MERGE][SQL][Added support for par...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12409#issuecomment-210726302 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/55989/ Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [WIP, DO-NOT-MERGE][SQL][Added support for par...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12409#issuecomment-210726301 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [WIP, DO-NOT-MERGE][SQL][Added support for par...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12409#issuecomment-210726128 **[Test build #55989 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/55989/consoleFull)** for PR 12409 at commit [`d6bca8d`](https://github.com/apache/spark/commit/d6bca8d1c069cb6021b5ef3f3360eac860c72093). * This patch passes all tests. * This patch merges cleanly. * This patch adds no public classes. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14677][SQL] Make the max number of iter...
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/12434 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [MINOR] Remove inappropriate type notation and...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12413#issuecomment-210725929 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/55987/ Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [MINOR] Remove inappropriate type notation and...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12413#issuecomment-210725927 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [MINOR] Remove inappropriate type notation and...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12413#issuecomment-210725881 **[Test build #55987 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/55987/consoleFull)** for PR 12413 at commit [`08de54d`](https://github.com/apache/spark/commit/08de54d0101715e1742f34cbcad66712e1a5793b). * This patch **fails Spark unit tests**. * This patch merges cleanly. * This patch adds no public classes. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14677][SQL] Make the max number of iter...
Github user yhuai commented on the pull request: https://github.com/apache/spark/pull/12434#issuecomment-210725817 LGTM. Merging to master! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14679] [UI] Fix UI DAG visualization OO...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12437#issuecomment-210725720 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/55986/ Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14679] [UI] Fix UI DAG visualization OO...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12437#issuecomment-210725718 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14679] [UI] Fix UI DAG visualization OO...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12437#issuecomment-210725635 **[Test build #55986 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/55986/consoleFull)** for PR 12437 at commit [`e0bda13`](https://github.com/apache/spark/commit/e0bda13cb037649c2b9dd7247b3b90e76ae20cbc). * This patch passes all tests. * This patch merges cleanly. * This patch adds no public classes. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14676] Wrap and re-throw Await.result e...
Github user JoshRosen commented on the pull request: https://github.com/apache/spark/pull/12433#issuecomment-210725264 Hmm, this is unfortunate. It looks like there are a bunch more assertions like the one in the `FutureAction` test suite which match on the class / name of the exception thrown from methods which call `Await.result`. I wonder whether this change of wrapping the exception is going to cause regressions in user code which tries to do the same thing. Should I try to use reflection to make a best-effort attempt to re-throw using the same exception? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14677][SQL] Make the max number of iter...
Github user rxin commented on the pull request: https://github.com/apache/spark/pull/12434#issuecomment-210722539 cc @yhuai --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: SPARK-14632 randomSplit method fails on datafr...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12438#issuecomment-210722477 Can one of the admins verify this patch? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: SPARK-14632 randomSplit method fails on datafr...
GitHub user sbcd90 opened a pull request: https://github.com/apache/spark/pull/12438 SPARK-14632 randomSplit method fails on dataframes with maps in schema ## What changes were proposed in this pull request? The patch fixes the issue with the randomSplit method which is not able to split dataframes which has maps in schema. The bug was introduced in spark 1.6.1. ## How was this patch tested? Tested with unit tests. (If this patch involves UI changes, please attach a screenshot; otherwise, remove this) You can merge this pull request into a Git repository by running: $ git pull https://github.com/sbcd90/spark randomSplitIssue Alternatively you can review and apply these changes as the patch at: https://github.com/apache/spark/pull/12438.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #12438 commit f8ddfebcabe0c2ce5e828cd0db93d70f8dd59504 Author: Subhobrata DeyDate: 2016-04-16T02:26:22Z SPARK-14632 randomSplit method fails on dataframes with maps in schema --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14676] Wrap and re-throw Await.result e...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12433#issuecomment-210721724 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/55985/ Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14676] Wrap and re-throw Await.result e...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12433#issuecomment-210721722 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14676] Wrap and re-throw Await.result e...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12433#issuecomment-210721684 **[Test build #55985 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/55985/consoleFull)** for PR 12433 at commit [`975bbac`](https://github.com/apache/spark/commit/975bbacf06e1d089dd16ef2140dbb71c141a56cd). * This patch **fails Spark unit tests**. * This patch merges cleanly. * This patch adds no public classes. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [Spark-13772][SQL] fix data type mismatch for ...
Github user cenyuhai closed the pull request at: https://github.com/apache/spark/pull/11605 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14677][SQL] Make the max number of iter...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12434#issuecomment-210720027 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/55984/ Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14677][SQL] Make the max number of iter...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12434#issuecomment-210720023 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14677][SQL] Make the max number of iter...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12434#issuecomment-210719503 **[Test build #55984 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/55984/consoleFull)** for PR 12434 at commit [`a7fbf87`](https://github.com/apache/spark/commit/a7fbf878961deb27d3d7a93e79b5b0b9fb1080a2). * This patch passes all tests. * This patch merges cleanly. * This patch adds no public classes. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14603] [SQL] [WIP] Verification of Meta...
Github user gatorsmile commented on the pull request: https://github.com/apache/spark/pull/12385#issuecomment-210715151 Thank you! The work requires more and more code changes. Let me split the work into two parts. One is for handling partitioning-related PR; another is for handling the remaining parts. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [WIP, DO-NOT-MERGE][SQL][Added support for par...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12409#issuecomment-210715004 **[Test build #55989 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/55989/consoleFull)** for PR 12409 at commit [`d6bca8d`](https://github.com/apache/spark/commit/d6bca8d1c069cb6021b5ef3f3360eac860c72093). --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14487][SQL] User Defined Type registrat...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12259#issuecomment-210714759 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/55983/ Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14487][SQL] User Defined Type registrat...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12259#issuecomment-210714755 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [WIP, DO-NOT-MERGE][SQL][Added support for par...
Github user tdas commented on the pull request: https://github.com/apache/spark/pull/12409#issuecomment-210714466 @marmbrus Updated with tests and more docs. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14487][SQL] User Defined Type registrat...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12259#issuecomment-210714382 **[Test build #55983 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/55983/consoleFull)** for PR 12259 at commit [`5ecfd63`](https://github.com/apache/spark/commit/5ecfd63352b0bf0312cf7990c331ec88eb7fa05e). * This patch **fails Spark unit tests**. * This patch merges cleanly. * This patch adds no public classes. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14647][SQL] Group SQLContext/HiveContex...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12405#issuecomment-210710017 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14647][SQL] Group SQLContext/HiveContex...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12405#issuecomment-210710018 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/55982/ Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14647][SQL] Group SQLContext/HiveContex...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12405#issuecomment-210709916 **[Test build #55982 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/55982/consoleFull)** for PR 12405 at commit [`2a84b8c`](https://github.com/apache/spark/commit/2a84b8c6bc5e62bd4eaee03c7623f3095c3b9698). * This patch passes all tests. * This patch merges cleanly. * This patch adds no public classes. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14487][SQL] User Defined Type registrat...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12259#issuecomment-210709842 **[Test build #55988 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/55988/consoleFull)** for PR 12259 at commit [`5ecfd63`](https://github.com/apache/spark/commit/5ecfd63352b0bf0312cf7990c331ec88eb7fa05e). --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [MINOR] Remove inappropriate type notation and...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12413#issuecomment-210709838 **[Test build #55987 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/55987/consoleFull)** for PR 12413 at commit [`08de54d`](https://github.com/apache/spark/commit/08de54d0101715e1742f34cbcad66712e1a5793b). --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14487][SQL] User Defined Type registrat...
Github user viirya commented on the pull request: https://github.com/apache/spark/pull/12259#issuecomment-210709655 retest this please. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [MINOR] Remove inappropriate type notation and...
Github user HyukjinKwon commented on the pull request: https://github.com/apache/spark/pull/12413#issuecomment-210709624 retest this please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [MINOR] Remove inappropriate type notation and...
Github user HyukjinKwon commented on the pull request: https://github.com/apache/spark/pull/12413#issuecomment-210709586 Am I doing someyhing wrong? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14679] [UI] Fix UI DAG visualization OO...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12437#issuecomment-210704904 **[Test build #55986 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/55986/consoleFull)** for PR 12437 at commit [`e0bda13`](https://github.com/apache/spark/commit/e0bda13cb037649c2b9dd7247b3b90e76ae20cbc). --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [MINOR] Remove inappropriate type notation and...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12413#issuecomment-210704288 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [MINOR] Remove inappropriate type notation and...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12413#issuecomment-210704293 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/55978/ Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: SPARK-14679: Fix UI DAG visualization OOM.
GitHub user rdblue opened a pull request: https://github.com/apache/spark/pull/12437 SPARK-14679: Fix UI DAG visualization OOM. ## What changes were proposed in this pull request? The DAG visualization can cause an OOM when generating the DOT file. This happens because clusters are not correctly deduped by a contains check because they use the default equals implementation. This adds a working equals implementation. ## How was this patch tested? This adds a test suite that checks the new equals implementation. You can merge this pull request into a Git repository by running: $ git pull https://github.com/rdblue/spark SPARK-14679-fix-ui-oom Alternatively you can review and apply these changes as the patch at: https://github.com/apache/spark/pull/12437.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #12437 commit e0bda13cb037649c2b9dd7247b3b90e76ae20cbc Author: Ryan BlueDate: 2016-04-16T01:22:16Z SPARK-14679: Fix UI DAG visualization OOM. The DAG visualization can cause an OOM when generating the DOT file. This happens because clusters are not correctly deduped by a contains check because they use the default equals implementation. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [MINOR] Remove inappropriate type notation and...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12413#issuecomment-210704082 **[Test build #55978 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/55978/consoleFull)** for PR 12413 at commit [`08de54d`](https://github.com/apache/spark/commit/08de54d0101715e1742f34cbcad66712e1a5793b). * This patch **fails Spark unit tests**. * This patch merges cleanly. * This patch adds no public classes. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14674][SQL] Move HiveContext.hiveconf t...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12431#issuecomment-210703329 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14674][SQL] Move HiveContext.hiveconf t...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12431#issuecomment-210703335 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/55980/ Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14674][SQL] Move HiveContext.hiveconf t...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12431#issuecomment-210703057 **[Test build #55980 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/55980/consoleFull)** for PR 12431 at commit [`1436889`](https://github.com/apache/spark/commit/1436889944a29bd431c275251011cd2f7cd2bb74). * This patch passes all tests. * This patch merges cleanly. * This patch adds no public classes. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-13602][CORE] Add shutdown hook to Drive...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/11746#issuecomment-210702183 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-13602][CORE] Add shutdown hook to Drive...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/11746#issuecomment-210702185 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/55974/ Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-13602][CORE] Add shutdown hook to Drive...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/11746#issuecomment-210702083 **[Test build #55974 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/55974/consoleFull)** for PR 11746 at commit [`d8effc2`](https://github.com/apache/spark/commit/d8effc2bb2a1c0bc7bc481b2172e5fcc8a56efdc). * This patch passes all tests. * This patch merges cleanly. * This patch adds no public classes. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14677][SQL] Make the max number of iter...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12434#issuecomment-210701964 **[Test build #55984 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/55984/consoleFull)** for PR 12434 at commit [`a7fbf87`](https://github.com/apache/spark/commit/a7fbf878961deb27d3d7a93e79b5b0b9fb1080a2). --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14676] Wrap and re-throw Await.result e...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12433#issuecomment-210701963 **[Test build #55985 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/55985/consoleFull)** for PR 12433 at commit [`975bbac`](https://github.com/apache/spark/commit/975bbacf06e1d089dd16ef2140dbb71c141a56cd). --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14676] Wrap and re-throw Await.result e...
Github user JoshRosen commented on the pull request: https://github.com/apache/spark/pull/12433#issuecomment-210701761 I've decided to just leave the occurrence in `FutureAction` untouched for now. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [WIP, DO-NOT-MERGE][SQL][Added support for par...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12409#issuecomment-210701177 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/55979/ Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [WIP, DO-NOT-MERGE][SQL][Added support for par...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12409#issuecomment-210701176 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [WIP, DO-NOT-MERGE][SQL][Added support for par...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12409#issuecomment-210700821 **[Test build #55979 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/55979/consoleFull)** for PR 12409 at commit [`916ef94`](https://github.com/apache/spark/commit/916ef948961ba0321da987f4c6211dcd5db9e47b). * This patch passes all tests. * This patch merges cleanly. * This patch adds no public classes. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14676] Wrap and re-throw Await.result e...
Github user JoshRosen commented on the pull request: https://github.com/apache/spark/pull/12433#issuecomment-210700398 Hmm, those are legitimate test failures. The problem was that wrapping something in `Exception` changed the exception matched by some test asserts. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14668] [SQL] Move CurrentDatabase to Ca...
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/12424 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14668] [SQL] Move CurrentDatabase to Ca...
Github user rxin commented on the pull request: https://github.com/apache/spark/pull/12424#issuecomment-210699216 Merging in master. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14668] [SQL] Move CurrentDatabase to Ca...
Github user rxin commented on the pull request: https://github.com/apache/spark/pull/12424#issuecomment-210699209 This conflicts with mine - but I can redo mine https://github.com/apache/spark/pull/12434 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14676] Wrap and re-throw Await.result e...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12433#issuecomment-210699178 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14676] Wrap and re-throw Await.result e...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12433#issuecomment-210699179 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/55972/ Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14676] Wrap and re-throw Await.result e...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12433#issuecomment-210699113 **[Test build #55972 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/55972/consoleFull)** for PR 12433 at commit [`23e7467`](https://github.com/apache/spark/commit/23e7467de7ba39b16dc92003c842ac1156f8). * This patch **fails Spark unit tests**. * This patch merges cleanly. * This patch adds no public classes. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14678][SQL]Add a file sink log to suppo...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12435#issuecomment-210699054 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/55977/ Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14678][SQL]Add a file sink log to suppo...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12435#issuecomment-210699053 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14678][SQL]Add a file sink log to suppo...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12435#issuecomment-210698965 **[Test build #55977 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/55977/consoleFull)** for PR 12435 at commit [`29e3088`](https://github.com/apache/spark/commit/29e3088131beff4b36841a3d6544d399b0bfa0c2). * This patch passes all tests. * This patch merges cleanly. * This patch adds the following public classes _(experimental)_: * `case class FileLog(path: String, size: Long, action: String)` * `class FileStreamSinkLog(sqlContext: SQLContext, path: String)` --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14473][SQL] Define analysis rules to ca...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12246#issuecomment-210697628 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/55976/ Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14669] [SQL] Fix some SQL metrics in co...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12425#issuecomment-210697630 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/55970/ Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14669] [SQL] Fix some SQL metrics in co...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12425#issuecomment-210697626 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14473][SQL] Define analysis rules to ca...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12246#issuecomment-210697627 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14647][SQL] Group SQLContext/HiveContex...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12405#issuecomment-210697547 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14669] [SQL] Fix some SQL metrics in co...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12425#issuecomment-210697510 **[Test build #55970 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/55970/consoleFull)** for PR 12425 at commit [`696aafe`](https://github.com/apache/spark/commit/696aafee07ace0fb8142295e9954bdcd00e29061). * This patch passes all tests. * This patch merges cleanly. * This patch adds no public classes. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14473][SQL] Define analysis rules to ca...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12246#issuecomment-210697523 **[Test build #55976 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/55976/consoleFull)** for PR 12246 at commit [`8a71835`](https://github.com/apache/spark/commit/8a71835f4011a3570990669346a65dcea51adb4f). * This patch passes all tests. * This patch merges cleanly. * This patch adds no public classes. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14647][SQL] Group SQLContext/HiveContex...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12405#issuecomment-210697549 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/55973/ Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14647][SQL] Group SQLContext/HiveContex...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12405#issuecomment-210697414 **[Test build #55973 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/55973/consoleFull)** for PR 12405 at commit [`2a84b8c`](https://github.com/apache/spark/commit/2a84b8c6bc5e62bd4eaee03c7623f3095c3b9698). * This patch passes all tests. * This patch merges cleanly. * This patch adds no public classes. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14487][SQL] User Defined Type registrat...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12259#issuecomment-210697123 **[Test build #55983 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/55983/consoleFull)** for PR 12259 at commit [`5ecfd63`](https://github.com/apache/spark/commit/5ecfd63352b0bf0312cf7990c331ec88eb7fa05e). --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14569][ML] Log instrumentation in KMean...
Github user keypointt commented on a diff in the pull request: https://github.com/apache/spark/pull/12432#discussion_r59957731 --- Diff: mllib/src/main/scala/org/apache/spark/ml/clustering/KMeans.scala --- @@ -264,6 +264,9 @@ class KMeans @Since("1.5.0") ( override def fit(dataset: Dataset[_]): KMeansModel = { val rdd = dataset.select(col($(featuresCol))).rdd.map { case Row(point: Vector) => point } +val instr = Instrumentation.create(this, rdd) +instr.logParams(featuresCol, predictionCol, k, initMode, initSteps, maxIter, seed, tol) + val algo = new MLlibKMeans() --- End diff -- Thanks Timothy. I'm a starter on Spark sorry for being naive. I just want to confirm with you that I understand correctly. 1. for creating a new method `algo.run(rdd, instr)`, I just find I also need to create another method `runAlgorithm(zippedData, instr)` to take `instr` as a parameter https://github.com/apache/spark/blob/master/mllib/src/main/scala/org/apache/spark/mllib/clustering/KMeans.scala#L241 , since inside 'runAlgorithm' is the dimension we want https://github.com/apache/spark/blob/master/mllib/src/main/scala/org/apache/spark/mllib/clustering/KMeans.scala#L295 1. class 'Instrumentation' is private and in ml package, so it cannot be accessed from mllib package. So I have to change it to be public by removing `private[ml] `? https://github.com/apache/spark/blob/master/mllib/src/main/scala/org/apache/spark/ml/util/Instrumentation.scala#L42 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14647][SQL] Group SQLContext/HiveContex...
Github user yhuai commented on the pull request: https://github.com/apache/spark/pull/12405#issuecomment-210696867 looks like a legitimate failure? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14672][SQL] Move HiveContext analyze lo...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12429#issuecomment-210696519 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/55981/ Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-12177][Streaming][Kafka] Update KafkaDS...
Github user markgrover commented on the pull request: https://github.com/apache/spark/pull/11863#issuecomment-210696664 Thanks for this, @koeninger! This is looking really good. I have made some comments inline. I noticed that there wasn't a new example, a new KafkaUtils, python support or documentation. So, we should take care of those too. Also, I'd personally prefer a different name than kafka-beta, because the new consumer API will become GA some day and I would rather not have us rename the project then and break everyone. I would personally prefer calling it some thing like kafka-newapi (or something else, it's not that important). We can doc that this subproject is beta quality but as time goes and the subproject and kafka's new consumer API matures, we don't have to rename, just update the docs. And, I would definitely like to help out with the work you have started here. Whether it's implementing some of the comments (if you agree with them), or the left over things that I listed out earlier. Let me know what you think. Thanks again for working on this! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14672][SQL] Move HiveContext analyze lo...
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12429#issuecomment-210696516 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: [SPARK-14672][SQL] Move HiveContext analyze lo...
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12429#issuecomment-210696504 **[Test build #55981 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/55981/consoleFull)** for PR 12429 at commit [`fa80269`](https://github.com/apache/spark/commit/fa80269c1f35c942f0a4604854c01696f1788d2d). * This patch **fails Scala style tests**. * This patch merges cleanly. * This patch adds no public classes. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org