date:20150813

[GitHub] spark pull request: [SPARK-9882][Core] Priority-based scheduling f...

2015-08-13 Thread AmplabJenkins

Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8129#issuecomment-130551156
  
 Merged build triggered.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] spark pull request: [SPARK-9882][Core] Priority-based scheduling f...

2015-08-13 Thread AmplabJenkins

Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8129#issuecomment-130551177
  
Merged build started.



[GitHub] spark pull request: [SPARK-9882][Core] Priority-based scheduling f...

2015-08-13 Thread SparkQA

Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8129#issuecomment-130551460
  
  [Test build #40745 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/40745/consoleFull)
 for   PR 8129 at commit 
[`9298fa0`](https://github.com/apache/spark/commit/9298fa0577dae7018c4d2aaa58301ce5e340251a).



[GitHub] spark pull request: [SPARK-9877][Core] Fix StandaloneRestServer NP...

2015-08-13 Thread SparkQA

Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8127#issuecomment-130551430
  
  [Test build #40744 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/40744/consoleFull)
 for   PR 8127 at commit 
[`fdb6158`](https://github.com/apache/spark/commit/fdb6158f6439dce80bdbd01ef2e483265ca5eb84).



[GitHub] spark pull request: [SPARK-9740] [SPARK-9592] [SPARK-9210] [SQL] C...

2015-08-13 Thread SparkQA

Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8113#issuecomment-130551263
  
  [Test build #1542 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/1542/console)
 for   PR 8113 at commit 
[`f828bdf`](https://github.com/apache/spark/commit/f828bdf1612a5fc9466b9a7e80700d0dd94faaf5).
 * This patch **fails Spark unit tests**.
 * This patch merges cleanly.
 * This patch adds the following public classes _(experimental)_:
  * `   * Set thresholds in multiclass (or binary) classification to adjust 
the probability of`
  * `case class First(child: Expression, ignoreNullsExpr: Expression) 
extends AlgebraicAggregate `
  * `case class Last(child: Expression, ignoreNullsExpr: Expression) 
extends AlgebraicAggregate `
  * `case class First(`
  * `case class FirstFunction(`
  * `case class Last(`
  * `case class LastFunction(`




[GitHub] spark pull request: [SPARK-9885] [SQL] Also pass barrierPrefixes a...

2015-08-13 Thread liancheng

Github user liancheng commented on the pull request:

https://github.com/apache/spark/pull/8158#issuecomment-130560469
  
Verified against my local MySQL-backed Hive 0.13.1 metastore, and it works. 
Merging to master and branch-1.5.



[GitHub] spark pull request: [SPARK-9661] [MLLib] [ML] Java compatibility

2015-08-13 Thread MechCoder

Github user MechCoder commented on the pull request:

https://github.com/apache/spark/pull/8126#issuecomment-130566232
  
@jkbradley I've addressed your comments. I'll take another pass over the 
generated docs to see whether there are other issues as well.



[GitHub] spark pull request: [SPARK-9661] [MLLib] [ML] Java compatibility

2015-08-13 Thread MechCoder

Github user MechCoder commented on a diff in the pull request:

https://github.com/apache/spark/pull/8126#discussion_r36947726
  
--- Diff: 
mllib/src/test/java/org/apache/spark/mllib/stat/JavaStatisticsSuite.java ---
@@ -53,4 +54,12 @@ public void testCorr() {
 // Check default method
 assertEquals(corr1, corr2);
   }
+
+  @Test
+  public void kolmogorovSmirnovTest() {
--- End diff --

done



[GitHub] spark pull request: [SPARK-9580] [SQL] Replace singletons in SQL t...

2015-08-13 Thread SparkQA

Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8111#issuecomment-130566054
  
  [Test build #1539 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/1539/console)
 for   PR 8111 at commit 
[`828144f`](https://github.com/apache/spark/commit/828144f96e4454824887c1d01fada20ce3510610).
 * This patch **fails PySpark unit tests**.
 * This patch merges cleanly.
 * This patch adds the following public classes _(experimental)_:
  * `  implicit class StringToColumn(val sc: StringContext) `




[GitHub] spark pull request: [SPARK-8125] [SQL] Backports PR #7396 to branc...

2015-08-13 Thread liancheng

Github user liancheng commented on the pull request:

https://github.com/apache/spark/pull/7664#issuecomment-130568125
  
@oliviertoupin Usually we only backport fixes for severe bugs to 
maintenance branches.



[GitHub] spark pull request: [SPARK-9757] [SQL] Fixes persistence of Parque...

2015-08-13 Thread liancheng

Github user liancheng commented on the pull request:

https://github.com/apache/spark/pull/8130#issuecomment-130570571
  
@yhuai @marmbrus Thanks for the review and for helping to fix it! I'm merging 
this to master and branch-1.5.



[GitHub] spark pull request: [SPARK-8887][SQL]Explicit define which data ty...

2015-08-13 Thread liancheng

Github user liancheng commented on a diff in the pull request:

https://github.com/apache/spark/pull/8132#discussion_r36949497
  
--- Diff: 
sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/PartitioningUtils.scala
 ---
@@ -270,6 +270,8 @@ private[sql] object PartitioningUtils {
   private val upCastingOrder: Seq[DataType] =
 Seq(NullType, IntegerType, LongType, FloatType, DoubleType, StringType)
 
+  val validPartitionColumnTypes: Set[DataType] = upCastingOrder.toSet
--- End diff --

BTW, I think you have to make `validPartitionColumnTypes` a method rather 
than a `Set[DataType]` since `DecimalType` is not a singleton.
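
A minimal sketch of why a fixed set falls short here, assuming decimals are meant to be accepted as partition column types. The types below are simplified stand-ins written for illustration, not Spark's real `DataType` classes, and `isValidPartitionColumnType` is a hypothetical name:

```scala
// Stand-in types for illustration only; these are NOT Spark's DataType classes.
sealed trait DataType
case object IntegerType extends DataType
case object LongType extends DataType
case object StringType extends DataType
case object DateType extends DataType
// Parameterized like a decimal type: each (precision, scale) pair is a distinct value.
final case class DecimalType(precision: Int, scale: Int) extends DataType

object PartitionTypeCheck {
  // A fixed Set can only list specific values, so it cannot cover every decimal.
  val validTypesAsSet: Set[DataType] = Set(IntegerType, LongType, StringType)

  // A method can match on the class and accept any precision and scale.
  def isValidPartitionColumnType(dataType: DataType): Boolean = dataType match {
    case IntegerType | LongType | StringType => true
    case _: DecimalType => true
    case _ => false
  }
}
```

With these stand-ins, `validTypesAsSet.contains(DecimalType(10, 2))` is false, while `isValidPartitionColumnType(DecimalType(10, 2))` is true.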



[GitHub] spark pull request: [SPARK-9767] Remove ConnectionManager.

2015-08-13 Thread SparkQA

Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8161#issuecomment-130572392
  
  [Test build #40748 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/40748/consoleFull)
 for   PR 8161 at commit 
[`bdf1d5e`](https://github.com/apache/spark/commit/bdf1d5e357fe423d356405acd0844991c652f150).



[GitHub] spark pull request: [SPARK-8887][SQL]Explicit define which data ty...

2015-08-13 Thread yjshen

Github user yjshen commented on a diff in the pull request:

https://github.com/apache/spark/pull/8132#discussion_r36950167
  
--- Diff: 
sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/ResolvedDataSource.scala
 ---
@@ -179,6 +179,13 @@ object ResolvedDataSource extends Logging {
       val fs = path.getFileSystem(sqlContext.sparkContext.hadoopConfiguration)
       path.makeQualified(fs.getUri, fs.getWorkingDirectory)
     }
+
+    partitionColumnsSchema(data.schema, partitionColumns).foreach { field =>
+      if (!PartitioningUtils.validPartitionColumnTypes.contains(field.dataType)) {
+        throw new AnalysisException(s"Cannot use ${field.dataType} for partition column")
+      }
+    }
--- End diff --

OK, I'll make this a function in `PartitioningUtils` that throws an 
`AnalysisException`.



[GitHub] spark pull request: [SPARK-9767] Remove ConnectionManager.

2015-08-13 Thread AmplabJenkins

Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8161#issuecomment-130575195
  
Merged build finished. Test FAILed.



[GitHub] spark pull request: [SPARK-9767] Remove ConnectionManager.

2015-08-13 Thread SparkQA

Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8161#issuecomment-130575160
  
  [Test build #40748 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/40748/console)
 for   PR 8161 at commit 
[`bdf1d5e`](https://github.com/apache/spark/commit/bdf1d5e357fe423d356405acd0844991c652f150).
 * This patch **fails MiMa tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.



[GitHub] spark pull request: [SPARK-9818][SQL][WIP]Revert SPARK-6136 to ena...

2015-08-13 Thread AmplabJenkins

Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8101#issuecomment-130586425
  
 Merged build triggered.



[GitHub] spark pull request: [SPARK-8887][SQL]Explicit define which data ty...

2015-08-13 Thread yjshen

Github user yjshen commented on the pull request:

https://github.com/apache/spark/pull/8132#issuecomment-130586521
  
Jenkins, retest this please.



[GitHub] spark pull request: [SPARK-9818][SQL][WIP]Revert SPARK-6136 to ena...

2015-08-13 Thread AmplabJenkins

Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8101#issuecomment-130586441
  
Merged build started.



[GitHub] spark pull request: [SPARK-8813][SQL] Combine files when there're ...

2015-08-13 Thread liancheng

Github user liancheng commented on a diff in the pull request:

https://github.com/apache/spark/pull/8125#discussion_r36946906
  
--- Diff: 
sql/core/src/main/scala/org/apache/spark/sql/sources/CombineSmallFile.scala ---
@@ -0,0 +1,43 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements.  See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License.  You may obtain a copy of the License at
+ *
+ *    http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.spark.sql.sources
+
+import org.apache.hadoop.fs.{FileStatus, FileSystem, Path}
+import org.apache.spark.rdd.RDD
+import org.apache.spark.sql.SQLContext
+
+object CombineSmallFile {
+  def combineWithFiles[T](rdd: RDD[T], sqlContext: SQLContext, inputFiles: Array[FileStatus])
+    : RDD[T] = {
+    if (sqlContext.conf.combineSmallFile) {
+      val totalLen = inputFiles.map { file =>
+        if (file.isDir) 0L else file.getLen
+      }.sum
+      val numPartitions = (totalLen / sqlContext.conf.splitSize + 1).toInt
+      rdd.coalesce(numPartitions)
--- End diff --

What if the Hadoop block size is configured to be larger (as many users do)?
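
To make the concern concrete, here is a small sketch of the partition arithmetic from the diff in isolation. The sizes are hypothetical example values, not defaults taken from Spark or Hadoop, and `CoalesceEstimate` is a name made up for this illustration:

```scala
object CoalesceEstimate {
  // Mirrors the arithmetic in the diff: integer division, then plus one.
  def numPartitions(totalLen: Long, splitSize: Long): Int =
    (totalLen / splitSize + 1).toInt

  val mb: Long = 1024L * 1024L

  // Hypothetical input: 1000 small files of 1 MB each.
  val totalLen: Long = 1000L * mb

  val with64Mb: Int = numPartitions(totalLen, 64L * mb)   // 1000 / 64 = 15, plus 1 = 16
  val with256Mb: Int = numPartitions(totalLen, 256L * mb) // 1000 / 256 = 3, plus 1 = 4
}
```

The result depends entirely on the configured split size, so if that setting does not track the Hadoop block size the coalesced partitions may not line up with blocks. Note also that the `+ 1` yields one extra partition when the total length is an exact multiple of the split size.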



[GitHub] spark pull request: [SPARK-9661] [MLLib] [ML] Java compatibility

2015-08-13 Thread AmplabJenkins

Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8126#issuecomment-130566469
  
 Merged build triggered.



[GitHub] spark pull request: [SPARK-9661] [MLLib] [ML] Java compatibility

2015-08-13 Thread AmplabJenkins

Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8126#issuecomment-130566485
  
Merged build started.



[GitHub] spark pull request: [SPARK-9580] [SQL] Replace singletons in SQL t...

2015-08-13 Thread SparkQA

Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8111#issuecomment-130567185
  
  [Test build #1543 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/1543/console)
 for   PR 8111 at commit 
[`c4d44c9`](https://github.com/apache/spark/commit/c4d44c9139eff45048d849311c539dadda54004c).
 * This patch **fails Spark unit tests**.
 * This patch merges cleanly.
 * This patch adds the following public classes _(experimental)_:
  * `trait Identifiable `
  * `class VectorUDT extends UserDefinedType[Vector] `
  * `  implicit class StringToColumn(val sc: StringContext) `




[GitHub] spark pull request: [SPARK-9580] [SQL] Replace singletons in SQL t...

2015-08-13 Thread SparkQA

Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8111#issuecomment-130567175
  
  [Test build #40737 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/40737/console)
 for   PR 8111 at commit 
[`828144f`](https://github.com/apache/spark/commit/828144f96e4454824887c1d01fada20ce3510610).
 * This patch **fails Spark unit tests**.
 * This patch merges cleanly.
 * This patch adds the following public classes _(experimental)_:
  * `trait Identifiable `
  * `class VectorUDT extends UserDefinedType[Vector] `
  * `  implicit class StringToColumn(val sc: StringContext) `




[GitHub] spark pull request: [SPARK-9580] [SQL] Replace singletons in SQL t...

2015-08-13 Thread AmplabJenkins

Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8111#issuecomment-130567224
  
Merged build finished. Test FAILed.



[GitHub] spark pull request: [SPARK-9661] [MLLib] [ML] Java compatibility

2015-08-13 Thread MechCoder

Github user MechCoder commented on the pull request:

https://github.com/apache/spark/pull/8126#issuecomment-130570175
  
OK. I gave the diff another pass. That seems to be it.



[GitHub] spark pull request: [SPARK-8887][SQL]Explicit define which data ty...

2015-08-13 Thread liancheng

Github user liancheng commented on a diff in the pull request:

https://github.com/apache/spark/pull/8132#discussion_r36949417
  
--- Diff: 
sql/hive/src/test/scala/org/apache/spark/sql/sources/hadoopFsRelationSuites.scala
 ---
@@ -554,6 +556,21 @@ abstract class HadoopFsRelationTest extends QueryTest with SQLTestUtils {
       clonedConf.foreach(entry => configuration.set(entry.getKey, entry.getValue))
     }
   }
+
+  test("SPARK-8887: Explicitly define which data types can be used as dynamic partition columns") {
+    val df = Seq(
+      (1, "v1", Date.valueOf("2015-08-10")),
+      (2, "v2", Date.valueOf("2015-08-11")),
+      (3, "v3", Date.valueOf("2015-08-12"))).toDF("a", "b", "c")
+    withTempDir { file =>
+      intercept[AnalysisException] {
+        df.write.format(dataSourceName).partitionBy("c").save(file.getCanonicalPath)
+      }
+    }
+    intercept[AnalysisException] {
+      df.write.format(dataSourceName).partitionBy("c").saveAsTable("t")
+    }
--- End diff --

Please wrap this block with `withTable("t") { ... }` so that `t` gets 
dropped.
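
For readers unfamiliar with the pattern, a helper like this typically runs the block and cleans up in a `finally` clause. The sketch below is a plain-Scala stand-in written for illustration; the in-memory `tables` set is a hypothetical substitute for a catalog, and this is not the code of the test helper in Spark:

```scala
import scala.collection.mutable

object WithTableSketch {
  // Hypothetical stand-in for a table catalog; not Spark's API.
  val tables: mutable.Set[String] = mutable.Set.empty

  // Runs `body`, then drops the named tables even if `body` throws.
  def withTable(names: String*)(body: => Unit): Unit =
    try body
    finally names.foreach(name => tables.remove(name))
}
```

Because the cleanup sits in `finally`, a table created inside the block is removed whether the assertions in the block pass or fail, so later tests do not see a leftover `t`.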



[GitHub] spark pull request: [SPARK-9934] Deprecate NIO ConnectionManager.

2015-08-13 Thread AmplabJenkins

Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8162#issuecomment-130574356
  
Merged build started.



[GitHub] spark pull request: [SPARK-9934] Deprecate NIO ConnectionManager.

2015-08-13 Thread AmplabJenkins

Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8162#issuecomment-130574344
  
 Merged build triggered.



[GitHub] spark pull request: [SPARK-8887][SQL]Explicit define which data ty...

2015-08-13 Thread AmplabJenkins

Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8132#issuecomment-130586461
  
Merged build finished. Test FAILed.



[GitHub] spark pull request: [SPARK-8531] [ML] Update ML user guide for Min...

2015-08-13 Thread hhbyyh

Github user hhbyyh commented on the pull request:

https://github.com/apache/spark/pull/7211#issuecomment-130551690
  
@jkbradley Thanks for the review. 
I'm not sure whether the LaTeX part looks good.
As for the Python documentation, the Python interface for MinMaxScaler is still 
under review, so I didn't add an example.



[GitHub] spark pull request: [SPARK-9882][Core] Priority-based scheduling f...

2015-08-13 Thread SparkQA

Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8129#issuecomment-130551698
  
  [Test build #40745 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/40745/console)
 for   PR 8129 at commit 
[`9298fa0`](https://github.com/apache/spark/commit/9298fa0577dae7018c4d2aaa58301ce5e340251a).
 * This patch **fails Scala style tests**.
 * This patch merges cleanly.
 * This patch adds the following public classes _(experimental)_:
  * `case class ApplicationSubmission(val appInfo: ApplicationInfo, val 
submittedTime: Date)`
  * `  class Pool(val poolName: String, val priority: Int, val cores: Int) `




[GitHub] spark pull request: [SPARK-9882][Core] Priority-based scheduling f...

2015-08-13 Thread AmplabJenkins

Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8129#issuecomment-130551699
  
Merged build finished. Test FAILed.



[GitHub] spark pull request: [SPARK-9182][SQL]Filters are not passed throug...

2015-08-13 Thread marmbrus

Github user marmbrus commented on a diff in the pull request:

https://github.com/apache/spark/pull/8049#discussion_r36947567
  
--- Diff: 
sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/DataSourceStrategy.scala
 ---
@@ -343,31 +343,57 @@ private[sql] object DataSourceStrategy extends Strategy with Logging {
    * and convert them.
    */
   protected[sql] def selectFilters(filters: Seq[Expression]) = {
+    import CatalystTypeConverters._
+
     def translate(predicate: Expression): Option[Filter] = predicate match {
       case expressions.EqualTo(a: Attribute, Literal(v, _)) =>
         Some(sources.EqualTo(a.name, v))
       case expressions.EqualTo(Literal(v, _), a: Attribute) =>
         Some(sources.EqualTo(a.name, v))
+      case expressions.EqualTo(Cast(a: Attribute, _), l: Literal) =>
+        Some(sources.EqualTo(a.name, convertToScala(Cast(l, a.dataType).eval(), a.dataType)))
+      case expressions.EqualTo(l: Literal, Cast(a: Attribute, _)) =>
+        Some(sources.EqualTo(a.name, convertToScala(Cast(l, a.dataType).eval(), a.dataType)))
--- End diff --

No, given the possible trickiness here, I think we should bump the fix to 
1.6.
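
The thread does not spell out which cases are tricky. One possible illustration, in plain Scala rather than Catalyst, is a lossy cast: if the predicate is `CAST(a AS DOUBLE) = 1.5` on an integer column, narrowing the literal to the column type changes the result. The object and value names below are hypothetical.

```scala
object CastPushdown {
  // Column `a` holds integers; the predicate is CAST(a AS DOUBLE) = 1.5.
  val column: Seq[Int] = Seq(1, 2, 3)
  val literal: Double = 1.5

  // Original semantics: widen each column value and compare as doubles.
  val expected: Seq[Int] = column.filter(_.toDouble == literal)

  // Naive rewrite: narrow the literal to the column type and push `a = 1` down.
  val pushedDown: Seq[Int] = column.filter(_ == literal.toInt)
}
```

Here `expected` is empty while `pushedDown` contains `1`, so the rewritten filter would return a row the original predicate rejects.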



[GitHub] spark pull request: [SPARK-8813][SQL] Combine files when there're ...

2015-08-13 Thread liancheng

Github user liancheng commented on the pull request:

https://github.com/apache/spark/pull/8125#issuecomment-130566869
  
@watermen The use case you mentioned totally makes sense. However, I think 
people usually choose to compact fine-grained files into fewer, much larger 
files as time goes by. A more reasonable solution might be:

1. Save the most recent hot data (say 1 hr) every 5 min in simple file 
formats like CSV or JSON.

   These files tend to be pretty small, and I'd assume that complex 
columnar formats like ORC and Parquet generally don't give you much performance 
benefit on the read path, while you still pay their costs, like larger 
memory footprints and lower speed on the write path (these costs are related 
more to the width of the table than to the number of rows).

2. Compact outdated data periodically (say every few hours) into fewer, much 
larger data files in analytics-friendly formats like ORC and Parquet.

   In this way you avoid reading a large number of small files and enjoy 
the performance benefits brought by columnar formats.

3. Expose the whole dataset by making two (or more) DataFrames out of 
these two parts of the data and unioning them.

Of course, the above is more of a design issue for the upper-level 
application. For this PR, the biggest problem I see is that it makes a 
non-recommended special use case the default and introduces a performance 
regression for other (more common) use cases.
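
The hot/cold split behind steps 1 and 2 can be sketched in plain Scala, without Spark. This is only a model under stated assumptions: `DataFile`, `CompactionPlan`, and the one-hour window are hypothetical names and values chosen for illustration, not part of this PR.

```scala
import java.time.{Duration, Instant}

// Hypothetical model of one small file written by the ingest job.
final case class DataFile(path: String, writtenAt: Instant)

object CompactionPlan {
  // Split files into (hot, cold). Hot files are younger than `hotWindow`
  // and stay in a simple format; cold files are candidates for compaction
  // into a few large columnar files.
  def split(
      files: Seq[DataFile],
      now: Instant,
      hotWindow: Duration): (Seq[DataFile], Seq[DataFile]) = {
    val cutoff = now.minus(hotWindow)
    files.partition(f => f.writtenAt.isAfter(cutoff))
  }
}
```

A reader would then load each group with the matching format and union the two results, as in step 3.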



[GitHub] spark pull request: [SPARK-9868] [SQL] [WIP] reproduce failure

2015-08-13 Thread SparkQA

Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8160#issuecomment-130566969
  
  [Test build #1548 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/1548/console)
 for   PR 8160 at commit 
[`044d0c1`](https://github.com/apache/spark/commit/044d0c1ac723c77d84d349a57991ac86fb87de59).
 * This patch **passes all tests**.
 * This patch merges cleanly.
 * This patch adds the following public classes _(experimental)_:
  * `trait Identifiable `
  * `class VectorUDT extends UserDefinedType[Vector] `




[GitHub] spark pull request: [SPARK-8887][SQL]Explicit define which data ty...

2015-08-13 Thread liancheng

Github user liancheng commented on a diff in the pull request:

https://github.com/apache/spark/pull/8132#discussion_r36948745
  
--- Diff: 
sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/PartitioningUtils.scala
 ---
@@ -270,6 +270,8 @@ private[sql] object PartitioningUtils {
   private val upCastingOrder: Seq[DataType] =
 Seq(NullType, IntegerType, LongType, FloatType, DoubleType, StringType)
 
+  val validPartitionColumnTypes: Set[DataType] = upCastingOrder.toSet
--- End diff --

I think all data types that inherit from `AtomicType` should be valid here.
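
A minimal sketch of that idea, matching on the supertype instead of enumerating a set. The types below are hypothetical stand-ins for Spark SQL's hierarchy, not the real classes, and `isValidPartitionColumnType` is an assumed name.

```scala
object PartitionTypes {
  // Hypothetical stand-ins for Spark SQL's type hierarchy.
  sealed trait DataType
  sealed trait AtomicType extends DataType
  case object IntegerType extends AtomicType
  case object StringType extends AtomicType
  case object DateType extends AtomicType
  final case class ArrayType(elementType: DataType) extends DataType

  // Accept any atomic type rather than a hand-maintained whitelist.
  def isValidPartitionColumnType(dt: DataType): Boolean = dt match {
    case _: AtomicType => true
    case _             => false
  }
}
```

With this shape, a newly added atomic type is accepted without touching the check.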



[GitHub] spark pull request: [SPARK-8887][SQL]Explicit define which data ty...

2015-08-13 Thread liancheng

Github user liancheng commented on a diff in the pull request:

https://github.com/apache/spark/pull/8132#discussion_r36949251
  
--- Diff: 
sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/ResolvedDataSource.scala
 ---
@@ -179,6 +179,13 @@ object ResolvedDataSource extends Logging {
           val fs = path.getFileSystem(sqlContext.sparkContext.hadoopConfiguration)
           path.makeQualified(fs.getUri, fs.getWorkingDirectory)
         }
+
+        partitionColumnsSchema(data.schema, partitionColumns).foreach { field =>
+          if (!PartitioningUtils.validPartitionColumnTypes.contains(field.dataType)) {
+            throw new AnalysisException(s"Cannot use ${field.dataType} for partition column")
+          }
+        }
--- End diff --

Actually twice, the 3rd place is still quite similar though :)



[GitHub] spark pull request: [SPARK-8887][SQL]Explicit define which data ty...

2015-08-13 Thread liancheng

Github user liancheng commented on a diff in the pull request:

https://github.com/apache/spark/pull/8132#discussion_r36949177
  
--- Diff: 
sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/ResolvedDataSource.scala
 ---
@@ -179,6 +179,13 @@ object ResolvedDataSource extends Logging {
           val fs = path.getFileSystem(sqlContext.sparkContext.hadoopConfiguration)
           path.makeQualified(fs.getUri, fs.getWorkingDirectory)
         }
+
+        partitionColumnsSchema(data.schema, partitionColumns).foreach { field =>
+          if (!PartitioningUtils.validPartitionColumnTypes.contains(field.dataType)) {
+            throw new AnalysisException(s"Cannot use ${field.dataType} for partition column")
+          }
+        }
--- End diff --

Can we make this snippet a separate method? It's duplicated 3 times in this 
PR.
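
Something along these lines, as a rough sketch. `Field`, the string-typed `dataType`, the local `AnalysisException`, and the method name `validatePartitionColumnTypes` are simplified, hypothetical stand-ins so the example runs without Spark.

```scala
object PartitionColumnCheck {
  // Hypothetical simplified field and exception types for illustration only.
  final case class Field(name: String, dataType: String)
  final class AnalysisException(message: String) extends Exception(message)

  val validPartitionColumnTypes: Set[String] =
    Set("int", "long", "float", "double", "string")

  // One place for the check, so call sites do not repeat the loop.
  def validatePartitionColumnTypes(fields: Seq[Field]): Unit =
    fields.foreach { field =>
      if (!validPartitionColumnTypes.contains(field.dataType)) {
        throw new AnalysisException(
          s"Cannot use ${field.dataType} for partition column")
      }
    }
}
```

Each of the duplicated sites would then reduce to a single call.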



[GitHub] spark pull request: [SPARK-8887][SQL]Explicit define which data ty...

2015-08-13 Thread yjshen

Github user yjshen commented on a diff in the pull request:

https://github.com/apache/spark/pull/8132#discussion_r36950083
  
--- Diff: 
sql/hive/src/test/scala/org/apache/spark/sql/sources/hadoopFsRelationSuites.scala
 ---
@@ -554,6 +556,21 @@ abstract class HadoopFsRelationTest extends QueryTest with SQLTestUtils {
       clonedConf.foreach(entry => configuration.set(entry.getKey, entry.getValue))
     }
   }
+
+  test("SPARK-8887: Explicitly define which data types can be used as dynamic partition columns") {
+    val df = Seq(
+      (1, "v1", Date.valueOf("2015-08-10")),
+      (2, "v2", Date.valueOf("2015-08-11")),
+      (3, "v3", Date.valueOf("2015-08-12"))).toDF("a", "b", "c")
+    withTempDir { file =>
+      intercept[AnalysisException] {
+        df.write.format(dataSourceName).partitionBy("c").save(file.getCanonicalPath)
+      }
+    }
+    intercept[AnalysisException] {
+      df.write.format(dataSourceName).partitionBy("c").saveAsTable("t")
+    }
+  }
--- End diff --

OK. I get this.



[GitHub] spark pull request: [SPARK-9767] Remove ConnectionManager.

2015-08-13 Thread AmplabJenkins

Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8161#issuecomment-130571993
  
Merged build started.



[GitHub] spark pull request: [SPARK-8887][SQL]Explicit define which data ty...

2015-08-13 Thread yjshen

Github user yjshen commented on a diff in the pull request:

https://github.com/apache/spark/pull/8132#discussion_r36950026
  
--- Diff: 
sql/hive/src/test/scala/org/apache/spark/sql/sources/hadoopFsRelationSuites.scala
 ---
@@ -554,6 +556,21 @@ abstract class HadoopFsRelationTest extends QueryTest with SQLTestUtils {
       clonedConf.foreach(entry => configuration.set(entry.getKey, entry.getValue))
     }
   }
+
+  test("SPARK-8887: Explicitly define which data types can be used as dynamic partition columns") {
+    val df = Seq(
+      (1, "v1", Date.valueOf("2015-08-10")),
+      (2, "v2", Date.valueOf("2015-08-11")),
+      (3, "v3", Date.valueOf("2015-08-12"))).toDF("a", "b", "c")
+    withTempDir { file =>
+      intercept[AnalysisException] {
+        df.write.format(dataSourceName).partitionBy("c").save(file.getCanonicalPath)
+      }
+    }
+    intercept[AnalysisException] {
+      df.write.format(dataSourceName).partitionBy("c").saveAsTable("t")
+    }
--- End diff --

I didn't wrap this in `withTable("t")` because `saveAsTable` will fail 
here, and if it is wrapped in `withTable`, an 
`org.apache.spark.sql.catalyst.analysis.NoSuchTableException` is thrown:

```
def getTable(dbName: String, tableName: String): HiveTable = {
  getTableOption(dbName, tableName).getOrElse(throw new NoSuchTableException)
}
```



[GitHub] spark pull request: [SPARK-9767] Remove ConnectionManager.

2015-08-13 Thread AmplabJenkins

Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8161#issuecomment-130571975
  
 Merged build triggered.



[GitHub] spark pull request: [SPARK-9757] [SQL] Fixes persistence of Parque...

2015-08-13 Thread asfgit

Github user asfgit closed the pull request at:

https://github.com/apache/spark/pull/8130



[GitHub] spark pull request: [SPARK-9580] [SQL] Replace singletons in SQL t...

2015-08-13 Thread SparkQA

Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8111#issuecomment-130573888
  
**[Test build #1541 timed 
out](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/1541/console)**
 for PR 8111 at commit 
[`828144f`](https://github.com/apache/spark/commit/828144f96e4454824887c1d01fada20ce3510610)
 after a configured wait of `175m`.



[GitHub] spark pull request: Testing Jenkins do not merge.

2015-08-13 Thread AmplabJenkins

Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8139#issuecomment-130574634
  
Merged build finished. Test FAILed.



[GitHub] spark pull request: [SPARK-9934] Deprecate NIO ConnectionManager.

2015-08-13 Thread SparkQA

Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8162#issuecomment-130574769
  
  [Test build #40749 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/40749/consoleFull)
 for   PR 8162 at commit 
[`4fb536d`](https://github.com/apache/spark/commit/4fb536dcd38b21d5844611c03e436edff996006f).



[GitHub] spark pull request: [SPARK-8887][SQL]Explicit define which data ty...

2015-08-13 Thread AmplabJenkins

Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8132#issuecomment-130585654
  
 Merged build triggered.



[GitHub] spark pull request: [SPARK-9818][SQL][WIP]Revert SPARK-6136 to ena...

2015-08-13 Thread SparkQA

Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8101#issuecomment-130586781
  
  [Test build #40751 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/40751/consoleFull)
 for   PR 8101 at commit 
[`aed4162`](https://github.com/apache/spark/commit/aed41621a103ca83b134070c0a11b3c1ed5d6922).



[GitHub] spark pull request: [SPARK-8531] [ML] Update ML user guide for Min...

2015-08-13 Thread SparkQA

Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/7211#issuecomment-130550285
  
  [Test build #40741 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/40741/console)
 for   PR 7211 at commit 
[`b6ac0fc`](https://github.com/apache/spark/commit/b6ac0fc6eae4a03e3498891e9ee4ebfde418af8f).
 * This patch **passes all tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.



[GitHub] spark pull request: [SPARK-9882][Core] Priority-based scheduling f...

2015-08-13 Thread viirya

Github user viirya commented on the pull request:

https://github.com/apache/spark/pull/8129#issuecomment-130550969
  
retest this please.



[GitHub] spark pull request: [SPARK-9885] [SQL] Also pass barrierPrefixes a...

2015-08-13 Thread AmplabJenkins

Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8158#issuecomment-130553153
  
Merged build finished. Test PASSed.



[GitHub] spark pull request: [SPARK-9882][Core] Priority-based scheduling f...

2015-08-13 Thread AmplabJenkins

Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8129#issuecomment-130553847
  
Merged build started.



[GitHub] spark pull request: [SPARK-9882][Core] Priority-based scheduling f...

2015-08-13 Thread AmplabJenkins

Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8129#issuecomment-130553715
  
 Merged build triggered.



[GitHub] spark pull request: [SPARK-9885] [SQL] Also pass barrierPrefixes a...

2015-08-13 Thread SparkQA

Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8158#issuecomment-130552920
  
  [Test build #40728 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/40728/console)
 for   PR 8158 at commit 
[`2a134d5`](https://github.com/apache/spark/commit/2a134d548a64cdf3fc5299878262e98580d7eaa4).
 * This patch **passes all tests**.
 * This patch merges cleanly.
 * This patch adds the following public classes _(experimental)_:
  * `trait Identifiable `
  * `class VectorUDT extends UserDefinedType[Vector] `




[GitHub] spark pull request: [SPARK-9757] [SQL] Fixes persistence of Parque...

2015-08-13 Thread liancheng

Github user liancheng commented on the pull request:

https://github.com/apache/spark/pull/8130#issuecomment-130558263
  
Verified against my local MySQL-backed Hive 0.13.1 metastore, and it works. 
Merging to master and branch-1.5.



[GitHub] spark pull request: [SPARK-9885] [SQL] Also pass barrierPrefixes a...

2015-08-13 Thread asfgit

Github user asfgit closed the pull request at:

https://github.com/apache/spark/pull/8158



[GitHub] spark pull request: SPARK-8949 - Print warnings when using preferr...

2015-08-13 Thread rxin

Github user rxin commented on a diff in the pull request:

https://github.com/apache/spark/pull/7874#discussion_r36947100
  
--- Diff: core/src/main/scala/org/apache/spark/SparkContext.scala ---
@@ -118,9 +118,11 @@ class SparkContext(config: SparkConf) extends Logging 
with ExecutorAllocationCli
* Can be generated using 
[[org.apache.spark.scheduler.InputFormatInfo.computePreferredLocations]]
* from a list of input files or InputFormats for the application.
*/
+  @Deprecated("Passing in preferred locations has no effect at all, see SPARK-8949")
--- End diff --

this should be
```scala
@deprecated("Passing in preferred locations has no effect at all, see SPARK-8949", "1.5.0")
```
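
For context, Scala's `@deprecated` annotation takes a message and a `since` version, whereas Java's `@Deprecated` carries no message. A minimal self-contained sketch; the object and method names are hypothetical and only illustrate the annotation's shape:

```scala
object DeprecationExample {
  // Scala's annotation: first argument is the message, second is the
  // version since which the member is deprecated.
  @deprecated("Passing in preferred locations has no effect at all, see SPARK-8949", "1.5.0")
  def withPreferredLocations(partitions: Int, locations: Seq[String]): Int = partitions

  // Replacement that callers are expected to use instead.
  def withoutPreferredLocations(partitions: Int): Int = partitions
}
```

Callers of the annotated method get a compiler warning that includes both the message and the version.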



[GitHub] spark pull request: [SPARK-8887][SQL]Explicit define which data ty...

2015-08-13 Thread liancheng

Github user liancheng commented on a diff in the pull request:

https://github.com/apache/spark/pull/8132#discussion_r36949342
  
--- Diff: 
sql/hive/src/test/scala/org/apache/spark/sql/sources/hadoopFsRelationSuites.scala
 ---
@@ -554,6 +556,21 @@ abstract class HadoopFsRelationTest extends QueryTest with SQLTestUtils {
       clonedConf.foreach(entry => configuration.set(entry.getKey, entry.getValue))
     }
   }
+
+  test("SPARK-8887: Explicitly define which data types can be used as dynamic partition columns") {
+    val df = Seq(
+      (1, "v1", Date.valueOf("2015-08-10")),
+      (2, "v2", Date.valueOf("2015-08-11")),
+      (3, "v3", Date.valueOf("2015-08-12"))).toDF("a", "b", "c")
+    withTempDir { file =>
+      intercept[AnalysisException] {
+        df.write.format(dataSourceName).partitionBy("c").save(file.getCanonicalPath)
+      }
+    }
+    intercept[AnalysisException] {
+      df.write.format(dataSourceName).partitionBy("c").saveAsTable("t")
+    }
+  }
--- End diff --

We need to update this test case after putting `DateType` into the set of 
valid types of partition columns.



[GitHub] spark pull request: [SPARK-9767] Remove ConnectionManager.

2015-08-13 Thread rxin

GitHub user rxin opened a pull request:

https://github.com/apache/spark/pull/8161

[SPARK-9767] Remove ConnectionManager.

We introduced the Netty network module for shuffle in Spark 1.2, and it has 
been turned on by default for 3 releases. The old ConnectionManager is difficult 
to maintain. It's time to remove it for Spark 1.6.

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/rxin/spark SPARK-9767

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/8161.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #8161


commit bdf1d5e357fe423d356405acd0844991c652f150
Author: Reynold Xin r...@databricks.com
Date:   2015-08-13T08:12:43Z

[SPARK-9767] Remove ConnectionManager.

We introduced the Netty network module for shuffle in Spark 1.2, and it has 
been turned on by default for 3 releases. The old ConnectionManager is difficult 
to maintain. It's time to remove it for Spark 1.6.





[GitHub] spark pull request: [SPARK-9580] [SQL] Replace singletons in SQL t...

2015-08-13 Thread SparkQA

Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8111#issuecomment-130571180
  
  [Test build #1545 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/1545/console)
 for   PR 8111 at commit 
[`c4d44c9`](https://github.com/apache/spark/commit/c4d44c9139eff45048d849311c539dadda54004c).
 * This patch **fails Spark unit tests**.
 * This patch merges cleanly.
 * This patch adds the following public classes _(experimental)_:
  * `trait Identifiable `
  * `case class QRDecomposition[QType, RType](Q: QType, R: RType)`
  * `class VectorUDT extends UserDefinedType[Vector] `
  * `  implicit class StringToColumn(val sc: StringContext) `




[GitHub] spark pull request: [SPARK-9868] [SQL] [WIP] reproduce failure

2015-08-13 Thread SparkQA

Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8160#issuecomment-130572627
  
  [Test build #1549 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/1549/console)
 for   PR 8160 at commit 
[`044d0c1`](https://github.com/apache/spark/commit/044d0c1ac723c77d84d349a57991ac86fb87de59).
 * This patch **passes all tests**.
 * This patch merges cleanly.
 * This patch adds the following public classes _(experimental)_:
  * `trait Identifiable `
  * `class VectorUDT extends UserDefinedType[Vector] `




[GitHub] spark pull request: [SPARK-8887][SQL]Explicit define which data ty...

2015-08-13 Thread liancheng

Github user liancheng commented on a diff in the pull request:

https://github.com/apache/spark/pull/8132#discussion_r36951210
  
--- Diff: 
sql/hive/src/test/scala/org/apache/spark/sql/sources/hadoopFsRelationSuites.scala
 ---
@@ -554,6 +556,21 @@ abstract class HadoopFsRelationTest extends QueryTest with SQLTestUtils {
       clonedConf.foreach(entry => configuration.set(entry.getKey, entry.getValue))
     }
   }
+
+  test("SPARK-8887: Explicitly define which data types can be used as dynamic partition columns") {
+    val df = Seq(
+      (1, "v1", Date.valueOf("2015-08-10")),
+      (2, "v2", Date.valueOf("2015-08-11")),
+      (3, "v3", Date.valueOf("2015-08-12"))).toDF("a", "b", "c")
+    withTempDir { file =>
+      intercept[AnalysisException] {
+        df.write.format(dataSourceName).partitionBy("c").save(file.getCanonicalPath)
+      }
+    }
+    intercept[AnalysisException] {
+      df.write.format(dataSourceName).partitionBy("c").saveAsTable("t")
+    }
--- End diff --

Oh I see, makes sense.



[GitHub] spark pull request: Testing Jenkins do not merge.

2015-08-13 Thread SparkQA

Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8139#issuecomment-130574566
  
[Test build #40740 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/40740/console) for PR 8139 at commit [`e4e2254`](https://github.com/apache/spark/commit/e4e225457b8eac1640f5e06974047e3aabc83642).
 * This patch **fails Spark unit tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.



[GitHub] spark pull request: [SPARK-9580] [SQL] Replace singletons in SQL t...

2015-08-13 Thread SparkQA

Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8111#issuecomment-130579622
  
**[Test build #40739 timed out](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/40739/console)** for PR 8111 at commit [`c4d44c9`](https://github.com/apache/spark/commit/c4d44c9139eff45048d849311c539dadda54004c) after a configured wait of `175m`.



[GitHub] spark pull request: [SPARK-9720] [ML] Identifiable types need UID ...

2015-08-13 Thread BertrandDechoux

Github user BertrandDechoux commented on the pull request:

https://github.com/apache/spark/pull/8062#issuecomment-130579425
  
Like I said, I didn't run the test. There seems to be no clear easy way. I 
will have to invest time to find out.

Is there a way to see the result of amplab jenkins for this pull request?
https://amplab.cs.berkeley.edu/jenkins/

@mengxr I will take into account your 2 points. They are indeed both 
relevant.



[GitHub] spark pull request: [SPARK-9580] [SQL] Replace singletons in SQL t...

2015-08-13 Thread AmplabJenkins

Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8111#issuecomment-130579959
  
Merged build finished. Test FAILed.



[GitHub] spark pull request: [SPARK-8887][SQL]Explicit define which data ty...

2015-08-13 Thread AmplabJenkins

Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8132#issuecomment-130585700
  
Merged build started.



[GitHub] spark pull request: [SPARK-9818][SQL][WIP]Revert SPARK-6136 to ena...

2015-08-13 Thread yjshen

Github user yjshen commented on the pull request:

https://github.com/apache/spark/pull/8101#issuecomment-130586263
  
Jenkins, retest this please.



[GitHub] spark pull request: [SPARK-9925] [SQL] [TESTS] Set SQLConf.SHUFFLE...

2015-08-13 Thread SparkQA

Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8155#issuecomment-130587485
  
**[Test build #40743 timed out](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/40743/console)** for PR 8155 at commit [`d09b603`](https://github.com/apache/spark/commit/d09b6031c42391dacc1d0d4bad44050b5885b1f5) after a configured wait of `175m`.



[GitHub] spark pull request: [SPARK-9877][Core] Fix StandaloneRestServer NP...

2015-08-13 Thread SparkQA

Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8127#issuecomment-130587302
  
[Test build #40744 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/40744/console) for PR 8127 at commit [`fdb6158`](https://github.com/apache/spark/commit/fdb6158f6439dce80bdbd01ef2e483265ca5eb84).
 * This patch **fails Spark unit tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.



[GitHub] spark pull request: [SPARK-9925] [SQL] [TESTS] Set SQLConf.SHUFFLE...

2015-08-13 Thread AmplabJenkins

Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8155#issuecomment-130587553
  
Merged build finished. Test FAILed.



[GitHub] spark pull request: [SPARK-9877][Core] Fix StandaloneRestServer NP...

2015-08-13 Thread AmplabJenkins

Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8127#issuecomment-130587346
  
Merged build finished. Test FAILed.



[GitHub] spark pull request: [SPARK-9868] [SQL] [WIP] reproduce failure

2015-08-13 Thread AmplabJenkins

Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8160#issuecomment-130550743
  
Merged build finished. Test FAILed.



[GitHub] spark pull request: [SPARK-9925] [SQL] [TESTS] Set SQLConf.SHUFFLE...

2015-08-13 Thread SparkQA

Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8155#issuecomment-130550893
  
[Test build #40743 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/40743/consoleFull) for PR 8155 at commit [`d09b603`](https://github.com/apache/spark/commit/d09b6031c42391dacc1d0d4bad44050b5885b1f5).



[GitHub] spark pull request: [SPARK-9877][Core] Fix StandaloneRestServer NP...

2015-08-13 Thread AmplabJenkins

Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8127#issuecomment-130550728
  
Merged build started.



[GitHub] spark pull request: [SPARK-9877][Core] Fix StandaloneRestServer NP...

2015-08-13 Thread AmplabJenkins

Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8127#issuecomment-130550715
  
 Merged build triggered.



[GitHub] spark pull request: [SPARK-9832] [SQL] add a thread-safe lookup fo...

2015-08-13 Thread SparkQA

Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8151#issuecomment-130550929
  
[Test build #1536 has finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/1536/console) for PR 8151 at commit [`8daa2cd`](https://github.com/apache/spark/commit/8daa2cd1d8431fa6e10d6ec664e20e50da6f0139).
 * This patch **fails PySpark unit tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.



[GitHub] spark pull request: [SPARK-9868] [SQL] [WIP] reproduce failure

2015-08-13 Thread SparkQA

Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8160#issuecomment-130552786
  
[Test build #1549 has started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/1549/consoleFull) for PR 8160 at commit [`044d0c1`](https://github.com/apache/spark/commit/044d0c1ac723c77d84d349a57991ac86fb87de59).



[GitHub] spark pull request: [SPARK-9929][SQL] support metadata in withColu...

2015-08-13 Thread SparkQA

Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8159#issuecomment-130555772
  
[Test build #40732 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/40732/console) for PR 8159 at commit [`4698d05`](https://github.com/apache/spark/commit/4698d05db5e874cc6cb7aa3dced022809bf3ba3d).
 * This patch **passes all tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.



[GitHub] spark pull request: [SPARK-9882][Core] Priority-based scheduling f...

2015-08-13 Thread SparkQA

Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8129#issuecomment-130555864
  
[Test build #40746 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/40746/consoleFull) for PR 8129 at commit [`867c941`](https://github.com/apache/spark/commit/867c9417d3119f53f055bb5064a8977b9ddd8304).



[GitHub] spark pull request: [SPARK-9929][SQL] support metadata in withColu...

2015-08-13 Thread AmplabJenkins

Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8159#issuecomment-130555873
  
Merged build finished. Test PASSed.



[GitHub] spark pull request: [SPARK-9925] [SQL] [TESTS] Set SQLConf.SHUFFLE...

2015-08-13 Thread liancheng

Github user liancheng commented on the pull request:

https://github.com/apache/spark/pull/8155#issuecomment-130562648
  
Here's another place where this configuration is changed 
https://github.com/apache/spark/blob/84a27916a62980c8fcb0977c3a7fdb73c0bd5812/sql/core/src/test/scala/org/apache/spark/sql/SQLConfSuite.scala#L77



[GitHub] spark pull request: [SPARK-9580] [SQL] Replace singletons in SQL t...

2015-08-13 Thread SparkQA

Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8111#issuecomment-130565139
  
[Test build #1540 has finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/1540/console) for PR 8111 at commit [`828144f`](https://github.com/apache/spark/commit/828144f96e4454824887c1d01fada20ce3510610).
 * This patch **fails Spark unit tests**.
 * This patch merges cleanly.
 * This patch adds the following public classes _(experimental)_:
  * `  implicit class StringToColumn(val sc: StringContext) `




[GitHub] spark pull request: [SPARK-9868] [SQL] [WIP] reproduce failure

2015-08-13 Thread SparkQA

Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8160#issuecomment-130566615
  
[Test build #1546 has finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/1546/console) for PR 8160 at commit [`044d0c1`](https://github.com/apache/spark/commit/044d0c1ac723c77d84d349a57991ac86fb87de59).
 * This patch **passes all tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.



[GitHub] spark pull request: [SPARK-9580] [SQL] Replace singletons in SQL t...

2015-08-13 Thread SparkQA

Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8111#issuecomment-130566573
  
[Test build #1544 has finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/1544/console) for PR 8111 at commit [`c4d44c9`](https://github.com/apache/spark/commit/c4d44c9139eff45048d849311c539dadda54004c).
 * This patch **fails Spark unit tests**.
 * This patch merges cleanly.
 * This patch adds the following public classes _(experimental)_:
  * `trait Identifiable `
  * `class VectorUDT extends UserDefinedType[Vector] `
  * `  implicit class StringToColumn(val sc: StringContext) `




[GitHub] spark pull request: [SPARK-9661] [MLLib] [ML] Java compatibility

2015-08-13 Thread SparkQA

Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8126#issuecomment-130566804
  
[Test build #40747 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/40747/consoleFull) for PR 8126 at commit [`5fa5f7a`](https://github.com/apache/spark/commit/5fa5f7aafbd547d50eaa17f7f657ba29751babfa).



[GitHub] spark pull request: [SPARK-9661] [MLLib] [ML] Java compatibility

2015-08-13 Thread AmplabJenkins

Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8126#issuecomment-130573459
  
Merged build finished. Test PASSed.



[GitHub] spark pull request: [SPARK-9934] Deprecate NIO ConnectionManager.

2015-08-13 Thread rxin

GitHub user rxin opened a pull request:

https://github.com/apache/spark/pull/8162

[SPARK-9934] Deprecate NIO ConnectionManager.

Deprecate NIO ConnectionManager in Spark 1.5.0, before removing it in Spark 
1.6.0.


You can merge this pull request into a Git repository by running:

$ git pull https://github.com/rxin/spark SPARK-9934

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/8162.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #8162


commit 4fb536dcd38b21d5844611c03e436edff996006f
Author: Reynold Xin <r...@databricks.com>
Date:   2015-08-13T08:27:03Z

[SPARK-9934] Deprecate NIO ConnectionManager.





[GitHub] spark pull request: [SPARK-9661] [MLLib] [ML] Java compatibility

2015-08-13 Thread SparkQA

Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8126#issuecomment-130573340
  
[Test build #40747 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/40747/console) for PR 8126 at commit [`5fa5f7a`](https://github.com/apache/spark/commit/5fa5f7aafbd547d50eaa17f7f657ba29751babfa).
 * This patch **passes all tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.



[GitHub] spark pull request: [SPARK-8887][SQL]Explicit define which data ty...

2015-08-13 Thread AmplabJenkins

Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8132#issuecomment-130587234
  
Merged build finished. Test FAILed.



[GitHub] spark pull request: [SPARK-8887][SQL]Explicit define which data ty...

2015-08-13 Thread AmplabJenkins

Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8132#issuecomment-130587220
  
Merged build started.



[GitHub] spark pull request: [SPARK-8887][SQL]Explicit define which data ty...

2015-08-13 Thread AmplabJenkins

Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8132#issuecomment-130587199
  
 Merged build triggered.



[GitHub] spark pull request: [SPARK-9882][Core] Priority-based scheduling f...

2015-08-13 Thread AmplabJenkins

Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8129#issuecomment-130605258
  
 Merged build triggered.



[GitHub] spark pull request: [SPARK-9882][Core] Priority-based scheduling f...

2015-08-13 Thread AmplabJenkins

Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8129#issuecomment-130605274
  
Merged build started.



[GitHub] spark pull request: [SPARK-8887][SQL]Explicit define which data ty...

2015-08-13 Thread SparkQA

Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8132#issuecomment-130605400
  
[Test build #40756 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/40756/consoleFull) for PR 8132 at commit [`d926a61`](https://github.com/apache/spark/commit/d926a615bbbcb40aaeba2a977c9c8c4b1787ecd2).



[GitHub] spark pull request: [SPARK-9882][Core] Priority-based scheduling f...

2015-08-13 Thread AmplabJenkins

Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8129#issuecomment-130591634
  
Merged build finished. Test FAILed.



[GitHub] spark pull request: [SPARK-9882][Core] Priority-based scheduling f...

2015-08-13 Thread SparkQA

Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/8129#issuecomment-130591580
  
**[Test build #40746 timed out](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/40746/console)** for PR 8129 at commit [`867c941`](https://github.com/apache/spark/commit/867c9417d3119f53f055bb5064a8977b9ddd8304) after a configured wait of `175m`.



[GitHub] spark pull request: [SPARK-8887][SQL]Explicit define which data ty...

2015-08-13 Thread yjshen

Github user yjshen commented on the pull request:

https://github.com/apache/spark/pull/8132#issuecomment-130591739
  
Jenkins, retest this please.



[GitHub] spark pull request: [SPARK-8887][SQL]Explicit define which data ty...

2015-08-13 Thread AmplabJenkins

Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/8132#issuecomment-130604506
  
 Merged build triggered.


