spark git commit: [SPARK-12692][BUILD][CORE] Scala style: Fix the style violation (Space before ", ")

2016-01-12 Thread rxin
Repository: spark Updated Branches: refs/heads/master b3b9ad23c -> f14922cff [SPARK-12692][BUILD][CORE] Scala style: Fix the style violation (Space before ",") Fix the style violation (space before , and :). This PR is a followup for #10643 Author: Kousuke Saruta

spark git commit: [HOT-FIX] bypass hive test when parse logical plan to json

2016-01-12 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.6 03e523e52 -> f71e5cc12 [HOT-FIX] bypass hive test when parse logical plan to json https://github.com/apache/spark/pull/10311 introduces some rare, non-deterministic flakiness for hive udf tests, see

spark git commit: [SPARK-12692][BUILD][SQL] Scala style: Fix the style violation (Space before ", ")

2016-01-12 Thread rxin
Repository: spark Updated Branches: refs/heads/master dc7b3870f -> cb7b864a2 [SPARK-12692][BUILD][SQL] Scala style: Fix the style violation (Space before ",") Fix the style violation (space before , and :). This PR is a followup for #10643 and rework of #10685 . Author: Kousuke Saruta

spark git commit: [SPARK-12788][SQL] Simplify BooleanEquality by using casts.

2016-01-12 Thread rxin
Repository: spark Updated Branches: refs/heads/master 924708496 -> b3b9ad23c [SPARK-12788][SQL] Simplify BooleanEquality by using casts. Author: Reynold Xin Closes #10730 from rxin/SPARK-12788. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit:

spark git commit: [SPARK-12558][SQL] AnalysisException when multiple functions applied in GROUP BY clause

2016-01-12 Thread yhuai
Repository: spark Updated Branches: refs/heads/master f14922cff -> dc7b3870f [SPARK-12558][SQL] AnalysisException when multiple functions applied in GROUP BY clause cloud-fan Can you please take a look ? In this case, we are failing during check analysis while validating the aggregation

spark git commit: [SPARK-12558][SQL] AnalysisException when multiple functions applied in GROUP BY clause

2016-01-12 Thread yhuai
Repository: spark Updated Branches: refs/heads/branch-1.6 f71e5cc12 -> dcdc864cf [SPARK-12558][SQL] AnalysisException when multiple functions applied in GROUP BY clause cloud-fan Can you please take a look ? In this case, we are failing during check analysis while validating the

spark git commit: [SPARK-12692][BUILD][SQL] Scala style: Fix the style violation (Space before ", " or ":")

2016-01-12 Thread rxin
Repository: spark Updated Branches: refs/heads/master 112abf910 -> 8cfa218f4 [SPARK-12692][BUILD][SQL] Scala style: Fix the style violation (Space before "," or ":") Fix the style violation (space before , and :). This PR is a followup for #10643. Author: Kousuke Saruta

spark git commit: [SPARK-7615][MLLIB] MLLIB Word2Vec wordVectors divided by Euclidean Norm equals to zero

2016-01-12 Thread srowen
Repository: spark Updated Branches: refs/heads/master 8cfa218f4 -> c48f2a3a5 [SPARK-7615][MLLIB] MLLIB Word2Vec wordVectors divided by Euclidean Norm equals to zero Cosine similarity with 0 vector should be 0 Related to https://github.com/apache/spark/pull/10152 Author: Sean Owen

spark git commit: [SPARK-5273][MLLIB][DOCS] Improve documentation examples for LinearRegression

2016-01-12 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 3221a7d91 -> 4c67d55c0 [SPARK-5273][MLLIB][DOCS] Improve documentation examples for LinearRegression Use a much smaller step size in LinearRegressionWithSGD MLlib examples to achieve a reasonable RMSE. Our training folks hit this

spark git commit: [SPARK-12638][API DOC] Parameter explanation not very accurate for rdd function "aggregate"

2016-01-12 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 a6c9c68d8 -> 46fc7a12a [SPARK-12638][API DOC] Parameter explanation not very accurate for rdd function "aggregate" Currently, RDD function aggregate's parameter doesn't explain well, especially parameter "zeroValue". It's helpful to

spark git commit: [SPARK-12638][API DOC] Parameter explanation not very accurate for rdd function "aggregate"

2016-01-12 Thread srowen
Repository: spark Updated Branches: refs/heads/master 9c7f34af3 -> 9f0995bb0 [SPARK-12638][API DOC] Parameter explanation not very accurate for rdd function "aggregate" Currently, RDD function aggregate's parameter doesn't explain well, especially parameter "zeroValue". It's helpful to let

spark git commit: [SPARK-12582][TEST] IndexShuffleBlockResolverSuite fails in windows

2016-01-12 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 46fc7a12a -> 3221a7d91 [SPARK-12582][TEST] IndexShuffleBlockResolverSuite fails in windows [SPARK-12582][Test] IndexShuffleBlockResolverSuite fails in windows * IndexShuffleBlockResolverSuite fails in windows due to file is not

spark git commit: [SPARK-12582][TEST] IndexShuffleBlockResolverSuite fails in windows

2016-01-12 Thread srowen
Repository: spark Updated Branches: refs/heads/master 9f0995bb0 -> 7e15044d9 [SPARK-12582][TEST] IndexShuffleBlockResolverSuite fails in windows [SPARK-12582][Test] IndexShuffleBlockResolverSuite fails in windows * IndexShuffleBlockResolverSuite fails in windows due to file is not closed. *

spark git commit: [SPARK-7615][MLLIB] MLLIB Word2Vec wordVectors divided by Euclidean Norm equals to zero

2016-01-12 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 4c67d55c0 -> 94b39f777 [SPARK-7615][MLLIB] MLLIB Word2Vec wordVectors divided by Euclidean Norm equals to zero Cosine similarity with 0 vector should be 0 Related to https://github.com/apache/spark/pull/10152 Author: Sean Owen

spark git commit: [SPARK-9843][SQL] Make catalyst optimizer pass pluggable at runtime

2016-01-12 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 1d8887953 -> 508592b1b [SPARK-9843][SQL] Make catalyst optimizer pass pluggable at runtime Let me know whether you'd like to see it in other place Author: Robert Kruszewski Closes #10210 from

spark git commit: [SPARK-12724] SQL generation support for persisted data source tables

2016-01-12 Thread lian
Repository: spark Updated Branches: refs/heads/master 0d543b98f -> 8ed5f12d2 [SPARK-12724] SQL generation support for persisted data source tables This PR implements SQL generation support for persisted data source tables. A new field `metastoreTableIdentifier: Option[TableIdentifier]` is

spark git commit: [SPARK-5273][MLLIB][DOCS] Improve documentation examples for LinearRegression

2016-01-12 Thread srowen
Repository: spark Updated Branches: refs/heads/master c48f2a3a5 -> 9c7f34af3 [SPARK-5273][MLLIB][DOCS] Improve documentation examples for LinearRegression Use a much smaller step size in LinearRegressionWithSGD MLlib examples to achieve a reasonable RMSE. Our training folks hit this exact

spark git commit: Revert "[SPARK-12692][BUILD][SQL] Scala style: Fix the style violation (Space before ", " or ":")"

2016-01-12 Thread rxin
Repository: spark Updated Branches: refs/heads/master 0ed430e31 -> 0d543b98f Revert "[SPARK-12692][BUILD][SQL] Scala style: Fix the style violation (Space before "," or ":")" This reverts commit 8cfa218f4f1b05f4d076ec15dd0a033ad3e4500d. Project:

spark git commit: Revert "[SPARK-12645][SPARKR] SparkR support hash function"

2016-01-12 Thread yhuai
Repository: spark Updated Branches: refs/heads/branch-1.6 94b39f777 -> 03e523e52 Revert "[SPARK-12645][SPARKR] SparkR support hash function" This reverts commit 8b5f23043322254c725c703c618ba3d3cc4a4240. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit:

spark git commit: [SPARK-12785][SQL] Add ColumnarBatch, an in memory columnar format for execution.

2016-01-12 Thread rxin
Repository: spark Updated Branches: refs/heads/master 4f60651cb -> 924708496 [SPARK-12785][SQL] Add ColumnarBatch, an in memory columnar format for execution. There are many potential benefits of having an efficient in memory columnar format as an alternate to UnsafeRow. This patch