spark git commit: [SPARK-13899][SQL] Produce InternalRow instead of external Row at CSV data source

2016-03-15 Thread rxin
Repository: spark Updated Branches: refs/heads/master 3c578c594 -> 92024797a [SPARK-13899][SQL] Produce InternalRow instead of external Row at CSV data source ## What changes were proposed in this pull request? https://issues.apache.org/jira/browse/SPARK-13899 This PR makes CSV data source

spark git commit: [SPARK-13920][BUILD] MIMA checks should apply to @Experimental and @DeveloperAPI APIs

2016-03-15 Thread rxin
Repository: spark Updated Branches: refs/heads/master 3665294d4 -> 3c578c594 [SPARK-13920][BUILD] MIMA checks should apply to @Experimental and @DeveloperAPI APIs ## What changes were proposed in this pull request? We are able to change `Experimental` and `DeveloperAPI` API freely but also

spark git commit: [SPARK-9837][ML] R-like summary statistics for GLMs via iteratively reweighted least squares

2016-03-15 Thread meng
Repository: spark Updated Branches: refs/heads/master 421f6c20e -> 3665294d4 [SPARK-9837][ML] R-like summary statistics for GLMs via iteratively reweighted least squares ## What changes were proposed in this pull request? Provide R-like summary statistics for GLMs via iteratively reweighted l

spark git commit: [SPARK-13917] [SQL] generate broadcast semi join

2016-03-15 Thread davies
Repository: spark Updated Branches: refs/heads/master 52b6a899b -> 421f6c20e [SPARK-13917] [SQL] generate broadcast semi join ## What changes were proposed in this pull request? This PR brings codegen support for broadcast left-semi join. ## How was this patch tested? Existing tests. Added

spark git commit: [MINOR][TEST][SQL] Remove wrong "expected" parameter in checkNaNWithoutCodegen

2016-03-15 Thread rxin
Repository: spark Updated Branches: refs/heads/master bbd887f53 -> 52b6a899b [MINOR][TEST][SQL] Remove wrong "expected" parameter in checkNaNWithoutCodegen ## What changes were proposed in this pull request? Remove the wrong "expected" parameter in MathFunctionsSuite.scala's checkNaNWithoutC

spark git commit: [SPARK-13918][SQL] Merge SortMergeJoin and SortMergerOuterJoin

2016-03-15 Thread rxin
Repository: spark Updated Branches: refs/heads/master 643649dcb -> bbd887f53 [SPARK-13918][SQL] Merge SortMergeJoin and SortMergerOuterJoin ## What changes were proposed in this pull request? This PR just move some code from SortMergeOuterJoin into SortMergeJoin. This is for support codegen

spark git commit: [SPARK-13895][SQL] DataFrameReader.text should return Dataset[String]

2016-03-15 Thread yhuai
Repository: spark Updated Branches: refs/heads/master 41eaabf59 -> 643649dcb [SPARK-13895][SQL] DataFrameReader.text should return Dataset[String] ## What changes were proposed in this pull request? This patch changes DataFrameReader.text()'s return type from DataFrame to Dataset[String]. Cl

spark git commit: [SPARK-13626][CORE] Revert change to SparkConf's constructor.

2016-03-15 Thread vanzin
Repository: spark Updated Branches: refs/heads/master dddf2f2d8 -> 41eaabf59 [SPARK-13626][CORE] Revert change to SparkConf's constructor. It shouldn't be private. Author: Marcelo Vanzin Closes #11734 from vanzin/SPARK-13626-api. Project: http://git-wip-us.apache.org/repos/asf/spark/repo

spark git commit: [MINOR] a minor fix for the comments of a method in RPC Dispatcher

2016-03-15 Thread zsxwing
Repository: spark Updated Branches: refs/heads/master 50e3644d0 -> dddf2f2d8 [MINOR] a minor fix for the comments of a method in RPC Dispatcher ## What changes were proposed in this pull request? a minor fix for the comments of a method in RPC Dispatcher ## How was this patch tested? existi

spark git commit: [SPARK-13896][SQL][STRING] Dataset.toJSON should return Dataset

2016-03-15 Thread rxin
Repository: spark Updated Branches: refs/heads/master d89c71417 -> 50e3644d0 [SPARK-13896][SQL][STRING] Dataset.toJSON should return Dataset ## What changes were proposed in this pull request? Change the return type of toJson in Dataset class ## How was this patch tested? No additional unit te

spark git commit: [SPARK-13642][YARN] Changed the default application exit state to failed for yarn cluster mode

2016-03-15 Thread vanzin
Repository: spark Updated Branches: refs/heads/master 5e6f2f456 -> d89c71417 [SPARK-13642][YARN] Changed the default application exit state to failed for yarn cluster mode ## What changes were proposed in this pull request? Changing the default exit state to `failed` for any application runn

spark git commit: [SPARK-13893][SQL] Remove SQLContext.catalog/analyzer (internal method)

2016-03-15 Thread rxin
Repository: spark Updated Branches: refs/heads/master 48978abfa -> 5e6f2f456 [SPARK-13893][SQL] Remove SQLContext.catalog/analyzer (internal method) ## What changes were proposed in this pull request? Our internal code can go through SessionState.catalog and SessionState.analyzer. This brings

spark git commit: [SPARK-13576][BUILD] Don't create assembly for examples.

2016-03-15 Thread vanzin
Repository: spark Updated Branches: refs/heads/master bd5365bbe -> 48978abfa [SPARK-13576][BUILD] Don't create assembly for examples. As part of the goal to stop creating assemblies in Spark, this change modifies the mvn and sbt builds to not create an assembly for examples. Instead, dependen

spark git commit: [SPARK-13803] restore the changes in SPARK-3411

2016-03-15 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.4 54ff1f9e1 -> dae346172 [SPARK-13803] restore the changes in SPARK-3411 ## What changes were proposed in this pull request? This patch contains the functionality to balance the load of the cluster-mode drivers among workers This patch

spark git commit: [SPARK-13803] restore the changes in SPARK-3411

2016-03-15 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.5 7a24d94a8 -> 58fbdf6f7 [SPARK-13803] restore the changes in SPARK-3411 ## What changes were proposed in this pull request? This patch contains the functionality to balance the load of the cluster-mode drivers among workers This patch

spark git commit: [SPARK-13803] restore the changes in SPARK-3411

2016-03-15 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 6935b5080 -> 765307f41 [SPARK-13803] restore the changes in SPARK-3411 ## What changes were proposed in this pull request? This patch contains the functionality to balance the load of the cluster-mode drivers among workers This patch

spark git commit: [SPARK-13803] restore the changes in SPARK-3411

2016-03-15 Thread srowen
Repository: spark Updated Branches: refs/heads/master dafd70fbf -> bd5365bbe [SPARK-13803] restore the changes in SPARK-3411 ## What changes were proposed in this pull request? This patch contains the functionality to balance the load of the cluster-mode drivers among workers This patch res

spark git commit: [SPARK-12379][ML][MLLIB] Copy GBT implementation to spark.ml

2016-03-15 Thread mlnick
Repository: spark Updated Branches: refs/heads/master 10251a745 -> dafd70fbf [SPARK-12379][ML][MLLIB] Copy GBT implementation to spark.ml Currently, GBTs in spark.ml wrap the implementation in spark.mllib. This is preventing several improvements to GBTs in spark.ml, so we need to move the im

spark git commit: [SPARK-13660][SQL][TESTS] ContinuousQuerySuite floods the logs with garbage

2016-03-15 Thread rxin
Repository: spark Updated Branches: refs/heads/master 99bd2f0e9 -> 10251a745 [SPARK-13660][SQL][TESTS] ContinuousQuerySuite floods the logs with garbage ## What changes were proposed in this pull request? Use method 'testQuietly' to avoid ContinuousQuerySuite flooding the console logs with g

spark git commit: [SPARK-13840][SQL] Split Optimizer Rule ColumnPruning to ColumnPruning and EliminateOperator

2016-03-15 Thread rxin
Repository: spark Updated Branches: refs/heads/master 276c2d51a -> 99bd2f0e9 [SPARK-13840][SQL] Split Optimizer Rule ColumnPruning to ColumnPruning and EliminateOperator What changes were proposed in this pull request? Before this PR, two Optimizer rules `ColumnPruning` and `PushPredic