spark git commit: [SPARK-15033][SQL] fix a flaky test in CachedTableSuite

2016-04-30 Thread rxin
Repository: spark Updated Branches: refs/heads/master 507bea5ca -> 90787de86 [SPARK-15033][SQL] fix a flaky test in CachedTableSuite ## What changes were proposed in this pull request? This is caused by https://github.com/apache/spark/pull/12776, which removes the `synchronized` from all

spark git commit: [SPARK-14143] Options for parsing NaNs, Infinity and nulls for numeric types

2016-04-30 Thread rxin
Repository: spark Updated Branches: refs/heads/master 0182d9599 -> 507bea5ca [SPARK-14143] Options for parsing NaNs, Infinity and nulls for numeric types 1. Adds the following options for parsing NaNs: nanValue 2. Adds the following options for parsing infinity: positiveInf, negativeInf.

spark git commit: [SPARK-15034][SPARK-15035][SPARK-15036][SQL] Use spark.sql.warehouse.dir as the warehouse location

2016-04-30 Thread rxin
Repository: spark Updated Branches: refs/heads/master 19a6d192d -> 0182d9599 [SPARK-15034][SPARK-15035][SPARK-15036][SQL] Use spark.sql.warehouse.dir as the warehouse location This PR contains three changes: 1. We will use spark.sql.warehouse.dir set warehouse location. We will not use

spark git commit: [SPARK-15030][ML][SPARKR] Support formula in spark.kmeans in SparkR

2016-04-30 Thread meng
Repository: spark Updated Branches: refs/heads/master e5fb78baf -> 19a6d192d [SPARK-15030][ML][SPARKR] Support formula in spark.kmeans in SparkR ## What changes were proposed in this pull request? * ```RFormula``` supports empty response variable like ```~ x + y```. * Support formula in

spark git commit: [SPARK-14952][CORE][ML] Remove methods that were deprecated in 1.6.0

2016-04-30 Thread srowen
Repository: spark Updated Branches: refs/heads/master 0847fe4eb -> e5fb78baf [SPARK-14952][CORE][ML] Remove methods that were deprecated in 1.6.0 What changes were proposed in this pull request? This PR removes three methods the were deprecated in 1.6.0: - `PortableDataStream.close()` -

spark git commit: [SPARK-14653][ML] Remove json4s from mllib-local

2016-04-30 Thread meng
Repository: spark Updated Branches: refs/heads/master 1192fe4cd -> 0847fe4eb [SPARK-14653][ML] Remove json4s from mllib-local ## What changes were proposed in this pull request? This PR moves Vector.toJson/fromJson to ml.linalg.VectorEncoder under mllib/ to keep mllib-local's dependency

spark git commit: [SPARK-15028][SQL] Remove HiveSessionState.setDefaultOverrideConfs

2016-04-30 Thread yhuai
Repository: spark Updated Branches: refs/heads/master b3ea57931 -> 8dc3987d0 [SPARK-15028][SQL] Remove HiveSessionState.setDefaultOverrideConfs ## What changes were proposed in this pull request? This patch removes some code that are no longer relevant -- mainly

spark git commit: [SPARK-14831][.2][ML][R] rename ml.save/ml.load to write.ml/read.ml

2016-04-30 Thread meng
Repository: spark Updated Branches: refs/heads/master 7fbe1bb24 -> b3ea57931 [SPARK-14831][.2][ML][R] rename ml.save/ml.load to write.ml/read.ml ## What changes were proposed in this pull request? Continue the work of #12789 to rename ml.asve/ml.load to write.ml/read.ml, which are more

spark git commit: [SPARK-14412][.2][ML] rename *RDDStorageLevel to *StorageLevel in ml.ALS

2016-04-30 Thread meng
Repository: spark Updated Branches: refs/heads/master 5886b6217 -> 7fbe1bb24 [SPARK-14412][.2][ML] rename *RDDStorageLevel to *StorageLevel in ml.ALS ## What changes were proposed in this pull request? As discussed in #12660, this PR renames * intermediateRDDStorageLevel ->

spark git commit: [SPARK-14533][MLLIB] RowMatrix.computeCovariance inaccurate when values are very large (partial fix)

2016-04-30 Thread meng
Repository: spark Updated Branches: refs/heads/master f86f71763 -> 5886b6217 [SPARK-14533][MLLIB] RowMatrix.computeCovariance inaccurate when values are very large (partial fix) ## What changes were proposed in this pull request? Fix for part of SPARK-14533: trivial simplification and more

spark git commit: [MINOR][EXAMPLE] Use SparkSession instead of SQLContext in RDDRelation.scala

2016-04-30 Thread rxin
Repository: spark Updated Branches: refs/heads/master 3d09ceeef -> f86f71763 [MINOR][EXAMPLE] Use SparkSession instead of SQLContext in RDDRelation.scala ## What changes were proposed in this pull request? Now, `SQLContext` is used for backward-compatibility, we had better use

spark git commit: [SPARK-14850][.2][ML] use UnsafeArrayData.fromPrimitiveArray in ml.VectorUDT/MatrixUDT

2016-04-30 Thread meng
Repository: spark Updated Branches: refs/heads/master 73c20bf32 -> 3d09ceeef [SPARK-14850][.2][ML] use UnsafeArrayData.fromPrimitiveArray in ml.VectorUDT/MatrixUDT ## What changes were proposed in this pull request? This PR uses `UnsafeArrayData.fromPrimitiveArray` to implement

spark git commit: [SPARK-14831][SPARKR] Make the SparkR MLlib API more consistent with Spark

2016-04-30 Thread meng
Repository: spark Updated Branches: refs/heads/master 43b149fb8 -> bc36fe6e8 [SPARK-14831][SPARKR] Make the SparkR MLlib API more consistent with Spark ## What changes were proposed in this pull request? This PR splits the MLlib algorithms into two flavors: - the R flavor, which tries to

spark git commit: [SPARK-14391][LAUNCHER] Fix launcher communication test, take 2.

2016-04-30 Thread rxin
Repository: spark Updated Branches: refs/heads/master bc36fe6e8 -> 73c20bf32 [SPARK-14391][LAUNCHER] Fix launcher communication test, take 2. There's actually a race here: the state of the handler was changed before the connection was set, so the test code could be notified of the state

spark git commit: [SPARK-14850][ML] convert primitive array from/to unsafe array directly in VectorUDT/MatrixUDT

2016-04-30 Thread meng
Repository: spark Updated Branches: refs/heads/master 4bac703eb -> 43b149fb8 [SPARK-14850][ML] convert primitive array from/to unsafe array directly in VectorUDT/MatrixUDT ## What changes were proposed in this pull request? This PR adds `fromPrimitiveArray` and `toPrimitiveArray` in