spark git commit: [SPARK-7922] [MLLIB] use DataFrames for user/item factors in ALSModel

2015-05-28 Thread meng
languages. This PR also removed fitting params from `ALSModel`. coderxiang Author: Xiangrui Meng m...@databricks.com Closes #6468 from mengxr/SPARK-7922 and squashes the following commits: 7bfb1d5 [Xiangrui Meng] update ALSModel in PySpark 1ba5607 [Xiangrui Meng] use DataFrames for user/item factors

spark git commit: [MINOR] fix RegressionEvaluator doc

2015-05-28 Thread meng
Repository: spark Updated Branches: refs/heads/master c45d58c14 -> 834e69952 [MINOR] fix RegressionEvaluator doc `make clean html` under `python/doc` returns ~~~ /Users/meng/src/spark/python/pyspark/ml/evaluation.py:docstring of pyspark.ml.evaluation.RegressionEvaluator.setParams:3: WARNING

spark git commit: [MINOR] fix RegressionEvaluator doc

2015-05-28 Thread meng
Repository: spark Updated Branches: refs/heads/branch-1.4 6e99dd5d0 -> 1d49d8c3f [MINOR] fix RegressionEvaluator doc `make clean html` under `python/doc` returns ~~~ /Users/meng/src/spark/python/pyspark/ml/evaluation.py:docstring of pyspark.ml.evaluation.RegressionEvaluator.setParams:3

spark git commit: [SPARK-7926] [PYSPARK] use the official Pyrolite release

2015-05-28 Thread meng
upgrade to 4.6 because we don't have enough time for QA. I excluded `serpent` from its dependencies because we don't use it in Spark. ~~~ [info] +-net.jpountz.lz4:lz4:1.3.0 [info] +-net.razorvine:pyrolite:4.4 [info] +-net.sf.py4j:py4j:0.8.2.1 ~~~ davies Author: Xiangrui Meng m

spark git commit: [SPARK-7926] [PYSPARK] use the official Pyrolite release

2015-05-28 Thread meng
upgrade to 4.6 because we don't have enough time for QA. I excluded `serpent` from its dependencies because we don't use it in Spark. ~~~ [info] +-net.jpountz.lz4:lz4:1.3.0 [info] +-net.razorvine:pyrolite:4.4 [info] +-net.sf.py4j:py4j:0.8.2.1 ~~~ davies Author: Xiangrui Meng m

spark git commit: [SPARK-7535] [.1] [MLLIB] minor changes to the pipeline API

2015-05-27 Thread meng
: Xiangrui Meng m...@databricks.com Closes #6392 from mengxr/SPARK-7535.1 and squashes the following commits: 5ff5af8 [Xiangrui Meng] add unit test for CV.validateParams f1f8369 [Xiangrui Meng] update CV.validateParams() to test estimatorParamMaps 607445d [Xiangrui Meng] merge master 8716f5f [Xiangrui

spark git commit: [SPARK-7883] [DOCS] [MLLIB] Fixing broken trainImplicit Scala example in MLlib Collaborative Filtering documentation.

2015-05-26 Thread meng
in MLlib Collaborative Filtering documentation to match one of the possible ALS.trainImplicit function signatures. (cherry picked from commit 0463428b6e8f364f0b1f39445a60cd85ae7c07bc) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit

spark git commit: [SPARK-7883] [DOCS] [MLLIB] Fixing broken trainImplicit Scala example in MLlib Collaborative Filtering documentation.

2015-05-26 Thread meng
in MLlib Collaborative Filtering documentation to match one of the possible ALS.trainImplicit function signatures. (cherry picked from commit 0463428b6e8f364f0b1f39445a60cd85ae7c07bc) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit

spark git commit: [SPARK-7883] [DOCS] [MLLIB] Fixing broken trainImplicit Scala example in MLlib Collaborative Filtering documentation.

2015-05-26 Thread meng
in MLlib Collaborative Filtering documentation to match one of the possible ALS.trainImplicit function signatures. (cherry picked from commit 0463428b6e8f364f0b1f39445a60cd85ae7c07bc) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit

spark git commit: [SPARK-7883] [DOCS] [MLLIB] Fixing broken trainImplicit Scala example in MLlib Collaborative Filtering documentation.

2015-05-26 Thread meng
/0463428b Diff: http://git-wip-us.apache.org/repos/asf/spark/diff/0463428b Branch: refs/heads/master Commit: 0463428b6e8f364f0b1f39445a60cd85ae7c07bc Parents: 8f20824 Author: Mike Dusenberry dusenberr...@gmail.com Authored: Tue May 26 18:08:57 2015 -0700 Committer: Xiangrui Meng m...@databricks.com

spark git commit: [SPARK-7844] [MLLIB] Fix broken tests in KernelDensity

2015-05-26 Thread meng
-7844] Fix broken tests in KernelDensity (cherry picked from commit 61664732b25b35f94be35a42cde651cbfd0e02b7) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/51d98b0e Tree: http

spark git commit: [SPARK-7844] [MLLIB] Fix broken tests in KernelDensity

2015-05-26 Thread meng
Branch: refs/heads/master Commit: 61664732b25b35f94be35a42cde651cbfd0e02b7 Parents: b7d8085 Author: MechCoder manojkumarsivaraj...@gmail.com Authored: Tue May 26 13:21:00 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Tue May 26 13:21:00 2015 -0700

spark git commit: [SPARK-7748] [MLLIB] Graduate spark.ml from alpha

2015-05-26 Thread meng
. jkbradley harsha2010 Author: Xiangrui Meng m...@databricks.com Closes #6417 from mengxr/SPARK-7748 and squashes the following commits: effbccd [Xiangrui Meng] organize imports c15028e [Xiangrui Meng] added missing docs 1b2e5f8 [Xiangrui Meng] update package doc 73ca791 [Xiangrui Meng] alpha -> ex

spark git commit: [SPARK-7833] [ML] Add python wrapper for RegressionEvaluator

2015-05-24 Thread meng
] cleanup 7623b7d [Ram Sriharsha] python style fix 9743f83 [Ram Sriharsha] [SPARK-7833][ml] Add python wrapper for RegressionEvaluator (cherry picked from commit 65c696ecc0a913bbe1c8b8399d811da87e4c4343) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf

spark git commit: [SPARK-7833] [ML] Add python wrapper for RegressionEvaluator

2015-05-24 Thread meng
/asf/spark/tree/65c696ec Diff: http://git-wip-us.apache.org/repos/asf/spark/diff/65c696ec Branch: refs/heads/master Commit: 65c696ecc0a913bbe1c8b8399d811da87e4c4343 Parents: ed21476 Author: Ram Sriharsha rsriharsha@hw11853.local Authored: Sun May 24 10:36:02 2015 -0700 Committer: Xiangrui Meng m

spark git commit: [SPARK-7578] [ML] [DOC] User guide for spark.ml Normalizer, IDF, StandardScaler

2015-05-22 Thread meng
: Xiangrui Meng m...@databricks.com Committed: Thu May 21 22:59:45 2015 -0700 -- docs/ml-features.md | 224 --- .../spark/ml/feature/JavaHashingTFSuite.java| 17 +- .../spark/ml/feature

spark git commit: [SPARK-7404] [ML] Add RegressionEvaluator to spark.ml

2015-05-22 Thread meng
f93b812 [Ram Sriharsha] fix test 1b6ebb3 [Ram Sriharsha] [SPARK-7404][ml] Add RegressionEvaluator to spark.ml (cherry picked from commit f490b3b4c706c92aa65d000b9d885f4d160a5f39) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http

spark git commit: [SPARK-7404] [ML] Add RegressionEvaluator to spark.ml

2015-05-22 Thread meng
: http://git-wip-us.apache.org/repos/asf/spark/diff/f490b3b4 Branch: refs/heads/master Commit: f490b3b4c706c92aa65d000b9d885f4d160a5f39 Parents: 3b68cb0 Author: Ram Sriharsha rsriharsha@hw11853.local Authored: Fri May 22 09:59:44 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Fri

spark git commit: [SPARK-7753] [MLLIB] Update KernelDensity API

2015-05-21 Thread meng
` method is removed from `Statistics`. The implementation is updated using BLAS, while the algorithm remains the same. sryza srowen Author: Xiangrui Meng m...@databricks.com Closes #6279 from mengxr/SPARK-7753 and squashes the following commits: 4cdfadc [Xiangrui Meng] add example code in the doc

spark git commit: [SPARK-7753] [MLLIB] Update KernelDensity API

2015-05-21 Thread meng
` method is removed from `Statistics`. The implementation is updated using BLAS, while the algorithm remains the same. sryza srowen Author: Xiangrui Meng m...@databricks.com Closes #6279 from mengxr/SPARK-7753 and squashes the following commits: 4cdfadc [Xiangrui Meng] add example code in the doc

spark git commit: [SPARK-7585] [ML] [DOC] VectorIndexer user guide section

2015-05-21 Thread meng
and Java unit test for it. (cherry picked from commit 6d75ed7e5ccf6c58143de4608115f9a2b3ff6cf4) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/e29b811e Tree: http://git-wip

spark git commit: [SPARK-7498] [MLLIB] add varargs back to setDefault

2015-05-21 Thread meng
Author: Xiangrui Meng m...@databricks.com Closes #6320 from mengxr/SPARK-7498 and squashes the following commits: 74a7259 [Xiangrui Meng] add varargs back to setDefault (cherry picked from commit cdc7c055c931c4c931a11b510de473455f3256da) Signed-off-by: Xiangrui Meng m...@databricks.com Project

spark git commit: [SPARK-7498] [MLLIB] add varargs back to setDefault

2015-05-21 Thread meng
Author: Xiangrui Meng m...@databricks.com Closes #6320 from mengxr/SPARK-7498 and squashes the following commits: 74a7259 [Xiangrui Meng] add varargs back to setDefault Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/cdc7c055

spark git commit: [SPARK-7585] [ML] [DOC] VectorIndexer user guide section

2015-05-21 Thread meng
/master Commit: 6d75ed7e5ccf6c58143de4608115f9a2b3ff6cf4 Parents: 15680ae Author: Joseph K. Bradley jos...@databricks.com Authored: Thu May 21 13:05:48 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Thu May 21 13:05:48 2015 -0700

spark git commit: [DOCS] [MLLIB] Fixing broken link in MLlib Linear Methods documentation.

2015-05-21 Thread meng
e4136ea6c457bc74cee312aa14974498ab4633eb) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/2cc7907d Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/2cc7907d Diff: http://git-wip-us.apache.org

spark git commit: [SPARK-7794] [MLLIB] update RegexTokenizer default settings

2015-05-21 Thread meng
: true, pattern: \\s+}`. jkbradley Author: Xiangrui Meng m...@databricks.com Closes #6330 from mengxr/SPARK-7794 and squashes the following commits: 5ee7cde [Xiangrui Meng] update RegexTokenizer default settings Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip

spark git commit: [SPARK-7794] [MLLIB] update RegexTokenizer default settings

2015-05-21 Thread meng
to `{gaps: true, pattern: \\s+}`. jkbradley Author: Xiangrui Meng m...@databricks.com Closes #6330 from mengxr/SPARK-7794 and squashes the following commits: 5ee7cde [Xiangrui Meng] update RegexTokenizer default settings (cherry picked from commit f5db4b416c922db7a8f1b0c098b4f08647106231) Signed-off

spark git commit: [SPARK-7793] [MLLIB] Use getOrElse for getting the threshold of SVM model

2015-05-21 Thread meng
] fix bug in topBykey and update test (cherry picked from commit 4f572008f804068c1a81cc334ff2367dbeae6493) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/f6a29c72 Tree: http

spark git commit: [SPARK-7537] [MLLIB] spark.mllib API updates

2015-05-20 Thread meng
` in the `RankingMetrics` constructor. Author: Xiangrui Meng m...@databricks.com Closes #6280 from mengxr/SPARK-7537 and squashes the following commits: 1bd2583 [Xiangrui Meng] organize imports 94afa7a [Xiangrui Meng] mark all toPMML methods experimental 4c40da1 [Xiangrui Meng] mention the factory method

spark git commit: [SPARK-7537] [MLLIB] spark.mllib API updates

2015-05-20 Thread meng
` in the `RankingMetrics` constructor. Author: Xiangrui Meng m...@databricks.com Closes #6280 from mengxr/SPARK-7537 and squashes the following commits: 1bd2583 [Xiangrui Meng] organize imports 94afa7a [Xiangrui Meng] mark all toPMML methods experimental 4c40da1 [Xiangrui Meng] mention the factory method

spark git commit: [SPARK-7774] [MLLIB] add sqlContext to MLlibTestSparkContext

2015-05-20 Thread meng
Repository: spark Updated Branches: refs/heads/master 42c592adb -> ddec173cb [SPARK-7774] [MLLIB] add sqlContext to MLlibTestSparkContext to simplify test suites that require a SQLContext. Author: Xiangrui Meng m...@databricks.com Closes #6303 from mengxr/SPARK-7774 and squashes

spark git commit: [SPARK-7774] [MLLIB] add sqlContext to MLlibTestSparkContext

2015-05-20 Thread meng
Repository: spark Updated Branches: refs/heads/branch-1.4 4fd674336 -> 9711e9bf1 [SPARK-7774] [MLLIB] add sqlContext to MLlibTestSparkContext to simplify test suites that require a SQLContext. Author: Xiangrui Meng m...@databricks.com Closes #6303 from mengxr/SPARK-7774 and squashes

spark git commit: [SPARK-7654] [MLLIB] Migrate MLlib to the DataFrame reader/writer API

2015-05-20 Thread meng
Repository: spark Updated Branches: refs/heads/branch-1.4 10698e113 -> 996e2d4b3 [SPARK-7654] [MLLIB] Migrate MLlib to the DataFrame reader/writer API parquetFile -> read.parquet rxin Author: Xiangrui Meng m...@databricks.com Closes #6281 from mengxr/SPARK-7654 and squashes the following

spark git commit: [SPARK-6094] [MLLIB] Add MultilabelMetrics in PySpark/MLlib

2015-05-20 Thread meng
/spark/diff/98a46f9d Branch: refs/heads/master Commit: 98a46f9dffec294386f6c39acafa7f11adb87a8f Parents: 589b12f Author: Yanbo Liang yblia...@gmail.com Authored: Wed May 20 07:55:51 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Wed May 20 07:55:51 2015 -0700

spark git commit: [SPARK-6094] [MLLIB] Add MultilabelMetrics in PySpark/MLlib

2015-05-20 Thread meng
commits: b8e3343 [Yanbo Liang] Add MultilabelMetrics in PySpark/MLlib (cherry picked from commit 98a46f9dffec294386f6c39acafa7f11adb87a8f) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark

spark git commit: [SPARK-7654] [MLLIB] Migrate MLlib to the DataFrame reader/writer API

2015-05-20 Thread meng
Repository: spark Updated Branches: refs/heads/master 3ddf051ee -> 589b12f8e [SPARK-7654] [MLLIB] Migrate MLlib to the DataFrame reader/writer API parquetFile -> read.parquet rxin Author: Xiangrui Meng m...@databricks.com Closes #6281 from mengxr/SPARK-7654 and squashes the following commits

spark git commit: [SPARK-7744] [DOCS] [MLLIB] Distributed matrix section in MLlib Data Types documentation should be reordered.

2015-05-19 Thread meng
distributed matrix. This will improve comprehensibility of the Distributed matrix section, especially for the new reader. (cherry picked from commit 3860520633770cc5719b2cdebe6dc3608798386d) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark

spark git commit: [SPARK-7744] [DOCS] [MLLIB] Distributed matrix section in MLlib Data Types documentation should be reordered.

2015-05-19 Thread meng
distributed matrix. This will improve comprehensibility of the Distributed matrix section, especially for the new reader. (cherry picked from commit 3860520633770cc5719b2cdebe6dc3608798386d) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark

spark git commit: [SPARK-7744] [DOCS] [MLLIB] Distributed matrix section in MLlib Data Types documentation should be reordered.

2015-05-19 Thread meng
/spark/tree/38605206 Diff: http://git-wip-us.apache.org/repos/asf/spark/diff/38605206 Branch: refs/heads/master Commit: 3860520633770cc5719b2cdebe6dc3608798386d Parents: 2bc5e06 Author: Mike Dusenberry dusenberr...@gmail.com Authored: Tue May 19 17:18:08 2015 -0700 Committer: Xiangrui Meng m

spark git commit: [SPARK-7681] [MLLIB] remove mima excludes for 1.3

2015-05-19 Thread meng
Repository: spark Updated Branches: refs/heads/branch-1.4 ac3197e1b -> 2cce6bfea [SPARK-7681] [MLLIB] remove mima excludes for 1.3 These excludes are unnecessary for 1.3 because the changes were made in 1.4.x. Author: Xiangrui Meng m...@databricks.com Closes #6254 from mengxr/SPARK-7681-mima

spark git commit: [SPARK-7681] [MLLIB] remove mima excludes for 1.3

2015-05-19 Thread meng
Repository: spark Updated Branches: refs/heads/master df34793ad -> 6845cb2ff [SPARK-7681] [MLLIB] remove mima excludes for 1.3 These excludes are unnecessary for 1.3 because the changes were made in 1.4.x. Author: Xiangrui Meng m...@databricks.com Closes #6254 from mengxr/SPARK-7681-mima

spark git commit: [SPARK-7652] [MLLIB] Update the implementation of naive Bayes prediction with BLAS

2015-05-19 Thread meng
: http://git-wip-us.apache.org/repos/asf/spark/diff/c12dff9b Branch: refs/heads/master Commit: c12dff9b82e4869f866a9b96ce0bf05503dd7dda Parents: 68fb2a4 Author: Liang-Chi Hsieh vii...@gmail.com Authored: Tue May 19 13:53:08 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Tue May 19

spark git commit: [SPARK-7652] [MLLIB] Update the implementation of naive Bayes prediction with BLAS

2015-05-19 Thread meng
] Remove toDense. 1b6cdfe [Liang-Chi Hsieh] Update the implementation of naive Bayes prediction with BLAS. (cherry picked from commit c12dff9b82e4869f866a9b96ce0bf05503dd7dda) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http

spark git commit: [SPARK-7047] [ML] ml.Model optional parent support

2015-05-19 Thread meng
jkbradley/parent-optional and squashes the following commits: d501774 [Joseph K. Bradley] Made Model.parent transient. Added Model.hasParent to test for null parent (cherry picked from commit fb90273212dc7241c9a0c3446e25e0e0b9377750) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http

spark git commit: [SPARK-7678] [ML] Fix default random seed in HasSeed

2015-05-19 Thread meng
/repos/asf/spark/tree/7b16e9f2 Diff: http://git-wip-us.apache.org/repos/asf/spark/diff/7b16e9f2 Branch: refs/heads/master Commit: 7b16e9f2118fbfbb1c0ba957161fe500c9aff82a Parents: fb90273 Author: Joseph K. Bradley jos...@databricks.com Authored: Tue May 19 10:57:47 2015 -0700 Committer: Xiangrui Meng m

spark git commit: [SPARK-7678] [ML] Fix default random seed in HasSeed

2015-05-19 Thread meng
seeds from Word2Vec and ALS. Changed shared param HasSeed to have default based on hashCode of class name, instead of random number. (cherry picked from commit 7b16e9f2118fbfbb1c0ba957161fe500c9aff82a) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos

spark git commit: [SPARK-6657] [PYSPARK] Fix doc warnings

2015-05-18 Thread meng
Repository: spark Updated Branches: refs/heads/master e32c0f69f -> 1ecfac6e3 [SPARK-6657] [PYSPARK] Fix doc warnings Fixed the following warnings in `make clean html` under `python/docs`: ~~~ /Users/meng/src/spark/python/pyspark/mllib/evaluation.py:docstring

spark git commit: [SPARK-6657] [PYSPARK] Fix doc warnings

2015-05-18 Thread meng
Repository: spark Updated Branches: refs/heads/branch-1.4 0e7cd8ff8 -> 2c94ffe7e [SPARK-6657] [PYSPARK] Fix doc warnings Fixed the following warnings in `make clean html` under `python/docs`: ~~~ /Users/meng/src/spark/python/pyspark/mllib/evaluation.py:docstring

spark git commit: [SPARK-7272] [MLLIB] User guide for PMML model export

2015-05-18 Thread meng
-export.md 680dc33 [Vincenzo Selvaggio] Update mllib-pmml-model-export.md 2e298b5 [Vincenzo Selvaggio] Update mllib-pmml-model-export.md a932f51 [Vincenzo Selvaggio] Create mllib-pmml-model-export.md (cherry picked from commit 814b3dabdf01abc7a2f25aa32284caccadeb7798) Signed-off-by: Xiangrui Meng m

spark git commit: [SPARK-7272] [MLLIB] User guide for PMML model export

2015-05-18 Thread meng
Authored: Mon May 18 08:46:33 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Mon May 18 08:46:33 2015 -0700 -- docs/mllib-guide.md | 1 + docs/mllib-pmml-model-export.md | 86

spark git commit: [SPARK-7681] [MLLIB] Add SparseVector support for gemv

2015-05-18 Thread meng
/master' into sparsevector_gemv c069507 [Liang-Chi Hsieh] Add SparseVector support for gemv with DenseMatrix. (cherry picked from commit d03638cc2d414cee9ac7481084672e454495dfc1) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http

spark git commit: [SPARK-7681] [MLLIB] Add SparseVector support for gemv

2015-05-18 Thread meng
: http://git-wip-us.apache.org/repos/asf/spark/diff/d03638cc Branch: refs/heads/master Commit: d03638cc2d414cee9ac7481084672e454495dfc1 Parents: 3a60038 Author: Liang-Chi Hsieh vii...@gmail.com Authored: Mon May 18 21:32:36 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Mon May

spark git commit: [SPARK-7694] [MLLIB] Use getOrElse for getting the threshold of LR model

2015-05-17 Thread meng
: 775e6f9909d4495cbc11c377508b43482d782742 Parents: ff71d34 Author: Shuo Xiang shuoxiang...@gmail.com Authored: Sun May 17 21:16:52 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Sun May 17 21:16:52 2015 -0700 -- .../org/apache/spark/mllib

spark git commit: [SPARK-7694] [MLLIB] Use getOrElse for getting the threshold of LR model

2015-05-17 Thread meng
commit 775e6f9909d4495cbc11c377508b43482d782742) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/0b6bc8a2 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/0b6bc8a2 Diff: http

spark git commit: [SPARK-7668] [MLLIB] Preserve isTransposed property for Matrix after calling map function

2015-05-15 Thread meng
/fix_matrix_map and squashes the following commits: 2a7cc97 [Liang-Chi Hsieh] Preserve isTransposed property for Matrix after calling map function. (cherry picked from commit f96b85ab44b82736363764ea39ee62884007f4a3) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip

spark git commit: [SPARK-7668] [MLLIB] Preserve isTransposed property for Matrix after calling map function

2015-05-15 Thread meng
/fix_matrix_map and squashes the following commits: 2a7cc97 [Liang-Chi Hsieh] Preserve isTransposed property for Matrix after calling map function. (cherry picked from commit f96b85ab44b82736363764ea39ee62884007f4a3) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip

spark git commit: [SPARK-7668] [MLLIB] Preserve isTransposed property for Matrix after calling map function

2015-05-15 Thread meng
Meng m...@databricks.com Committed: Fri May 15 10:03:29 2015 -0700 -- .../src/main/scala/org/apache/spark/mllib/linalg/Matrices.scala | 5 +++-- 1 file changed, 3 insertions(+), 2 deletions

spark git commit: [SPARK-7568] [ML] ml.LogisticRegression doesn't output the right prediction

2015-05-14 Thread meng
from dbtsai/lor-example and squashes the following commits: ac63ce4 [DB Tsai] first commit (cherry picked from commit c1080b6fddb22d84694da2453e46a03fbc041576) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip

spark git commit: [SPARK-7612] [MLLIB] update NB training to use mllib's BLAS

2015-05-13 Thread meng
Repository: spark Updated Branches: refs/heads/master 3113da9c7 -> d5f18de16 [SPARK-7612] [MLLIB] update NB training to use mllib's BLAS This is similar to the changes to k-means, which gives us better control over the performance. dbtsai Author: Xiangrui Meng m...@databricks.com Closes #6128

spark git commit: [SPARK-7612] [MLLIB] update NB training to use mllib's BLAS

2015-05-13 Thread meng
Repository: spark Updated Branches: refs/heads/branch-1.4 2d4a961f8 -> 82f387fe2 [SPARK-7612] [MLLIB] update NB training to use mllib's BLAS This is similar to the changes to k-means, which gives us better control over the performance. dbtsai Author: Xiangrui Meng m...@databricks.com Closes

spark git commit: [SPARK-7593] [ML] Python Api for ml.feature.Bucketizer

2015-05-13 Thread meng
://git-wip-us.apache.org/repos/asf/spark/diff/5db18ba6 Branch: refs/heads/master Commit: 5db18ba6e1bd8c6307c41549176c53590cf344a0 Parents: 51030b8 Author: Burak Yavuz brk...@gmail.com Authored: Wed May 13 13:21:36 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Wed May 13 13:21:36

spark git commit: [SPARK-7593] [ML] Python Api for ml.feature.Bucketizer

2015-05-13 Thread meng
commits: 05285be [Burak Yavuz] added sphinx doc 6abb6ed [Burak Yavuz] added support for Bucketizer (cherry picked from commit 5db18ba6e1bd8c6307c41549176c53590cf344a0) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip

spark git commit: [SPARK-7382] [MLLIB] Feature Parity in PySpark for ml.classification

2015-05-13 Thread meng
and squashes the following commits: dd78237 [Burak Yavuz] fix style 1048e29 [Burak Yavuz] ready for PR (cherry picked from commit df2fb1305aba6781017b0973b0965b664f835e31) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip

spark git commit: [SPARK-7382] [MLLIB] Feature Parity in PySpark for ml.classification

2015-05-13 Thread meng
://git-wip-us.apache.org/repos/asf/spark/diff/df2fb130 Branch: refs/heads/master Commit: df2fb1305aba6781017b0973b0965b664f835e31 Parents: 61e05fc Author: Burak Yavuz brk...@gmail.com Authored: Wed May 13 15:13:09 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Wed May 13 15:13:09

spark git commit: [SPARK-7487] [ML] Feature Parity in PySpark for ml.regression

2015-05-12 Thread meng
8e935b0a214f8b477fe9579fbf6a2d0a27b59118) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/432694c1 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/432694c1 Diff: http

spark git commit: [SPARK-7571] [MLLIB] rename Math to math

2015-05-12 Thread meng
Repository: spark Updated Branches: refs/heads/branch-1.4 32819fcb7 -> 217c9 [SPARK-7571] [MLLIB] rename Math to math `scala.Math` is deprecated since 2.8. This PR only touches `Math` usages in MLlib. dbtsai Author: Xiangrui Meng m...@databricks.com Closes #6092 from mengxr/SPARK-7571

spark git commit: [SPARK-6092] [MLLIB] Add RankingMetrics in PySpark/MLlib

2015-05-11 Thread meng
newRankingMetrics 33f649c [Yanbo Liang] Add RankingMetrics in PySpark/MLlib (cherry picked from commit 042dda3c5c25b5ecb6ae4fd37c85b211b01c187b) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark

spark git commit: [SPARK-7522] [EXAMPLES] Removed angle brackets from dataFormat option

2015-05-11 Thread meng
from BryanCutler/dataFormat-option-7522 and squashes the following commits: b34afb4 [Bryan Cutler] [SPARK-7522] Removed angle brackets from dataFormat option (cherry picked from commit 4f8a15519267ac205424270155254382cc2d3690) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http

spark git commit: [SPARK-7522] [EXAMPLES] Removed angle brackets from dataFormat option

2015-05-11 Thread meng
Committer: Xiangrui Meng m...@databricks.com Committed: Mon May 11 09:23:47 2015 -0700 -- .../scala/org/apache/spark/examples/ml/DecisionTreeExample.scala | 2 +- .../src/main/scala/org/apache/spark/examples/ml/GBTExample.scala | 2

spark git commit: [SPARK-6091] [MLLIB] Add MulticlassMetrics in PySpark/MLlib

2015-05-10 Thread meng
the following commits: bb3e4ba [Yanbo Liang] trigger jenkins 53c045d [Yanbo Liang] keep compatibility for python 2.6 972d5ac [Yanbo Liang] Add MulticlassMetrics in PySpark/MLlib (cherry picked from commit bf7e81a51cd81706570615cd67362c86602dec88) Signed-off-by: Xiangrui Meng m...@databricks.com Project

spark git commit: [SPARK-6091] [MLLIB] Add MulticlassMetrics in PySpark/MLlib

2015-05-10 Thread meng
Committer: Xiangrui Meng m...@databricks.com Committed: Sun May 10 00:57:14 2015 -0700 -- .../mllib/evaluation/MulticlassMetrics.scala| 8 ++ python/pyspark/mllib/evaluation.py | 129 +++ 2 files

spark git commit: [SPARK-7474] [MLLIB] update ParamGridBuilder doctest

2015-05-08 Thread meng
/02ad2fd4-f50c-11e4-8c04-e58d1a61ad69.png) Author: Xiangrui Meng m...@databricks.com Closes #6001 from mengxr/SPARK-7474 and squashes the following commits: b94b11d [Xiangrui Meng] update ParamGridBuilder doctest Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip

spark git commit: [SPARK-7474] [MLLIB] update ParamGridBuilder doctest

2015-05-08 Thread meng
/7531290/02ad2fd4-f50c-11e4-8c04-e58d1a61ad69.png) Author: Xiangrui Meng m...@databricks.com Closes #6001 from mengxr/SPARK-7474 and squashes the following commits: b94b11d [Xiangrui Meng] update ParamGridBuilder doctest (cherry picked from commit 65afd3ce8b8a0b00f4ea8294eac14b72e964872d) Signed-off

spark git commit: [SPARK-7383] [ML] Feature Parity in PySpark for ml.features

2015-05-08 Thread meng
) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/85e11544 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/85e11544 Diff: http://git-wip-us.apache.org/repos/asf/spark/diff

spark git commit: [SPARK-7383] [ML] Feature Parity in PySpark for ml.features

2015-05-08 Thread meng
brk...@gmail.com Authored: Fri May 8 11:14:39 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Fri May 8 11:14:39 2015 -0700 -- .../spark/ml/feature/PolynomialExpansion.scala | 2 +- .../org/apache/spark/ml

spark git commit: [SPARK-7390] [SQL] Only merge other CovarianceCounter when its count is greater than zero

2015-05-08 Thread meng
:16 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Fri May 8 14:41:16 2015 -0700 -- .../sql/execution/stat/StatFunctions.scala | 22 +++- 1 file changed, 12 insertions(+), 10 deletions

spark git commit: [SPARK-7498] [ML] removed varargs annotation from Params.setDefaults

2015-05-08 Thread meng
jkbradley/revert-varargs and squashes the following commits: 098ed39 [Joseph K. Bradley] removed varargs annotation from Params.setDefaults taking multiple ParamPairs (cherry picked from commit 29926238418223b0888d418d163feebf0217b35e) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http

spark git commit: [SPARK-7498] [ML] removed varargs annotation from Params.setDefaults

2015-05-08 Thread meng
Committer: Xiangrui Meng m...@databricks.com Committed: Fri May 8 21:55:54 2015 -0700 -- mllib/src/main/scala/org/apache/spark/ml/param/params.scala | 3 ++- mllib/src/test/java/org/apache/spark/ml/param/JavaTestParams.java

spark git commit: [SPARK-7262] [ML] Binary LogisticRegression with L1/L2 (elastic net) using OWLQN in new ML package

2015-05-08 Thread meng
May 8 21:43:05 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Fri May 8 21:43:05 2015 -0700 -- .../ml/classification/LogisticRegression.scala | 436 ++- .../spark/ml/regression

spark git commit: [SPARK-7262] [ML] Binary LogisticRegression with L1/L2 (elastic net) using OWLQN in new ML package

2015-05-08 Thread meng
Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/80bbe72d Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/80bbe72d Diff: http://git-wip-us.apache.org/repos/asf/spark/diff/80bbe72d Branch: refs

spark git commit: [SPARK-7488] [ML] Feature Parity in PySpark for ml.recommendation

2015-05-08 Thread meng
: 84bf931f36edf1f319c9116f7f326959a6118991 Parents: 54e6fa0 Author: Burak Yavuz brk...@gmail.com Authored: Fri May 8 17:24:32 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Fri May 8 17:24:32 2015 -0700 -- .../apache/spark/ml

spark git commit: [SPARK-7429] [ML] Params cleanups

2015-05-07 Thread meng
be annotated with varargs. I thought it would not work before, but it apparently does. CrossValidator.transform should call transformSchema since the underlying Model might be a PipelineModel (cherry picked from commit 4f87e9562aa0dfe5467d7fbaba9278213106377c) Signed-off-by: Xiangrui Meng m

spark git commit: [SPARK-7429] [ML] Params cleanups

2015-05-07 Thread meng
7 01:28:44 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Thu May 7 01:28:44 2015 -0700 -- mllib/src/main/scala/org/apache/spark/ml/param/params.scala | 4 +--- .../main/scala/org/apache/spark/ml/tuning

spark git commit: [SPARK-6093] [MLLIB] Add RegressionMetrics in PySpark/MLlib

2015-05-07 Thread meng
the following commits: 6934af3 [Yanbo Liang] change to @property aac3bc5 [Yanbo Liang] Add RegressionMetrics in PySpark/MLlib (cherry picked from commit 1712a7c7057bf6dd5da8aea1d7fbecdf96ea4b32) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo

spark git commit: [SPARK-6093] [MLLIB] Add RegressionMetrics in PySpark/MLlib

2015-05-07 Thread meng
/1712a7c7 Diff: http://git-wip-us.apache.org/repos/asf/spark/diff/1712a7c7 Branch: refs/heads/master Commit: 1712a7c7057bf6dd5da8aea1d7fbecdf96ea4b32 Parents: 068c315 Author: Yanbo Liang yblia...@gmail.com Authored: Thu May 7 11:18:32 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed

spark git commit: [SPARK-6948] [MLLIB] compress vectors in VectorAssembler

2015-05-07 Thread meng
Repository: spark Updated Branches: refs/heads/master 658a478d3 -> e43803b8f [SPARK-6948] [MLLIB] compress vectors in VectorAssembler The compression is based on storage. brkyvz Author: Xiangrui Meng m...@databricks.com Closes #5985 from mengxr/SPARK-6948 and squashes the following commits
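
To illustrate what "compression is based on storage" can mean, here is a minimal Python sketch, not the code from this commit. It assumes a dense vector costs 8 bytes per entry and a sparse vector costs 12 bytes per nonzero (an 8-byte value plus a 4-byte index), and keeps whichever representation is smaller. The names `dense_bytes`, `sparse_bytes`, and `compress` are hypothetical.

```python
from typing import List, Tuple, Union

# Hypothetical representations, for illustration only:
# a dense vector is a plain list of floats,
# a sparse vector is (size, indices, values).
Dense = List[float]
Sparse = Tuple[int, List[int], List[float]]


def dense_bytes(size: int) -> int:
    """Assumed storage cost of a dense vector: one 8-byte double per entry."""
    return 8 * size


def sparse_bytes(nnz: int) -> int:
    """Assumed storage cost of a sparse vector: 8-byte value + 4-byte index per nonzero."""
    return 12 * nnz


def compress(values: Dense) -> Union[Dense, Sparse]:
    """Return the representation with the smaller assumed storage cost."""
    indices = [i for i, v in enumerate(values) if v != 0.0]
    if sparse_bytes(len(indices)) < dense_bytes(len(values)):
        return (len(values), indices, [values[i] for i in indices])
    # Ties go to dense, which is cheaper to access.
    return list(values)
```

Under these assumed costs, a vector with one nonzero out of four entries is stored sparse, while a vector with two nonzeros out of three stays dense.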

spark git commit: [SPARK-7388] [SPARK-7383] wrapper for VectorAssembler in Python

2015-05-07 Thread meng
called `wCast` which is an internal function to obtain `Array[T]` from `Seq[T]` Author: Burak Yavuz brk...@gmail.com Author: Xiangrui Meng m...@databricks.com Closes #5930 from brkyvz/ml-feat and squashes the following commits: 73e745f [Burak Yavuz] Merge pull request #3 from mengxr/SPARK-7388

spark git commit: [SPARK-6940] [MLLIB] Add CrossValidator to Python ML pipeline API

2015-05-06 Thread meng
Repository: spark Updated Branches: refs/heads/master 9f019c722 -> 32cdc815c [SPARK-6940] [MLLIB] Add CrossValidator to Python ML pipeline API Since CrossValidator is a meta algorithm, we copy the implementation in Python. jkbradley Author: Xiangrui Meng m...@databricks.com Closes #5926

spark git commit: [SPARK-6940] [MLLIB] Add CrossValidator to Python ML pipeline API

2015-05-06 Thread meng
Repository: spark Updated Branches: refs/heads/branch-1.4 20f923771 -> 3e27a5437 [SPARK-6940] [MLLIB] Add CrossValidator to Python ML pipeline API Since CrossValidator is a meta algorithm, we copy the implementation in Python. jkbradley Author: Xiangrui Meng m...@databricks.com Closes #5926

[2/2] spark git commit: [SPARK-5995] [ML] Make Prediction dev API public

2015-05-06 Thread meng
-us.apache.org/repos/asf/spark/diff/1ad04dae Branch: refs/heads/master Commit: 1ad04dae038673a448f529c39b17817b78d6acd0 Parents: 7740996 Author: Joseph K. Bradley jos...@databricks.com Authored: Wed May 6 16:15:51 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Wed May 6 16:15:51 2015

[1/2] spark git commit: [SPARK-5995] [ML] Make Prediction dev API public

2015-05-06 Thread meng
Repository: spark Updated Branches: refs/heads/branch-1.4 14bcb84e8 -> b681b9312 http://git-wip-us.apache.org/repos/asf/spark/blob/b681b931/mllib/src/main/scala/org/apache/spark/mllib/linalg/Vectors.scala -- diff --git

[2/2] spark git commit: [SPARK-5995] [ML] Make Prediction dev API public

2015-05-06 Thread meng
their prediction methods for efficient computation of multiple output columns. (cherry picked from commit 1ad04dae038673a448f529c39b17817b78d6acd0) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos

[1/2] spark git commit: [SPARK-5995] [ML] Make Prediction dev API public

2015-05-06 Thread meng
Repository: spark Updated Branches: refs/heads/master 774099670 -> 1ad04dae0 http://git-wip-us.apache.org/repos/asf/spark/blob/1ad04dae/mllib/src/main/scala/org/apache/spark/mllib/linalg/Vectors.scala -- diff --git

spark git commit: [SPARK-7432] [MLLIB] disable cv doctest

2015-05-06 Thread meng
Repository: spark Updated Branches: refs/heads/master 14502d5e5 -> 773aa2525 [SPARK-7432] [MLLIB] disable cv doctest Temporarily disable flaky doctest for CrossValidator. jkbradley Author: Xiangrui Meng m...@databricks.com Closes #5962 from mengxr/disable-pyspark-cv-test and squashes

spark git commit: [SPARK-6267] [MLLIB] Python API for IsotonicRegression

2015-05-05 Thread meng
Repository: spark Updated Branches: refs/heads/master ba2b56614 -> 7b1457839 [SPARK-6267] [MLLIB] Python API for IsotonicRegression https://issues.apache.org/jira/browse/SPARK-6267 Author: Yanbo Liang yblia...@gmail.com Author: Xiangrui Meng m...@databricks.com Closes #5890 from yanboliang

spark git commit: [SPARK-6267] [MLLIB] Python API for IsotonicRegression

2015-05-05 Thread meng
Repository: spark Updated Branches: refs/heads/branch-1.4 8aa6681d5 -> 384ac3c11 [SPARK-6267] [MLLIB] Python API for IsotonicRegression https://issues.apache.org/jira/browse/SPARK-6267 Author: Yanbo Liang yblia...@gmail.com Author: Xiangrui Meng m...@databricks.com Closes #5890 from

spark git commit: [SPARK-7202] [MLLIB] [PYSPARK] Add SparseMatrixPickler to SerDe

2015-05-05 Thread meng
-7202 and squashes the following commits: 7e689dc [MechCoder] [SPARK-7202] Add SparseMatrixPickler to SerDe (cherry picked from commit 5ab652cdb8bef10214edd079502a7f49017579aa) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit

spark git commit: Closes #5591 Closes #5878

2015-05-05 Thread meng
/9d250e64 Diff: http://git-wip-us.apache.org/repos/asf/spark/diff/9d250e64 Branch: refs/heads/master Commit: 9d250e64dac263bcbbad6b023382ac7b5b592408 Parents: 5995ada Author: Xiangrui Meng m...@databricks.com Authored: Tue May 5 08:00:31 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed

spark git commit: [SPARK-7202] [MLLIB] [PYSPARK] Add SparseMatrixPickler to SerDe

2015-05-05 Thread meng
Diff: http://git-wip-us.apache.org/repos/asf/spark/diff/5ab652cd Branch: refs/heads/master Commit: 5ab652cdb8bef10214edd079502a7f49017579aa Parents: c6d1efb Author: MechCoder manojkumarsivaraj...@gmail.com Authored: Tue May 5 07:53:11 2015 -0700 Committer: Xiangrui Meng m...@databricks.com

spark git commit: [SPARK-6612] [MLLIB] [PYSPARK] Python KMeans parity

2015-05-05 Thread meng
Subramonian] added arguments in python tests 21eb84c [Hrishikesh Subramonian] Python Kmeans - setEpsilon, setInitializationSteps, k and computeCost added. (cherry picked from commit 5995ada96b661546a80657f2c5ed20604593e4aa) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip
