svn commit: r1696648 - in /spark: mllib/index.md site/mllib/index.html

2015-08-19 Thread meng
Author: meng Date: Wed Aug 19 19:11:08 2015 New Revision: 1696648 URL: http://svn.apache.org/r1696648 Log: update MLlib page for 1.5 Modified: spark/mllib/index.md spark/site/mllib/index.html Modified: spark/mllib/index.md URL: http://svn.apache.org/viewvc/spark/mllib/index.md?rev

spark git commit: [SPARK-8918] [MLLIB] [DOC] Add @since tags to mllib.clustering

2015-08-19 Thread meng
/72fdeb64630470f6f46cf3eed8ffbfe83a7c4659). MechCoder Closes #8256 Author: Xiangrui Meng m...@databricks.com Author: Xiaoqing Wang spark...@126.com Author: MechCoder manojkumarsivaraj...@gmail.com Closes #8288 from mengxr/SPARK-8918. Project: http://git-wip-us.apache.org/repos/asf/spark/repo

spark git commit: [SPARK-8918] [MLLIB] [DOC] Add @since tags to mllib.clustering

2015-08-19 Thread meng
/commit/72fdeb64630470f6f46cf3eed8ffbfe83a7c4659). MechCoder Closes #8256 Author: Xiangrui Meng m...@databricks.com Author: Xiaoqing Wang spark...@126.com Author: MechCoder manojkumarsivaraj...@gmail.com Closes #8288 from mengxr/SPARK-8918. (cherry picked from commit

spark git commit: [SPARK-9895] User Guide for RFormula Feature Transformer

2015-08-19 Thread meng
...@databricks.com Authored: Wed Aug 19 15:43:08 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Wed Aug 19 15:43:08 2015 -0700 -- docs/ml-features.md | 108 +++ .../org

spark git commit: [SPARK-9895] User Guide for RFormula Feature Transformer

2015-08-19 Thread meng
-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/56a37b01 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/56a37b01 Diff: http://git-wip-us.apache.org/repos/asf/spark/diff

spark git commit: [SPARK-10076] [ML] make MultilayerPerceptronClassifier layers and weights public

2015-08-18 Thread meng
17 23:57:02 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Mon Aug 17 23:57:02 2015 -0700 -- .../spark/ml/classification/MultilayerPerceptronClassifier.scala | 4 ++-- 1 file changed, 2 insertions(+), 2

spark git commit: [SPARK-10076] [ML] make MultilayerPerceptronClassifier layers and weights public

2015-08-18 Thread meng
```. Users can not get ```layers``` and ```weights``` from a ```MultilayerPerceptronClassificationModel``` currently. Author: Yanbo Liang yblia...@gmail.com Closes #8263 from yanboliang/mlp-public. (cherry picked from commit dd0614fd618ad28cb77aecfbd49bb319b98fdba0) Signed-off-by: Xiangrui Meng m

spark git commit: [SPARK-9900] [MLLIB] User guide for Association Rules

2015-08-18 Thread meng
picked from commit f5ea3912900ccdf23e2eb419a342bfe3c0c0b61b) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/7ff0e5d2 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree

spark git commit: [SPARK-10085] [MLLIB] [DOCS] removed unnecessary numpy array import

2015-08-18 Thread meng
picked from commit 8bae9015b7e7b4528ca2bc5180771cb95d2aac13) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/9bd2e6f7 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree

spark git commit: [SPARK-10085] [MLLIB] [DOCS] removed unnecessary numpy array import

2015-08-18 Thread meng
: 8bae9015b7e7b4528ca2bc5180771cb95d2aac13 Parents: 747c2ba Author: Piotr Migdal pmig...@gmail.com Authored: Tue Aug 18 12:59:28 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Tue Aug 18 12:59:28 2015 -0700 -- docs/mllib-linear

spark git commit: [SPARK-10029] [MLLIB] [DOC] Add Python examples for mllib IsotonicRegression user guide

2015-08-18 Thread meng
/master Commit: f4fa61effe34dae2f0eab0bef57b2dee220cf92f Parents: f5ea391 Author: Yanbo Liang yblia...@gmail.com Authored: Tue Aug 18 12:55:36 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Tue Aug 18 12:55:36 2015 -0700

spark git commit: [SPARK-9900] [MLLIB] User guide for Association Rules

2015-08-18 Thread meng
: f5ea3912900ccdf23e2eb419a342bfe3c0c0b61b Parents: c1840a8 Author: Feynman Liang fli...@databricks.com Authored: Tue Aug 18 12:53:57 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Tue Aug 18 12:53:57 2015 -0700 -- docs/mllib

spark git commit: [SPARK-8924] [MLLIB, DOCUMENTATION] Added @since tags to mllib.tree

2015-08-18 Thread meng
from commit 1dbffba37a84c62202befd3911d25888f958191d) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/56f4da26 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/56f4da26

spark git commit: [SPARK-8924] [MLLIB, DOCUMENTATION] Added @since tags to mllib.tree

2015-08-18 Thread meng
: 1dbffba37a84c62202befd3911d25888f958191d Parents: 492ac1f Author: Bryan Cutler bjcut...@us.ibm.com Authored: Tue Aug 18 14:58:30 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Tue Aug 18 14:58:30 2015 -0700 -- .../apache/spark/mllib

spark git commit: [SPARK-10032] [PYSPARK] [DOC] Add Python example for mllib LDAModel user guide

2015-08-18 Thread meng
: 747c2ba8006d5b86f3be8dfa9ace639042a35628 Parents: f4fa61e Author: Yanbo Liang yblia...@gmail.com Authored: Tue Aug 18 12:56:36 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Tue Aug 18 12:56:36 2015 -0700 -- docs/mllib

spark git commit: [SPARK-10032] [PYSPARK] [DOC] Add Python example for mllib LDAModel user guide

2015-08-18 Thread meng
. (cherry picked from commit 747c2ba8006d5b86f3be8dfa9ace639042a35628) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/ec7079f9 Tree: http://git-wip-us.apache.org/repos/asf/spark

spark git commit: [SPARK-10029] [MLLIB] [DOC] Add Python examples for mllib IsotonicRegression user guide

2015-08-18 Thread meng
yanboliang/spark-10029. (cherry picked from commit f4fa61effe34dae2f0eab0bef57b2dee220cf92f) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/80debff1 Tree: http://git-wip

spark git commit: [SPARK-8920] [MLLIB] Add @since tags to mllib.linalg

2015-08-17 Thread meng
: 088b11ec5949e135cb3db2a1ce136837e046c288 Parents: fdaf17f Author: Sameer Abhyankar sabhyankar@sabhyankar-MBP.Samavihome Authored: Mon Aug 17 16:00:23 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Mon Aug 17 16:00:23 2015 -0700

spark git commit: [SPARK-7707] User guide and example code for KernelDensity

2015-08-17 Thread meng
) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/5de0ffbd Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/5de0ffbd Diff: http://git-wip-us.apache.org/repos/asf/spark/diff

spark git commit: [SPARK-7808] [ML] add package doc for ml.feature

2015-08-17 Thread meng
Repository: spark Updated Branches: refs/heads/branch-1.5 bfb4c8425 - 35542504c [SPARK-7808] [ML] add package doc for ml.feature This PR adds a short description of `ml.feature` package with code example. The Java package doc will come in a separate PR. jkbradley Author: Xiangrui Meng m

spark git commit: [SPARK-9902] [MLLIB] Add Java and Python examples to user guide for 1-sample KS test

2015-08-17 Thread meng
. (cherry picked from commit c90c605dc6a876aef3cc204ac15cd65bab9743ad) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/9740d43d Tree: http://git-wip-us.apache.org/repos/asf

spark git commit: [SPARK-9902] [MLLIB] Add Java and Python examples to user guide for 1-sample KS test

2015-08-17 Thread meng
: c90c605dc6a876aef3cc204ac15cd65bab9743ad Parents: f9d1a92 Author: jose.cambronero jose.cambron...@cloudera.com Authored: Mon Aug 17 19:09:45 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Mon Aug 17 19:09:45 2015 -0700

spark git commit: [SPARK-10068] [MLLIB] Adds links to MLlib types, algos, utilities listing

2015-08-17 Thread meng
Parents: 772e7c1 Author: Feynman Liang fli...@databricks.com Authored: Mon Aug 17 15:42:14 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Mon Aug 17 15:42:14 2015 -0700 -- docs/mllib-guide.md | 26

spark git commit: [SPARK-10068] [MLLIB] Adds links to MLlib types, algos, utilities listing

2015-08-17 Thread meng
fdaf17f63f751f02623414fbc7d0a2f545364050) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/bb3bb2a4 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/bb3bb2a4 Diff: http://git

spark git commit: [SPARK-7707] User guide and example code for KernelDensity

2015-08-17 Thread meng
...@cloudera.com Authored: Mon Aug 17 17:57:51 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Mon Aug 17 17:57:51 2015 -0700 -- docs/mllib-statistics.md | 77 +++ 1 file

spark git commit: [SPARK-7707] User guide and example code for KernelDensity

2015-08-17 Thread meng
) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/f7f2ac69 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/f7f2ac69 Diff: http://git-wip-us.apache.org/repos/asf/spark/diff

spark git commit: [SPARK-7808] [ML] add package doc for ml.feature

2015-08-17 Thread meng
Repository: spark Updated Branches: refs/heads/master ee093c8b9 - e290029a3 [SPARK-7808] [ML] add package doc for ml.feature This PR adds a short description of `ml.feature` package with code example. The Java package doc will come in a separate PR. jkbradley Author: Xiangrui Meng m

spark git commit: [SPARK-8920] [MLLIB] Add @since tags to mllib.linalg

2015-08-17 Thread meng
. (cherry picked from commit 088b11ec5949e135cb3db2a1ce136837e046c288) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/0f1417b6 Tree: http://git-wip-us.apache.org/repos/asf

spark git commit: [SPARK-9898] [MLLIB] Prefix Span user guide

2015-08-17 Thread meng
/SPARK-9898. (cherry picked from commit 0b6b01761370629ce387c143a25d41f3a334ff28) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/18b3d11f Tree: http://git-wip-us.apache.org

spark git commit: [SPARK-9898] [MLLIB] Prefix Span user guide

2015-08-17 Thread meng
Commit: 0b6b01761370629ce387c143a25d41f3a334ff28 Parents: 18523c1 Author: Feynman Liang fli...@databricks.com Authored: Mon Aug 17 17:53:24 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Mon Aug 17 17:53:24 2015 -0700

spark git commit: [SPARK-9959] [MLLIB] Association Rules Java Compatibility

2015-08-17 Thread meng
: 3ff81ad Author: Feynman Liang fli...@databricks.com Authored: Mon Aug 17 09:58:34 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Mon Aug 17 09:58:34 2015 -0700 -- .../spark/mllib/fpm/AssociationRules.scala | 30

spark git commit: [SPARK-9828] [PYSPARK] Mutable values should not be default arguments

2015-08-14 Thread meng
Author: MechCoder manojkumarsivaraj...@gmail.com Authored: Fri Aug 14 12:46:05 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Fri Aug 14 12:46:05 2015 -0700 -- python/pyspark/ml/evaluation.py | 4 +++- python

spark git commit: [SPARK-9828] [PYSPARK] Mutable values should not be default arguments

2015-08-14 Thread meng
ffa05c84fe75663fc33f3d954d1cb1e084ab3280) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/969e8b31 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/969e8b31 Diff: http://git-wip-us.apache.org

spark git commit: [SPARK-9981] [ML] Made labels public for StringIndexerModel

2015-08-14 Thread meng
2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Fri Aug 14 14:05:03 2015 -0700 -- .../apache/spark/ml/feature/StringIndexer.scala | 5 - .../spark/ml/feature/StringIndexerSuite.scala | 18

spark git commit: [SPARK-9981] [ML] Made labels public for StringIndexerModel

2015-08-14 Thread meng
(to catch the issue with removing the inverse() method), so this adds it back. mengxr Author: Joseph K. Bradley jos...@databricks.com Closes #8211 from jkbradley/stridx-labels. (cherry picked from commit 2a6590e510aba3bfc6603d280023128b3f5ac702) Signed-off-by: Xiangrui Meng m...@databricks.com

spark git commit: [SPARK-9918] [MLLIB] remove runs from k-means and rename epsilon to tol

2015-08-13 Thread meng
algorithms. The Python constructor is updated to include all parameters. jkbradley yu-iskw Author: Xiangrui Meng m...@databricks.com Closes #8148 from mengxr/SPARK-9918 and squashes the following commits: 149b9e5 [Xiangrui Meng] fix constructor in Python and rename epsilon to tol 3cc15b3 [Xiangrui

spark git commit: [SPARK-9918] [MLLIB] remove runs from k-means and rename epsilon to tol

2015-08-13 Thread meng
algorithms. The Python constructor is updated to include all parameters. jkbradley yu-iskw Author: Xiangrui Meng m...@databricks.com Closes #8148 from mengxr/SPARK-9918 and squashes the following commits: 149b9e5 [Xiangrui Meng] fix constructor in Python and rename epsilon to tol 3cc15b3 [Xiangrui Meng

spark git commit: [MINOR] [ML] change MultilayerPerceptronClassifierModel to MultilayerPerceptronClassificationModel

2015-08-13 Thread meng
` to `MultilayerPerceptronClassificationModel` like `DecisionTreeClassificationModel`, `GBTClassificationModel` and so on. Author: Yanbo Liang yblia...@gmail.com Closes #8164 from yanboliang/mlp-name. (cherry picked from commit 4b70798c96b0a784b85fda461426ec60f609be12) Signed-off-by: Xiangrui Meng m...@databricks.com Project

spark git commit: [MINOR] [DOC] fix mllib pydoc warnings

2015-08-13 Thread meng
Repository: spark Updated Branches: refs/heads/branch-1.5 2b1353249 - 883c7d35f [MINOR] [DOC] fix mllib pydoc warnings Switch to correct Sphinx syntax. MechCoder Author: Xiangrui Meng m...@databricks.com Closes #8169 from mengxr/mllib-pydoc-fix. (cherry picked from commit

spark git commit: [MINOR] [ML] change MultilayerPerceptronClassifierModel to MultilayerPerceptronClassificationModel

2015-08-13 Thread meng
Committer: Xiangrui Meng m...@databricks.com Committed: Thu Aug 13 09:31:14 2015 -0700 -- .../MultilayerPerceptronClassifier.scala| 16 1 file changed, 8 insertions(+), 8 deletions

spark git commit: [MINOR] [DOC] fix mllib pydoc warnings

2015-08-13 Thread meng
Repository: spark Updated Branches: refs/heads/master 4b70798c9 - 65fec798c [MINOR] [DOC] fix mllib pydoc warnings Switch to correct Sphinx syntax. MechCoder Author: Xiangrui Meng m...@databricks.com Closes #8169 from mengxr/mllib-pydoc-fix. Project: http://git-wip-us.apache.org/repos/asf

spark git commit: [SPARK-9922] [ML] rename StringIndexerReverse to IndexToString

2015-08-13 Thread meng
to `IndexToString` might be better. ~~I also changed `invert` to `inverse` without arguments. `inputCol` and `outputCol` could be set after.~~ I also removed `invert`. jkbradley holdenk Author: Xiangrui Meng m...@databricks.com Closes #8152 from mengxr/SPARK-9922. Project: http://git-wip

spark git commit: [SPARK-9922] [ML] rename StringIndexerReverse to IndexToString

2015-08-13 Thread meng
. Renaming to `IndexToString` might be better. ~~I also changed `invert` to `inverse` without arguments. `inputCol` and `outputCol` could be set after.~~ I also removed `invert`. jkbradley holdenk Author: Xiangrui Meng m...@databricks.com Closes #8152 from mengxr/SPARK-9922. (cherry picked from commit

spark git commit: [SPARK-9909] [ML] [TRIVIAL] move weightCol to shared params

2015-08-12 Thread meng
: 6e409bc1357f49de2efdfc4226d074b943fb1153 Parents: caa14d9 Author: Holden Karau hol...@pigscanfly.ca Authored: Wed Aug 12 16:54:45 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Wed Aug 12 16:54:45 2015 -0700

spark git commit: [SPARK-9909] [ML] [TRIVIAL] move weightCol to shared params

2015-08-12 Thread meng
-toSharedParams. (cherry picked from commit 6e409bc1357f49de2efdfc4226d074b943fb1153) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/2f8793b5 Tree: http://git-wip-us.apache.org/repos

spark git commit: [SPARK-9913] [MLLIB] LDAUtils should be private

2015-08-12 Thread meng
Repository: spark Updated Branches: refs/heads/branch-1.5 08f767a1e - 6aca0cf34 [SPARK-9913] [MLLIB] LDAUtils should be private feynmanliang Author: Xiangrui Meng m...@databricks.com Closes #8142 from mengxr/SPARK-9913. (cherry picked from commit caa14d9dc9e2eb1102052b22445b63b0e004e3c7

spark git commit: [SPARK-9913] [MLLIB] LDAUtils should be private

2015-08-12 Thread meng
Repository: spark Updated Branches: refs/heads/master 7035d880a - caa14d9dc [SPARK-9913] [MLLIB] LDAUtils should be private feynmanliang Author: Xiangrui Meng m...@databricks.com Closes #8142 from mengxr/SPARK-9913. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http

spark git commit: [SPARK-9915] [ML] stopWords should use StringArrayParam

2015-08-12 Thread meng
Repository: spark Updated Branches: refs/heads/master e6aef5576 - fc1c7fd66 [SPARK-9915] [ML] stopWords should use StringArrayParam hhbyyh Author: Xiangrui Meng m...@databricks.com Closes #8141 from mengxr/SPARK-9915. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http

spark git commit: [SPARK-8967] [DOC] add Since annotation

2015-08-12 Thread meng
/9230761/fa72865c-40d8-11e5-807e-0f3c815c5acd.png) `Since` annotation: ![screen shot 2015-08-11 at 10 00 28 pm](https://cloud.githubusercontent.com/assets/829644/9230764/0041d7f4-40d9-11e5-8124-c3f3e5d5b31f.png) rxin Author: Xiangrui Meng m...@databricks.com Closes #8131 from mengxr/SPARK-8967

spark git commit: [SPARK-9912] [MLLIB] QRDecomposition should use QType and RType for type names instead of UType and VType

2015-08-12 Thread meng
Repository: spark Updated Branches: refs/heads/master 6e409bc13 - e6aef5576 [SPARK-9912] [MLLIB] QRDecomposition should use QType and RType for type names instead of UType and VType hhbyyh Author: Xiangrui Meng m...@databricks.com Closes #8140 from mengxr/SPARK-9912. Project: http://git

spark git commit: [SPARK-9912] [MLLIB] QRDecomposition should use QType and RType for type names instead of UType and VType

2015-08-12 Thread meng
Repository: spark Updated Branches: refs/heads/branch-1.5 2f8793b5f - 31b7fdc06 [SPARK-9912] [MLLIB] QRDecomposition should use QType and RType for type names instead of UType and VType hhbyyh Author: Xiangrui Meng m...@databricks.com Closes #8140 from mengxr/SPARK-9912. (cherry picked

spark git commit: [SPARK-9915] [ML] stopWords should use StringArrayParam

2015-08-12 Thread meng
Repository: spark Updated Branches: refs/heads/branch-1.5 31b7fdc06 - ed73f5439 [SPARK-9915] [ML] stopWords should use StringArrayParam hhbyyh Author: Xiangrui Meng m...@databricks.com Closes #8141 from mengxr/SPARK-9915. (cherry picked from commit fc1c7fd66e64ccea53b31cd2fbb98bc6d307329c

spark git commit: [SPARK-9903] [MLLIB] skip local processing in PrefixSpan if there are no small prefixes

2015-08-12 Thread meng
becomes unnecessary. feynmanliang Author: Xiangrui Meng m...@databricks.com Closes #8136 from mengxr/SPARK-9903. (cherry picked from commit d7053bea985679c514b3add029631ea23e1730ce) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit

spark git commit: [SPARK-9917] [ML] add getMin/getMax and doc for originalMin/origianlMax in MinMaxScaler

2015-08-12 Thread meng
Repository: spark Updated Branches: refs/heads/branch-1.5 8229437c3 - 16f4bf4ca [SPARK-9917] [ML] add getMin/getMax and doc for originalMin/origianlMax in MinMaxScaler hhbyyh Author: Xiangrui Meng m...@databricks.com Closes #8145 from mengxr/SPARK-9917. (cherry picked from commit

spark git commit: [SPARK-9917] [ML] add getMin/getMax and doc for originalMin/origianlMax in MinMaxScaler

2015-08-12 Thread meng
Repository: spark Updated Branches: refs/heads/master a8ab2634c - 5fc058a1f [SPARK-9917] [ML] add getMin/getMax and doc for originalMin/origianlMax in MinMaxScaler hhbyyh Author: Xiangrui Meng m...@databricks.com Closes #8145 from mengxr/SPARK-9917. Project: http://git-wip-us.apache.org

spark git commit: [SPARK-8922] [DOCUMENTATION, MLLIB] Add @since tags to mllib.evaluation

2015-08-12 Thread meng
: shikai.tang tar.sk...@gmail.com Authored: Wed Aug 12 21:53:15 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Wed Aug 12 21:53:15 2015 -0700 -- .../BinaryClassificationMetrics.scala | 32

spark git commit: [SPARK-8922] [DOCUMENTATION, MLLIB] Add @since tags to mllib.evaluation

2015-08-12 Thread meng
df543892122342b97e5137b266959ba97589b3ef) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/69028403 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/69028403 Diff: http://git-wip-us.apache.org/repos/asf/spark

spark git commit: [SPARK-9903] [MLLIB] skip local processing in PrefixSpan if there are no small prefixes

2015-08-12 Thread meng
becomes unnecessary. feynmanliang Author: Xiangrui Meng m...@databricks.com Closes #8136 from mengxr/SPARK-9903. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/d7053bea Tree: http://git-wip-us.apache.org/repos/asf/spark/tree

spark git commit: [SPARK-9914] [ML] define setters explicitly for Java and use setParam group in RFormula

2015-08-12 Thread meng
: Xiangrui Meng m...@databricks.com Closes #8143 from mengxr/SPARK-9914 and squashes the following commits: d36c887 [Xiangrui Meng] remove setters from model a49021b [Xiangrui Meng] define setters explicitly for Java and use setParam group Project: http://git-wip-us.apache.org/repos/asf/spark/repo

spark git commit: [SPARK-7583] [MLLIB] User guide update for RegexTokenizer

2015-08-12 Thread meng
/regexTokenizerDoc. (cherry picked from commit 66d87c1d76bea2b81993156ac1fa7dad6c312ebf) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/2d86fadd Tree: http://git-wip

spark git commit: [SPARK-7583] [MLLIB] User guide update for RegexTokenizer

2015-08-12 Thread meng
/master Commit: 66d87c1d76bea2b81993156ac1fa7dad6c312ebf Parents: be5d191 Author: Yuhao Yang hhb...@gmail.com Authored: Wed Aug 12 09:35:32 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Wed Aug 12 09:35:32 2015 -0700

spark git commit: [SPARK-9847] [ML] Modified copyValues to distinguish between default, explicit param values

2015-08-12 Thread meng
...@databricks.com Authored: Wed Aug 12 10:48:52 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Wed Aug 12 10:48:52 2015 -0700 -- .../scala/org/apache/spark/ml/param/params.scala | 19 --- .../org

spark git commit: Closes #1290 Closes #4934

2015-08-11 Thread meng
/423cdfd8 Diff: http://git-wip-us.apache.org/repos/asf/spark/diff/423cdfd8 Branch: refs/heads/master Commit: 423cdfd83d7fd02a4f8cf3e714db913fd3f9ca09 Parents: f16bc68 Author: Xiangrui Meng m...@databricks.com Authored: Tue Aug 11 14:08:09 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed

spark git commit: [SPARK-8925] [MLLIB] Add @since tags to mllib.util

2015-08-11 Thread meng
: Sudhakar Thota sudhakarth...@sudhakars-mbp-2.usca.ibm.com Closes #7436 from sthota2014/SPARK-8925_thotas. (cherry picked from commit 017b5de07ef6cff249e984a2ab781c520249ac76) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit

spark git commit: [SPARK-8925] [MLLIB] Add @since tags to mllib.util

2015-08-11 Thread meng
Diff: http://git-wip-us.apache.org/repos/asf/spark/diff/017b5de0 Branch: refs/heads/master Commit: 017b5de07ef6cff249e984a2ab781c520249ac76 Parents: be3e271 Author: Sudhakar Thota sudhakarth...@yahoo.com Authored: Tue Aug 11 14:31:51 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed

spark git commit: [SPARK-8345] [ML] Add an SQL node as a feature transformer

2015-08-11 Thread meng
: refs/heads/master Commit: 8cad854ef6a2066de5adffcca6b79a205ccfd5f3 Parents: bce7279 Author: Yanbo Liang yblia...@gmail.com Authored: Tue Aug 11 11:01:59 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Tue Aug 11 11:01:59 2015 -0700

spark git commit: [SPARK-9713] [ML] Document SparkR MLlib glm() integration in Spark 1.5

2015-08-11 Thread meng
...@databricks.com Closes #8085 from ericl/docs. (cherry picked from commit 74a293f4537c6982345166f8883538f81d850872) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/890c75bc Tree: http

spark git commit: [SPARK-9713] [ML] Document SparkR MLlib glm() integration in Spark 1.5

2015-08-11 Thread meng
Branch: refs/heads/master Commit: 74a293f4537c6982345166f8883538f81d850872 Parents: 3ef0f32 Author: Eric Liang e...@databricks.com Authored: Tue Aug 11 21:26:03 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Tue Aug 11 21:26:03 2015 -0700

spark git commit: [SPARK-9615] [SPARK-9616] [SQL] [MLLIB] Bugs related to FrequentItems when merging and with Tungsten

2015-08-06 Thread meng
] address 1 506753e [Burak Yavuz] fixed and added reg test 47bfd50 [Burak Yavuz] pushing (cherry picked from commit 98e69467d4fda2c26a951409b5b7c6f1e9345ce4) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org

spark git commit: [SPARK-9615] [SPARK-9616] [SQL] [MLLIB] Bugs related to FrequentItems when merging and with Tungsten

2015-08-06 Thread meng
-us.apache.org/repos/asf/spark/diff/98e69467 Branch: refs/heads/master Commit: 98e69467d4fda2c26a951409b5b7c6f1e9345ce4 Parents: 076ec05 Author: Burak Yavuz brk...@gmail.com Authored: Thu Aug 6 10:29:40 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Thu Aug 6 10:29:40 2015 -0700

spark git commit: [SPARK-6486] [MLLIB] [PYTHON] Add BlockMatrix to PySpark.

2015-08-05 Thread meng
34dcf10104460816382908b2b8eeb6c925e862bf) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/eedb996d Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/eedb996d Diff: http://git-wip-us.apache.org/repos/asf

spark git commit: [SPARK-6486] [MLLIB] [PYTHON] Add BlockMatrix to PySpark.

2015-08-05 Thread meng
: Mike Dusenberry mwdus...@us.ibm.com Authored: Wed Aug 5 07:40:50 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Wed Aug 5 07:40:50 2015 -0700 -- docs/mllib-data-types.md| 41

spark git commit: [SPARK-5895] [ML] Add VectorSlicer - updated

2015-08-05 Thread meng
: a018b85716fd510ae95a3c66d676bbdb90f8d4e7 Parents: 9c87892 Author: Xusen Yin yinxu...@gmail.com Authored: Wed Aug 5 17:07:55 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Wed Aug 5 17:07:55 2015 -0700 -- .../apache/spark/ml/feature

spark git commit: [SPARK-9657] Fix return type of getMaxPatternLength

2015-08-05 Thread meng
: refs/heads/master Commit: dac090d1e9be7dec6c5ebdb2a81105b87e853193 Parents: f9c2a2a Author: Feynman Liang fli...@databricks.com Authored: Wed Aug 5 15:42:18 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Wed Aug 5 15:42:18 2015 -0700

spark git commit: [SPARK-9657] Fix return type of getMaxPatternLength

2015-08-05 Thread meng
return type of getMaxPatternLength (cherry picked from commit dac090d1e9be7dec6c5ebdb2a81105b87e853193) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/30e9fcfb Tree: http://git

spark git commit: [SPARK-5895] [ML] Add VectorSlicer - updated

2015-08-05 Thread meng
from commit a018b85716fd510ae95a3c66d676bbdb90f8d4e7) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/3b617e87 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/3b617e87

spark git commit: [SPARK-9540] [MLLIB] optimize PrefixSpan implementation

2015-08-04 Thread meng
primitive arrays. 7. Expose `maxLocalProjDBSize`. 8. Tests are not changed except using `0` instead of `-1` as the delimiter. `Postfix`'s API doc should be a good place to start. Closes #7594 feynmanliang zhangjiajin Author: Xiangrui Meng m...@databricks.com Closes #7937 from mengxr/SPARK-9540

spark git commit: [SPARK-6485] [MLLIB] [PYTHON] Add CoordinateMatrix/RowMatrix/IndexedRowMatrix to PySpark.

2015-08-04 Thread meng
/asf/spark/tree/571d5b53 Diff: http://git-wip-us.apache.org/repos/asf/spark/diff/571d5b53 Branch: refs/heads/master Commit: 571d5b5363ff4dbbce1f7019ab8e86cbc3cba4d5 Parents: 1833d9c Author: Mike Dusenberry mwdus...@us.ibm.com Authored: Tue Aug 4 16:30:03 2015 -0700 Committer: Xiangrui Meng m

spark git commit: [SPARK-6485] [MLLIB] [PYTHON] Add CoordinateMatrix/RowMatrix/IndexedRowMatrix to PySpark.

2015-08-04 Thread meng
to interface with the factory method. Added DistributedMatrix, DistributedMatrices, and RowMatrix classes to the pyspark.mllib.linalg api. (cherry picked from commit 571d5b5363ff4dbbce1f7019ab8e86cbc3cba4d5) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos

spark git commit: [SPARK-9586] [ML] Update BinaryClassificationEvaluator to use setRawPredictionCol

2015-08-04 Thread meng
(cherry picked from commit b77d3b9688d56d33737909375d1d0db07da5827b) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/cff0fe29 Tree: http://git-wip-us.apache.org/repos/asf/spark

spark git commit: [SPARK-9586] [ML] Update BinaryClassificationEvaluator to use setRawPredictionCol

2015-08-04 Thread meng
: b77d3b9688d56d33737909375d1d0db07da5827b Parents: 571d5b5 Author: Joseph K. Bradley jos...@databricks.com Authored: Tue Aug 4 16:52:43 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Tue Aug 4 16:52:43 2015 -0700

spark git commit: [SPARK-9544] [MLLIB] add Python API for RFormula

2015-08-03 Thread meng
: Xiangrui Meng m...@databricks.com Closes #7879 from mengxr/SPARK-9544 and squashes the following commits: 3d5ff03 [Xiangrui Meng] add an doctest for . and - 5e969a5 [Xiangrui Meng] fix pydoc 1cd41f8 [Xiangrui Meng] organize imports 3c18b10 [Xiangrui Meng] add Python API for RFormula (cherry picked

spark git commit: [SPARK-9544] [MLLIB] add Python API for RFormula

2015-08-03 Thread meng
: Xiangrui Meng m...@databricks.com Closes #7879 from mengxr/SPARK-9544 and squashes the following commits: 3d5ff03 [Xiangrui Meng] add an doctest for . and - 5e969a5 [Xiangrui Meng] fix pydoc 1cd41f8 [Xiangrui Meng] organize imports 3c18b10 [Xiangrui Meng] add Python API for RFormula Project: http

spark git commit: [SPARK-9000] [MLLIB] Support generic item types in PrefixSpan

2015-08-02 Thread meng
/asf/spark/tree/28d944e8 Diff: http://git-wip-us.apache.org/repos/asf/spark/diff/28d944e8 Branch: refs/heads/master Commit: 28d944e86d066eb4c651dd803f0b022605ed644e Parents: 57084e0 Author: Feynman Liang fli...@databricks.com Authored: Sat Aug 1 23:11:25 2015 -0700 Committer: Xiangrui Meng m

spark git commit: [SPARK-9527] [MLLIB] add PrefixSpanModel and make PrefixSpan Java friendly

2015-08-02 Thread meng
-friendly method `javaSequence` 3. Overload `run` for Java users. 4. Added a unit test in Java to check Java compatibility. zhangjiajin feynmanliang Author: Xiangrui Meng m...@databricks.com Closes #7869 from mengxr/SPARK-9527 and squashes the following commits: 4345594 [Xiangrui Meng] add

spark git commit: [SPARK-8999] [MLLIB] PrefixSpan non-temporal sequences

2015-08-01 Thread meng
Parents: 6503897 Author: zhangjiajin zhangjia...@huawei.com Authored: Sat Aug 1 01:56:27 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Sat Aug 1 01:56:27 2015 -0700 -- .../spark/mllib/fpm/LocalPrefixSpan.scala

spark git commit: [SPARK-8169] [ML] Add StopWordsRemover as a transformer

2015-08-01 Thread meng
Parents: d2a9b66 Author: Yuhao Yang hhb...@gmail.com Authored: Sat Aug 1 02:31:28 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Sat Aug 1 02:31:28 2015 -0700 -- .../spark/ml/feature/StopWordsRemover.scala

spark git commit: [SPARK-9490] [DOCS] [MLLIB] MLlib evaluation metrics guide example python code uses deprecated print statement

2015-07-31 Thread meng
: 873ab0f9692d8ea6220abdb8d9200041068372a8 Parents: 815c824 Author: Sean Owen so...@cloudera.com Authored: Fri Jul 31 13:45:28 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Fri Jul 31 13:45:28 2015 -0700 -- docs/ml-guide.md

spark git commit: [SPARK-8998] [MLLIB] Distribute PrefixSpan computation for large projected databases

2015-07-30 Thread meng
/repos/asf/spark/diff/d212a314 Branch: refs/heads/master Commit: d212a314227dec26c0dbec8ed3422d0ec8f818f9 Parents: c581593 Author: zhangjiajin zhangjia...@huawei.com Authored: Thu Jul 30 08:14:09 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Thu Jul 30 08:14:09 2015 -0700

spark git commit: [SPARK-7368] [MLLIB] Add QR decomposition for RowMatrix

2015-07-30 Thread meng
hhb...@gmail.com Authored: Thu Jul 30 07:49:10 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Thu Jul 30 07:49:10 2015 -0700 -- .../linalg/SingularValueDecomposition.scala | 8 .../mllib/linalg

spark git commit: [SPARK-] [MLLIB] minor fix on tokenizer doc

2015-07-30 Thread meng
: Xiangrui Meng m...@databricks.com Committed: Thu Jul 30 08:20:52 2015 -0700 -- mllib/src/main/scala/org/apache/spark/ml/feature/Tokenizer.scala | 2 +- 1 file changed, 1 insertion(+), 1 deletion

spark git commit: [SPARK-] [MLLIB] minor fix on tokenizer doc

2015-07-30 Thread meng
. Author: Yuhao Yang hhb...@gmail.com Closes #7791 from hhbyyh/docFix and squashes the following commits: cdf2542 [Yuhao Yang] minor fix on tokenizer doc (cherry picked from commit 9c0501c5d04d83ca25ce433138bf64df6a14dc58) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git

spark git commit: [SPARK-5561] [MLLIB] Generalized PeriodicCheckpointer for RDDs and Graphs

2015-07-30 Thread meng
Commit: c5815930be46a89469440b7c61b59764fb67a54c Parents: d31c618 Author: Joseph K. Bradley jos...@databricks.com Authored: Thu Jul 30 07:56:15 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Thu Jul 30 07:56:15 2015 -0700

spark git commit: [SPARK-8671] [ML] Added isotonic regression to the pipeline API.

2015-07-30 Thread meng
: martinzapletal zapletal-mar...@email.cz Authored: Thu Jul 30 15:57:14 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Thu Jul 30 15:57:14 2015 -0700 -- .../ml/regression/IsotonicRegression.scala | 144

spark git commit: [SPARK-9463] [ML] Expose model coefficients with names in SparkR RFormula

2015-07-30 Thread meng
: Eric Liang e...@databricks.com Authored: Thu Jul 30 16:15:43 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Thu Jul 30 16:15:43 2015 -0700 -- R/pkg/NAMESPACE | 3 ++- R/pkg/R

spark git commit: [SPARK-9225] [MLLIB] LDASuite needs unit tests for empty documents

2015-07-30 Thread meng
/asf/spark/diff/a6e53a9c Branch: refs/heads/master Commit: a6e53a9c8b24326d1b6dca7a0e36ce6c643daa77 Parents: 9c0501c Author: Meihua Wu meihu...@umich.edu Authored: Thu Jul 30 08:52:01 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Thu Jul 30 08:52:01 2015 -0700

spark git commit: [SPARK-9277] [MLLIB] SparseVector constructor must throw an error when declared number of elements less than array length

2015-07-30 Thread meng
Authored: Thu Jul 30 09:19:55 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Thu Jul 30 09:19:55 2015 -0700 -- .../org/apache/spark/mllib/linalg/Vectors.scala | 2 ++ .../org/apache/spark/mllib/linalg

spark git commit: [MINOR] [MLLIB] fix doc for RegexTokenizer

2015-07-30 Thread meng
Repository: spark Updated Branches: refs/heads/master ed3cb1d21 - 81464f2a8 [MINOR] [MLLIB] fix doc for RegexTokenizer This is #7791 for Python. hhbyyh Author: Xiangrui Meng m...@databricks.com Closes #7798 from mengxr/regex-tok-py and squashes the following commits: baa2dcd [Xiangrui Meng

spark git commit: [MINOR] [MLLIB] fix doc for RegexTokenizer

2015-07-30 Thread meng
Repository: spark Updated Branches: refs/heads/branch-1.4 020dd30e5 - 6e85064f4 [MINOR] [MLLIB] fix doc for RegexTokenizer This is #7791 for Python. hhbyyh Author: Xiangrui Meng m...@databricks.com Closes #7798 from mengxr/regex-tok-py and squashes the following commits: baa2dcd [Xiangrui

spark git commit: [SPARK-9408] [PYSPARK] [MLLIB] Refactor linalg.py to /linalg

2015-07-30 Thread meng
. If this passes Jenkins, we can merge this one first and then try to move `__init__.py` to `local.py` in a separate PR. Closes #7731 Author: Xiangrui Meng m...@databricks.com Closes #7746 from mengxr/SPARK-9408 and squashes the following commits: 0e05a3b [Xiangrui Meng] merge master 1135551

<    2   3   4   5   6   7   8   9   10   11   >