spark git commit: [SPARK-6278][MLLIB] Mention the change of objective in linear regression

2015-03-13 Thread meng
Repository: spark
Updated Branches:
  refs/heads/branch-1.3 dc287f38f - 214f68103


[SPARK-6278][MLLIB] Mention the change of objective in linear regression

As discussed in the RC3 vote thread, we should mention the change of objective 
in linear regression in the migration guide. srowen

Author: Xiangrui Meng m...@databricks.com

Closes #4978 from mengxr/SPARK-6278 and squashes the following commits:

fb3bbe6 [Xiangrui Meng] mention regularization parameter
bfd6cff [Xiangrui Meng] Merge remote-tracking branch 'apache/master' into 
SPARK-6278
375fd09 [Xiangrui Meng] address Sean's comments
f87ae71 [Xiangrui Meng] mention step size change

(cherry picked from commit 7f13434a5c52b815c584ec773ab0e5df1a35ea86)
Signed-off-by: Xiangrui Meng m...@databricks.com


Project: http://git-wip-us.apache.org/repos/asf/spark/repo
Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/214f6810
Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/214f6810
Diff: http://git-wip-us.apache.org/repos/asf/spark/diff/214f6810

Branch: refs/heads/branch-1.3
Commit: 214f68103219317416e2278e80b8fc0fb5a616f4
Parents: dc287f3
Author: Xiangrui Meng m...@databricks.com
Authored: Fri Mar 13 10:27:28 2015 -0700
Committer: Xiangrui Meng m...@databricks.com
Committed: Fri Mar 13 10:27:34 2015 -0700

--
 docs/mllib-guide.md | 2 ++
 1 file changed, 2 insertions(+)
--


http://git-wip-us.apache.org/repos/asf/spark/blob/214f6810/docs/mllib-guide.md
--
diff --git a/docs/mllib-guide.md b/docs/mllib-guide.md
index 4c7a7d9..03b948c 100644
--- a/docs/mllib-guide.md
+++ b/docs/mllib-guide.md
@@ -107,6 +107,8 @@ In the `spark.mllib` package, there were several breaking 
changes.  The first ch
 * In `DecisionTree`, the deprecated class method `train` has been removed. 
 (The object/static `train` methods remain.)
 * In `Strategy`, the `checkpointDir` parameter has been removed.  
Checkpointing is still supported, but the checkpoint directory must be set 
before calling tree and tree ensemble training.
 * `PythonMLlibAPI` (the interface between Scala/Java and Python for MLlib) was 
a public API but is now private, declared `private[python]`.  This was never 
meant for external use.
+* In linear regression (including Lasso and ridge regression), the squared 
loss is now divided by 2.
+  So in order to produce the same result as in 1.2, the regularization 
parameter needs to be divided by 2 and the step size needs to be multiplied by 
2.
 
 ## Previous Spark Versions
 


-
To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org
For additional commands, e-mail: commits-h...@spark.apache.org



spark git commit: [SPARK-6278][MLLIB] Mention the change of objective in linear regression

2015-03-13 Thread meng
Repository: spark
Updated Branches:
  refs/heads/master dc4abd4dc - 7f13434a5


[SPARK-6278][MLLIB] Mention the change of objective in linear regression

As discussed in the RC3 vote thread, we should mention the change of objective 
in linear regression in the migration guide. srowen

Author: Xiangrui Meng m...@databricks.com

Closes #4978 from mengxr/SPARK-6278 and squashes the following commits:

fb3bbe6 [Xiangrui Meng] mention regularization parameter
bfd6cff [Xiangrui Meng] Merge remote-tracking branch 'apache/master' into 
SPARK-6278
375fd09 [Xiangrui Meng] address Sean's comments
f87ae71 [Xiangrui Meng] mention step size change


Project: http://git-wip-us.apache.org/repos/asf/spark/repo
Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/7f13434a
Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/7f13434a
Diff: http://git-wip-us.apache.org/repos/asf/spark/diff/7f13434a

Branch: refs/heads/master
Commit: 7f13434a5c52b815c584ec773ab0e5df1a35ea86
Parents: dc4abd4
Author: Xiangrui Meng m...@databricks.com
Authored: Fri Mar 13 10:27:28 2015 -0700
Committer: Xiangrui Meng m...@databricks.com
Committed: Fri Mar 13 10:27:28 2015 -0700

--
 docs/mllib-guide.md | 2 ++
 1 file changed, 2 insertions(+)
--


http://git-wip-us.apache.org/repos/asf/spark/blob/7f13434a/docs/mllib-guide.md
--
diff --git a/docs/mllib-guide.md b/docs/mllib-guide.md
index 598374f..f8e8794 100644
--- a/docs/mllib-guide.md
+++ b/docs/mllib-guide.md
@@ -102,6 +102,8 @@ In the `spark.mllib` package, there were several breaking 
changes.  The first ch
 * In `DecisionTree`, the deprecated class method `train` has been removed. 
 (The object/static `train` methods remain.)
 * In `Strategy`, the `checkpointDir` parameter has been removed.  
Checkpointing is still supported, but the checkpoint directory must be set 
before calling tree and tree ensemble training.
 * `PythonMLlibAPI` (the interface between Scala/Java and Python for MLlib) was 
a public API but is now private, declared `private[python]`.  This was never 
meant for external use.
+* In linear regression (including Lasso and ridge regression), the squared 
loss is now divided by 2.
+  So in order to produce the same result as in 1.2, the regularization 
parameter needs to be divided by 2 and the step size needs to be multiplied by 
2.
 
 ## Previous Spark Versions
 


-
To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org
For additional commands, e-mail: commits-h...@spark.apache.org