spark git commit: [SPARK-6278][MLLIB] Mention the change of objective in linear regression
Repository: spark Updated Branches: refs/heads/branch-1.3 dc287f38f - 214f68103 [SPARK-6278][MLLIB] Mention the change of objective in linear regression As discussed in the RC3 vote thread, we should mention the change of objective in linear regression in the migration guide. srowen Author: Xiangrui Meng m...@databricks.com Closes #4978 from mengxr/SPARK-6278 and squashes the following commits: fb3bbe6 [Xiangrui Meng] mention regularization parameter bfd6cff [Xiangrui Meng] Merge remote-tracking branch 'apache/master' into SPARK-6278 375fd09 [Xiangrui Meng] address Sean's comments f87ae71 [Xiangrui Meng] mention step size change (cherry picked from commit 7f13434a5c52b815c584ec773ab0e5df1a35ea86) Signed-off-by: Xiangrui Meng m...@databricks.com Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/214f6810 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/214f6810 Diff: http://git-wip-us.apache.org/repos/asf/spark/diff/214f6810 Branch: refs/heads/branch-1.3 Commit: 214f68103219317416e2278e80b8fc0fb5a616f4 Parents: dc287f3 Author: Xiangrui Meng m...@databricks.com Authored: Fri Mar 13 10:27:28 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Fri Mar 13 10:27:34 2015 -0700 -- docs/mllib-guide.md | 2 ++ 1 file changed, 2 insertions(+) -- http://git-wip-us.apache.org/repos/asf/spark/blob/214f6810/docs/mllib-guide.md -- diff --git a/docs/mllib-guide.md b/docs/mllib-guide.md index 4c7a7d9..03b948c 100644 --- a/docs/mllib-guide.md +++ b/docs/mllib-guide.md @@ -107,6 +107,8 @@ In the `spark.mllib` package, there were several breaking changes. The first ch * In `DecisionTree`, the deprecated class method `train` has been removed. (The object/static `train` methods remain.) * In `Strategy`, the `checkpointDir` parameter has been removed. Checkpointing is still supported, but the checkpoint directory must be set before calling tree and tree ensemble training. * `PythonMLlibAPI` (the interface between Scala/Java and Python for MLlib) was a public API but is now private, declared `private[python]`. This was never meant for external use. +* In linear regression (including Lasso and ridge regression), the squared loss is now divided by 2. + So in order to produce the same result as in 1.2, the regularization parameter needs to be divided by 2 and the step size needs to be multiplied by 2. ## Previous Spark Versions - To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org For additional commands, e-mail: commits-h...@spark.apache.org
spark git commit: [SPARK-6278][MLLIB] Mention the change of objective in linear regression
Repository: spark Updated Branches: refs/heads/master dc4abd4dc - 7f13434a5 [SPARK-6278][MLLIB] Mention the change of objective in linear regression As discussed in the RC3 vote thread, we should mention the change of objective in linear regression in the migration guide. srowen Author: Xiangrui Meng m...@databricks.com Closes #4978 from mengxr/SPARK-6278 and squashes the following commits: fb3bbe6 [Xiangrui Meng] mention regularization parameter bfd6cff [Xiangrui Meng] Merge remote-tracking branch 'apache/master' into SPARK-6278 375fd09 [Xiangrui Meng] address Sean's comments f87ae71 [Xiangrui Meng] mention step size change Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/7f13434a Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/7f13434a Diff: http://git-wip-us.apache.org/repos/asf/spark/diff/7f13434a Branch: refs/heads/master Commit: 7f13434a5c52b815c584ec773ab0e5df1a35ea86 Parents: dc4abd4 Author: Xiangrui Meng m...@databricks.com Authored: Fri Mar 13 10:27:28 2015 -0700 Committer: Xiangrui Meng m...@databricks.com Committed: Fri Mar 13 10:27:28 2015 -0700 -- docs/mllib-guide.md | 2 ++ 1 file changed, 2 insertions(+) -- http://git-wip-us.apache.org/repos/asf/spark/blob/7f13434a/docs/mllib-guide.md -- diff --git a/docs/mllib-guide.md b/docs/mllib-guide.md index 598374f..f8e8794 100644 --- a/docs/mllib-guide.md +++ b/docs/mllib-guide.md @@ -102,6 +102,8 @@ In the `spark.mllib` package, there were several breaking changes. The first ch * In `DecisionTree`, the deprecated class method `train` has been removed. (The object/static `train` methods remain.) * In `Strategy`, the `checkpointDir` parameter has been removed. Checkpointing is still supported, but the checkpoint directory must be set before calling tree and tree ensemble training. * `PythonMLlibAPI` (the interface between Scala/Java and Python for MLlib) was a public API but is now private, declared `private[python]`. This was never meant for external use. +* In linear regression (including Lasso and ridge regression), the squared loss is now divided by 2. + So in order to produce the same result as in 1.2, the regularization parameter needs to be divided by 2 and the step size needs to be multiplied by 2. ## Previous Spark Versions - To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org For additional commands, e-mail: commits-h...@spark.apache.org