[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-03 Thread freeman-lab
Github user freeman-lab commented on a diff in the pull request: https://github.com/apache/spark/pull/4306#discussion_r24038692 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/regression/StreamingLinearAlgorithm.scala --- @@ -58,14 +58,14 @@ abstract class

[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-02 Thread mengxr
Github user mengxr commented on the pull request: https://github.com/apache/spark/pull/4306#issuecomment-72530762 LGTM except minor inline comments. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does

[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-02 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4306#issuecomment-72556110 [Test build #26550 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26550/consoleFull) for PR 4306 at commit

[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-02 Thread freeman-lab
Github user freeman-lab commented on the pull request: https://github.com/apache/spark/pull/4306#issuecomment-72555492 Thanks for the review! I think I dealt with everything, and in a couple places I tweaked the corresponding point in `StreamingLinearRegression` for parity. --- If

[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-02 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4306#issuecomment-72571147 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-02 Thread freeman-lab
Github user freeman-lab commented on a diff in the pull request: https://github.com/apache/spark/pull/4306#discussion_r23966222 --- Diff: examples/src/main/scala/org/apache/spark/examples/mllib/StreamingLogisticRegression.scala --- @@ -0,0 +1,74 @@ +/* + * Licensed to the

[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-02 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4306#discussion_r23967886 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/classification/StreamingLogisticRegressionWithSGD.scala --- @@ -0,0 +1,97 @@ +/* + * Licensed

[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-02 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4306#discussion_r23967929 --- Diff: examples/src/main/scala/org/apache/spark/examples/mllib/StreamingLogisticRegression.scala --- @@ -0,0 +1,74 @@ +/* + * Licensed to the

[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-02 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4306#issuecomment-72572876 [Test build #26567 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26567/consoleFull) for PR 4306 at commit

[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-02 Thread freeman-lab
Github user freeman-lab commented on a diff in the pull request: https://github.com/apache/spark/pull/4306#discussion_r23965614 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/classification/StreamingLogisticRegressionWithSGD.scala --- @@ -0,0 +1,97 @@ +/* + *

[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-02 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4306#issuecomment-72571137 **[Test build #26550 timed out](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26550/consoleFull)** for PR 4306 at commit

[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-02 Thread mengxr
Github user mengxr commented on the pull request: https://github.com/apache/spark/pull/4306#issuecomment-72572634 test this please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this

[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-02 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4306#issuecomment-72432824 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-02 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4306#issuecomment-72432815 [Test build #26504 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26504/consoleFull) for PR 4306 at commit

[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-02 Thread freeman-lab
Github user freeman-lab commented on the pull request: https://github.com/apache/spark/pull/4306#issuecomment-72583404 Should we just set `initialWeights=Vectors.dense(0.0)` by default? --- If your project is set up for it, you can reply to this email and have your reply appear on

[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-02 Thread mengxr
Github user mengxr commented on the pull request: https://github.com/apache/spark/pull/4306#issuecomment-72586638 Could we try to set model type to `Option[Model]` and check whether it is set in the code? --- If your project is set up for it, you can reply to this email and have

[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-02 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4306#issuecomment-72578614 [Test build #26567 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26567/consoleFull) for PR 4306 at commit

[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-02 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4306#issuecomment-72589097 [Test build #26599 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26599/consoleFull) for PR 4306 at commit

[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-02 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4306#issuecomment-72594005 [Test build #26599 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26599/consoleFull) for PR 4306 at commit

[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-02 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4306#issuecomment-72594009 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-02 Thread freeman-lab
Github user freeman-lab commented on the pull request: https://github.com/apache/spark/pull/4306#issuecomment-72580780 @mengxr the test failure is real, I think there's a conflict due to this recent change to master

[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-02 Thread freeman-lab
Github user freeman-lab commented on the pull request: https://github.com/apache/spark/pull/4306#issuecomment-72588966 Nice idea! Just took a stab, see what you think. Was reproducing the test failure locally, and it is now fixed with this change. --- If your project is set up for

[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-02 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4306#discussion_r23985526 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/regression/StreamingLinearAlgorithm.scala --- @@ -58,14 +58,14 @@ abstract class

[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-02 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4306#issuecomment-72578624 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-02 Thread freeman-lab
Github user freeman-lab commented on the pull request: https://github.com/apache/spark/pull/4306#issuecomment-72581477 Ok, the problem is that we've been setting `initialWeights` to `null` by default and then checking that weights are defined before starting training (because we

[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-02 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/4306 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-02 Thread mengxr
Github user mengxr commented on the pull request: https://github.com/apache/spark/pull/4306#issuecomment-72602371 Merged into master. Thanks!! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-02 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4306#discussion_r23941935 --- Diff: examples/src/main/scala/org/apache/spark/examples/mllib/StreamingLogisticRegression.scala --- @@ -0,0 +1,74 @@ +/* + * Licensed to the

[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-02 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4306#discussion_r23941945 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/classification/StreamingLogisticRegressionWithSGD.scala --- @@ -0,0 +1,97 @@ +/* + * Licensed

[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-02 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4306#discussion_r23941941 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/classification/StreamingLogisticRegressionWithSGD.scala --- @@ -0,0 +1,97 @@ +/* + * Licensed

[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-02 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4306#discussion_r23941944 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/classification/StreamingLogisticRegressionWithSGD.scala --- @@ -0,0 +1,97 @@ +/* + * Licensed

[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-02 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4306#discussion_r23941932 --- Diff: examples/src/main/scala/org/apache/spark/examples/mllib/StreamingLogisticRegression.scala --- @@ -0,0 +1,74 @@ +/* + * Licensed to the

[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-02 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4306#discussion_r23942029 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/classification/StreamingLogisticRegressionWithSGD.scala --- @@ -0,0 +1,97 @@ +/* + * Licensed

[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-02 Thread freeman-lab
GitHub user freeman-lab opened a pull request: https://github.com/apache/spark/pull/4306 [SPARK-4979][MLLIB] Streaming logisitic regression This adds support for streaming logistic regression with stochastic gradient descent, in the same manner as the existing implementation of

[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-02 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4306#issuecomment-72423945 [Test build #26503 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26503/consoleFull) for PR 4306 at commit

[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-02 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4306#issuecomment-72424053 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-02 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4306#issuecomment-72424049 [Test build #26503 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26503/consoleFull) for PR 4306 at commit

[GitHub] spark pull request: [SPARK-4979][MLLIB] Streaming logisitic regres...

2015-02-02 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4306#issuecomment-72424457 [Test build #26504 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26504/consoleFull) for PR 4306 at commit