[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-06 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/3871#discussion_r22512662 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala --- @@ -17,23 +17,84 @@ package

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-06 Thread tgaloppo
Github user tgaloppo commented on a diff in the pull request: https://github.com/apache/spark/pull/3871#discussion_r22521704 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala --- @@ -17,23 +17,84 @@ package

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-06 Thread tgaloppo
Github user tgaloppo commented on a diff in the pull request: https://github.com/apache/spark/pull/3871#discussion_r22545409 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala --- @@ -17,23 +17,84 @@ package

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-06 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/3871#discussion_r22542741 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala --- @@ -17,23 +17,84 @@ package

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-06 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/3871 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-06 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/3871#discussion_r22552390 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala --- @@ -17,23 +17,84 @@ package

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-06 Thread jkbradley
Github user jkbradley commented on the pull request: https://github.com/apache/spark/pull/3871#issuecomment-68934738 @tgaloppo Now that my confusion is over...LGTM Thanks very much! CC: @mengxr --- If your project is set up for it, you can reply to this email and have your

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-06 Thread mengxr
Github user mengxr commented on the pull request: https://github.com/apache/spark/pull/3871#issuecomment-68941542 Merged into master. Thanks! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-05 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/3871#issuecomment-68806850 [Test build #25069 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/25069/consoleFull) for PR 3871 at commit

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-05 Thread tgaloppo
Github user tgaloppo commented on the pull request: https://github.com/apache/spark/pull/3871#issuecomment-68806762 @mengxr Changes made. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-05 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/3871#discussion_r22502677 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala --- @@ -17,23 +17,84 @@ package

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-05 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/3871#issuecomment-68813348 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-05 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/3871#issuecomment-68813341 [Test build #25069 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/25069/consoleFull) for PR 3871 at commit

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-05 Thread tgaloppo
Github user tgaloppo commented on a diff in the pull request: https://github.com/apache/spark/pull/3871#discussion_r22504722 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala --- @@ -17,23 +17,84 @@ package

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-05 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/3871#discussion_r22507242 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala --- @@ -17,23 +17,84 @@ package

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-05 Thread tgaloppo
Github user tgaloppo commented on a diff in the pull request: https://github.com/apache/spark/pull/3871#discussion_r22508610 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala --- @@ -17,23 +17,84 @@ package

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-05 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/3871#discussion_r22485212 --- Diff: mllib/src/test/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussianSuite.scala --- @@ -0,0 +1,69 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-05 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/3871#discussion_r22486202 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala --- @@ -17,23 +17,74 @@ package

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-04 Thread jkbradley
Github user jkbradley commented on the pull request: https://github.com/apache/spark/pull/3871#issuecomment-68648254 @tgaloppo 2 more small comments, but after those, I believe this will be ready. Thanks! --- If your project is set up for it, you can reply to this email and have

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-04 Thread tgaloppo
Github user tgaloppo commented on the pull request: https://github.com/apache/spark/pull/3871#issuecomment-68649845 @jkbradley Good call on the test suite; I have added some non-center points to the tests. I also added the brackets to the in-comment link. --- If your project is

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-04 Thread jkbradley
Github user jkbradley commented on the pull request: https://github.com/apache/spark/pull/3871#issuecomment-68650576 @tgaloppo Thanks for the updates! LGTM after tests pass CC: @mengxr --- If your project is set up for it, you can reply to this email and have your reply

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-04 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/3871#issuecomment-68652965 [Test build #25038 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/25038/consoleFull) for PR 3871 at commit

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-04 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/3871#issuecomment-68652967 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-03 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/3871#discussion_r22434157 --- Diff: mllib/src/test/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussianSuite.scala --- @@ -0,0 +1,61 @@ +/* + * Licensed to the

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-03 Thread jkbradley
Github user jkbradley commented on the pull request: https://github.com/apache/spark/pull/3871#issuecomment-68606410 @tgaloppo thanks for verifying about the test pdf values --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-03 Thread tgaloppo
Github user tgaloppo commented on a diff in the pull request: https://github.com/apache/spark/pull/3871#discussion_r22434487 --- Diff: mllib/src/test/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussianSuite.scala --- @@ -0,0 +1,61 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-03 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/3871#issuecomment-68610310 [Test build #25016 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/25016/consoleFull) for PR 3871 at commit

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-03 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/3871#issuecomment-68607516 [Test build #25016 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/25016/consoleFull) for PR 3871 at commit

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-03 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/3871#issuecomment-68610312 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-02 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/3871#issuecomment-68572495 [Test build #24998 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24998/consoleFull) for PR 3871 at commit

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-02 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/3871#issuecomment-68572497 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-02 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/3871#issuecomment-68580983 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-02 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/3871#discussion_r22429027 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala --- @@ -17,23 +17,69 @@ package

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-02 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/3871#discussion_r22429029 --- Diff: mllib/src/test/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussianSuite.scala --- @@ -0,0 +1,61 @@ +/* + * Licensed to the

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-02 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/3871#discussion_r22429025 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala --- @@ -17,23 +17,69 @@ package

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-02 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/3871#discussion_r22429030 --- Diff: mllib/src/test/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussianSuite.scala --- @@ -0,0 +1,61 @@ +/* + * Licensed to the

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-02 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/3871#discussion_r22429041 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala --- @@ -17,23 +17,69 @@ package

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-02 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/3871#issuecomment-68580981 [Test build #25001 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/25001/consoleFull) for PR 3871 at commit

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-02 Thread jkbradley
Github user jkbradley commented on the pull request: https://github.com/apache/spark/pull/3871#issuecomment-68581031 @tgaloppo Thanks for the updates. Sure, the log-space computation could be in another PR. Just to make sure: Did you compute the PDF values in the tests using

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-02 Thread jkbradley
Github user jkbradley commented on the pull request: https://github.com/apache/spark/pull/3871#issuecomment-68562386 @tgaloppo Could you please add a description? It can be based off of the JIRA, just enough to cover the main points of the PR. Thanks! --- If your project is set up

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-02 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/3871#issuecomment-68578626 [Test build #25001 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/25001/consoleFull) for PR 3871 at commit

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-02 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/3871#discussion_r22423972 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala --- @@ -17,23 +17,62 @@ package

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-02 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/3871#discussion_r22423967 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala --- @@ -17,23 +17,62 @@ package

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-02 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/3871#discussion_r22423970 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala --- @@ -17,23 +17,62 @@ package

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-02 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/3871#discussion_r22423959 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala --- @@ -17,23 +17,62 @@ package

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-02 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/3871#discussion_r22423964 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala --- @@ -17,23 +17,62 @@ package

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-02 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/3871#discussion_r22423961 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala --- @@ -17,23 +17,62 @@ package

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-02 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/3871#discussion_r22423962 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala --- @@ -17,23 +17,62 @@ package

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-02 Thread jkbradley
Github user jkbradley commented on the pull request: https://github.com/apache/spark/pull/3871#issuecomment-68565593 @tgaloppo @mengxr What are your thoughts about doing the computation in log space as much as possible, and then exponentiating at the end? I'm mainly thinking about

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-02 Thread tgaloppo
Github user tgaloppo commented on the pull request: https://github.com/apache/spark/pull/3871#issuecomment-68581355 @jkbradley I used Octave's mvnpdf from the statistics package for the non-singular cases; it can not handle singular covariance matrices, so I was only able to recreate

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-02 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/3871#issuecomment-68566334 [Test build #24998 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24998/consoleFull) for PR 3871 at commit

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-02 Thread jkbradley
Github user jkbradley commented on the pull request: https://github.com/apache/spark/pull/3871#issuecomment-68565659 @tgaloppo The logic looks good; my comments are basically about clarity (except for the log space question). Thanks for the PR! --- If your project is set up for it,

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-02 Thread jkbradley
Github user jkbradley commented on the pull request: https://github.com/apache/spark/pull/3871#issuecomment-68565800 One more request: Could you please add a unit test with a singular matrix? Thank you! Perhaps in a new suite for MultivariateGaussian --- If your project is set up

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-02 Thread tgaloppo
Github user tgaloppo commented on the pull request: https://github.com/apache/spark/pull/3871#issuecomment-68575728 @jkbradley I think performing the pdf calculation in log-space (and providing a logpdf() method) is a good idea. Perhaps we can make this part of transitioning

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-02 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/3871#issuecomment-68584507 [Test build #25006 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/25006/consoleFull) for PR 3871 at commit

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-02 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/3871#issuecomment-68582759 [Test build #25006 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/25006/consoleFull) for PR 3871 at commit

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-02 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/3871#issuecomment-68584510 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-01 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/3871#issuecomment-68491851 [Test build #24987 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24987/consoleFull) for PR 3871 at commit

[GitHub] spark pull request: SPARK-5017 [MLlib] - Use SVD to compute determ...

2015-01-01 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/3871#issuecomment-68491855 Test PASSed. Refer to this link for build results (access rights to CI server needed):