Github user jkbradley commented on a diff in the pull request:
https://github.com/apache/spark/pull/3871#discussion_r22512662
--- Diff:
mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala
---
@@ -17,23 +17,84 @@
package
Github user tgaloppo commented on a diff in the pull request:
https://github.com/apache/spark/pull/3871#discussion_r22521704
--- Diff:
mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala
---
@@ -17,23 +17,84 @@
package
Github user tgaloppo commented on a diff in the pull request:
https://github.com/apache/spark/pull/3871#discussion_r22545409
--- Diff:
mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala
---
@@ -17,23 +17,84 @@
package
Github user jkbradley commented on a diff in the pull request:
https://github.com/apache/spark/pull/3871#discussion_r22542741
--- Diff:
mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala
---
@@ -17,23 +17,84 @@
package
Github user asfgit closed the pull request at:
https://github.com/apache/spark/pull/3871
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is
Github user jkbradley commented on a diff in the pull request:
https://github.com/apache/spark/pull/3871#discussion_r22552390
--- Diff:
mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala
---
@@ -17,23 +17,84 @@
package
Github user jkbradley commented on the pull request:
https://github.com/apache/spark/pull/3871#issuecomment-68934738
@tgaloppo Now that my confusion is over...LGTM Thanks very much!
CC: @mengxr
---
If your project is set up for it, you can reply to this email and have your
Github user mengxr commented on the pull request:
https://github.com/apache/spark/pull/3871#issuecomment-68941542
Merged into master. Thanks!
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/3871#issuecomment-68806850
[Test build #25069 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/25069/consoleFull)
for PR 3871 at commit
Github user tgaloppo commented on the pull request:
https://github.com/apache/spark/pull/3871#issuecomment-68806762
@mengxr Changes made.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this
Github user jkbradley commented on a diff in the pull request:
https://github.com/apache/spark/pull/3871#discussion_r22502677
--- Diff:
mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala
---
@@ -17,23 +17,84 @@
package
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/3871#issuecomment-68813348
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/3871#issuecomment-68813341
[Test build #25069 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/25069/consoleFull)
for PR 3871 at commit
Github user tgaloppo commented on a diff in the pull request:
https://github.com/apache/spark/pull/3871#discussion_r22504722
--- Diff:
mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala
---
@@ -17,23 +17,84 @@
package
Github user jkbradley commented on a diff in the pull request:
https://github.com/apache/spark/pull/3871#discussion_r22507242
--- Diff:
mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala
---
@@ -17,23 +17,84 @@
package
Github user tgaloppo commented on a diff in the pull request:
https://github.com/apache/spark/pull/3871#discussion_r22508610
--- Diff:
mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala
---
@@ -17,23 +17,84 @@
package
Github user mengxr commented on a diff in the pull request:
https://github.com/apache/spark/pull/3871#discussion_r22485212
--- Diff:
mllib/src/test/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussianSuite.scala
---
@@ -0,0 +1,69 @@
+/*
+ * Licensed to the Apache
Github user mengxr commented on a diff in the pull request:
https://github.com/apache/spark/pull/3871#discussion_r22486202
--- Diff:
mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala
---
@@ -17,23 +17,74 @@
package
Github user jkbradley commented on the pull request:
https://github.com/apache/spark/pull/3871#issuecomment-68648254
@tgaloppo 2 more small comments, but after those, I believe this will be
ready. Thanks!
---
If your project is set up for it, you can reply to this email and have
Github user tgaloppo commented on the pull request:
https://github.com/apache/spark/pull/3871#issuecomment-68649845
@jkbradley Good call on the test suite; I have added some non-center points
to the tests. I also added the brackets to the in-comment link.
---
If your project is
Github user jkbradley commented on the pull request:
https://github.com/apache/spark/pull/3871#issuecomment-68650576
@tgaloppo Thanks for the updates! LGTM after tests pass
CC: @mengxr
---
If your project is set up for it, you can reply to this email and have your
reply
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/3871#issuecomment-68652965
[Test build #25038 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/25038/consoleFull)
for PR 3871 at commit
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/3871#issuecomment-68652967
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
Github user jkbradley commented on a diff in the pull request:
https://github.com/apache/spark/pull/3871#discussion_r22434157
--- Diff:
mllib/src/test/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussianSuite.scala
---
@@ -0,0 +1,61 @@
+/*
+ * Licensed to the
Github user jkbradley commented on the pull request:
https://github.com/apache/spark/pull/3871#issuecomment-68606410
@tgaloppo thanks for verifying about the test pdf values
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as
Github user tgaloppo commented on a diff in the pull request:
https://github.com/apache/spark/pull/3871#discussion_r22434487
--- Diff:
mllib/src/test/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussianSuite.scala
---
@@ -0,0 +1,61 @@
+/*
+ * Licensed to the Apache
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/3871#issuecomment-68610310
[Test build #25016 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/25016/consoleFull)
for PR 3871 at commit
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/3871#issuecomment-68607516
[Test build #25016 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/25016/consoleFull)
for PR 3871 at commit
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/3871#issuecomment-68610312
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/3871#issuecomment-68572495
[Test build #24998 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24998/consoleFull)
for PR 3871 at commit
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/3871#issuecomment-68572497
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/3871#issuecomment-68580983
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
Github user jkbradley commented on a diff in the pull request:
https://github.com/apache/spark/pull/3871#discussion_r22429027
--- Diff:
mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala
---
@@ -17,23 +17,69 @@
package
Github user jkbradley commented on a diff in the pull request:
https://github.com/apache/spark/pull/3871#discussion_r22429029
--- Diff:
mllib/src/test/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussianSuite.scala
---
@@ -0,0 +1,61 @@
+/*
+ * Licensed to the
Github user jkbradley commented on a diff in the pull request:
https://github.com/apache/spark/pull/3871#discussion_r22429025
--- Diff:
mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala
---
@@ -17,23 +17,69 @@
package
Github user jkbradley commented on a diff in the pull request:
https://github.com/apache/spark/pull/3871#discussion_r22429030
--- Diff:
mllib/src/test/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussianSuite.scala
---
@@ -0,0 +1,61 @@
+/*
+ * Licensed to the
Github user jkbradley commented on a diff in the pull request:
https://github.com/apache/spark/pull/3871#discussion_r22429041
--- Diff:
mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala
---
@@ -17,23 +17,69 @@
package
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/3871#issuecomment-68580981
[Test build #25001 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/25001/consoleFull)
for PR 3871 at commit
Github user jkbradley commented on the pull request:
https://github.com/apache/spark/pull/3871#issuecomment-68581031
@tgaloppo Thanks for the updates. Sure, the log-space computation could be
in another PR.
Just to make sure: Did you compute the PDF values in the tests using
Github user jkbradley commented on the pull request:
https://github.com/apache/spark/pull/3871#issuecomment-68562386
@tgaloppo Could you please add a description? It can be based off of the
JIRA, just enough to cover the main points of the PR. Thanks!
---
If your project is set up
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/3871#issuecomment-68578626
[Test build #25001 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/25001/consoleFull)
for PR 3871 at commit
Github user jkbradley commented on a diff in the pull request:
https://github.com/apache/spark/pull/3871#discussion_r22423972
--- Diff:
mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala
---
@@ -17,23 +17,62 @@
package
Github user jkbradley commented on a diff in the pull request:
https://github.com/apache/spark/pull/3871#discussion_r22423967
--- Diff:
mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala
---
@@ -17,23 +17,62 @@
package
Github user jkbradley commented on a diff in the pull request:
https://github.com/apache/spark/pull/3871#discussion_r22423970
--- Diff:
mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala
---
@@ -17,23 +17,62 @@
package
Github user jkbradley commented on a diff in the pull request:
https://github.com/apache/spark/pull/3871#discussion_r22423959
--- Diff:
mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala
---
@@ -17,23 +17,62 @@
package
Github user jkbradley commented on a diff in the pull request:
https://github.com/apache/spark/pull/3871#discussion_r22423964
--- Diff:
mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala
---
@@ -17,23 +17,62 @@
package
Github user jkbradley commented on a diff in the pull request:
https://github.com/apache/spark/pull/3871#discussion_r22423961
--- Diff:
mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala
---
@@ -17,23 +17,62 @@
package
Github user jkbradley commented on a diff in the pull request:
https://github.com/apache/spark/pull/3871#discussion_r22423962
--- Diff:
mllib/src/main/scala/org/apache/spark/mllib/stat/impl/MultivariateGaussian.scala
---
@@ -17,23 +17,62 @@
package
Github user jkbradley commented on the pull request:
https://github.com/apache/spark/pull/3871#issuecomment-68565593
@tgaloppo @mengxr What are your thoughts about doing the computation in
log space as much as possible, and then exponentiating at the end? I'm mainly
thinking about
Github user tgaloppo commented on the pull request:
https://github.com/apache/spark/pull/3871#issuecomment-68581355
@jkbradley I used Octave's mvnpdf from the statistics package for the
non-singular cases; it can not handle singular covariance matrices, so I was
only able to recreate
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/3871#issuecomment-68566334
[Test build #24998 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24998/consoleFull)
for PR 3871 at commit
Github user jkbradley commented on the pull request:
https://github.com/apache/spark/pull/3871#issuecomment-68565659
@tgaloppo The logic looks good; my comments are basically about clarity
(except for the log space question). Thanks for the PR!
---
If your project is set up for it,
Github user jkbradley commented on the pull request:
https://github.com/apache/spark/pull/3871#issuecomment-68565800
One more request: Could you please add a unit test with a singular matrix?
Thank you! Perhaps in a new suite for MultivariateGaussian
---
If your project is set up
Github user tgaloppo commented on the pull request:
https://github.com/apache/spark/pull/3871#issuecomment-68575728
@jkbradley I think performing the pdf calculation in log-space (and
providing a logpdf() method) is a good idea. Perhaps we can make this part of
transitioning
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/3871#issuecomment-68584507
[Test build #25006 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/25006/consoleFull)
for PR 3871 at commit
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/3871#issuecomment-68582759
[Test build #25006 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/25006/consoleFull)
for PR 3871 at commit
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/3871#issuecomment-68584510
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/3871#issuecomment-68491851
[Test build #24987 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/24987/consoleFull)
for PR 3871 at commit
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/3871#issuecomment-68491855
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
59 matches
Mail list logo