[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-30 Thread mengxr
Github user mengxr commented on the pull request: https://github.com/apache/spark/pull/4254#issuecomment-72279293 LGTM except minor user guide issues, which will be addressed in SPARK-5503. I've merged this into master. Thanks for the contributing! (Now MLlib depends on GraphX.)

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-30 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/4254 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-30 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23876362 --- Diff: docs/mllib-clustering.md --- @@ -34,6 +34,26 @@ a given dataset, the algorithm returns the best clustering result). * *initializationSteps*

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-30 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23876361 --- Diff: docs/mllib-clustering.md --- @@ -34,6 +34,26 @@ a given dataset, the algorithm returns the best clustering result). * *initializationSteps*

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-30 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4254#issuecomment-72277872 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-30 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4254#issuecomment-72265723 [Test build #26420 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26420/consoleFull) for PR 4254 at commit

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-30 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4254#issuecomment-72275436 [Test build #26420 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26420/consoleFull) for PR 4254 at commit

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-30 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4254#issuecomment-72275448 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-30 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4254#issuecomment-72277861 [Test build #26423 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26423/consoleFull) for PR 4254 at commit

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-30 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4254#issuecomment-72267039 [Test build #26423 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26423/consoleFull) for PR 4254 at commit

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23756614 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala --- @@ -0,0 +1,220 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23756595 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala --- @@ -0,0 +1,220 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23756625 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala --- @@ -0,0 +1,220 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23756569 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala --- @@ -0,0 +1,220 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23756566 --- Diff: mllib/pom.xml --- @@ -103,6 +108,13 @@ typetest-jar/type scopetest/scope /dependency +!--dependency ---

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23756564 --- Diff: docs/mllib-clustering-pic.md --- @@ -0,0 +1,30 @@ +--- +layout: global +title: Clustering - MLlib +displayTitle: a

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23756570 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala --- @@ -0,0 +1,220 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23756558 --- Diff: docs/mllib-clustering-pic.md --- @@ -0,0 +1,30 @@ +--- --- End diff -- Shall we make it a section in `mllib-clustering.md`? --- If

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23756576 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala --- @@ -0,0 +1,220 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23756601 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala --- @@ -0,0 +1,220 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23756568 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala --- @@ -0,0 +1,220 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23756632 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala --- @@ -0,0 +1,220 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23756645 --- Diff: mllib/src/test/scala/org/apache/spark/mllib/clustering/PowerIterationClusteringSuite.scala --- @@ -0,0 +1,317 @@ +/* + * Licensed to the

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23756574 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala --- @@ -0,0 +1,220 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23756588 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala --- @@ -0,0 +1,220 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23756608 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala --- @@ -0,0 +1,220 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23756634 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala --- @@ -0,0 +1,220 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23756629 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala --- @@ -0,0 +1,220 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23756594 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala --- @@ -0,0 +1,220 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23756605 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala --- @@ -0,0 +1,220 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread mengxr
Github user mengxr commented on the pull request: https://github.com/apache/spark/pull/4254#issuecomment-71997342 @fjiang6 @javadba Please focus on the public APIs first and then the implementation. The best way to check public APIs is generating the html doc and look what are

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread javadba
Github user javadba commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23803786 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala --- @@ -0,0 +1,220 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread javadba
Github user javadba commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23808251 --- Diff: mllib/src/test/scala/org/apache/spark/mllib/clustering/PowerIterationClusteringSuite.scala --- @@ -0,0 +1,317 @@ +/* + * Licensed to the

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread javadba
Github user javadba commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23807937 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala --- @@ -0,0 +1,220 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread javadba
Github user javadba commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23807026 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala --- @@ -0,0 +1,220 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread javadba
Github user javadba commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23801499 --- Diff: docs/mllib-clustering-pic.md --- @@ -0,0 +1,30 @@ +--- --- End diff -- OK --- If your project is set up for it, you can reply to

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread javadba
Github user javadba commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23802807 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala --- @@ -0,0 +1,220 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread javadba
Github user javadba commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23808004 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala --- @@ -0,0 +1,220 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread javadba
Github user javadba commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23807925 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala --- @@ -0,0 +1,220 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread javadba
Github user javadba commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23807487 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala --- @@ -0,0 +1,220 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread javadba
Github user javadba commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23807503 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala --- @@ -0,0 +1,220 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread javadba
Github user javadba commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23807540 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala --- @@ -0,0 +1,220 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread javadba
Github user javadba commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23802566 --- Diff: docs/mllib-clustering-pic.md --- @@ -0,0 +1,30 @@ +--- +layout: global +title: Clustering - MLlib +displayTitle: a

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread javadba
Github user javadba commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23802606 --- Diff: mllib/pom.xml --- @@ -103,6 +108,13 @@ typetest-jar/type scopetest/scope /dependency +!--dependency ---

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread javadba
Github user javadba commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23807449 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala --- @@ -0,0 +1,220 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread javadba
Github user javadba commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23802691 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala --- @@ -0,0 +1,220 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread javadba
Github user javadba commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23807660 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala --- @@ -0,0 +1,220 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread javadba
Github user javadba commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23802882 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala --- @@ -0,0 +1,220 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread javadba
Github user javadba commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23802897 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala --- @@ -0,0 +1,220 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23816580 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala --- @@ -0,0 +1,220 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread javadba
Github user javadba commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23818389 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala --- @@ -0,0 +1,220 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4254#issuecomment-72136140 [Test build #26354 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26354/consoleFull) for PR 4254 at commit

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread javadba
Github user javadba commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23819495 --- Diff: data/mllib/pic_data.txt --- @@ -0,0 +1,299 @@ +1000 0.000.0001250.380.012684 0.000638

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread javadba
Github user javadba commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23811947 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala --- @@ -0,0 +1,220 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23816692 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala --- @@ -0,0 +1,220 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread javadba
Github user javadba commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23818325 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala --- @@ -0,0 +1,220 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4254#issuecomment-72142763 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4254#issuecomment-72142757 [Test build #26354 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26354/consoleFull) for PR 4254 at commit

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23816854 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PowerIterationClustering.scala --- @@ -0,0 +1,220 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-29 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23819122 --- Diff: data/mllib/pic_data.txt --- @@ -0,0 +1,299 @@ +1000 0.000.0001250.380.012684 0.000638

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4254#issuecomment-71924240 [Test build #26252 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26252/consoleFull) for PR 4254 at commit

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4254#issuecomment-71924410 [Test build #26252 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26252/consoleFull) for PR 4254 at commit

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4254#issuecomment-71924413 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-28 Thread fjiang6
GitHub user fjiang6 opened a pull request: https://github.com/apache/spark/pull/4254 [SPARK-4259][MLlib]: Add Power Iteration Clustering Algorithm with Gaussian Similarity Function Add single pseudo-eigenvector PIC Including documentations, one property file and updated pom.xml

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4254#issuecomment-71947798 [Test build #26272 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26272/consoleFull) for PR 4254 at commit

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-28 Thread mengxr
Github user mengxr commented on the pull request: https://github.com/apache/spark/pull/4254#issuecomment-71942958 For spectral clustering, the input should be pairwise distances, e.g., `RDD[(Long, Long, Double)]`. It could be converted into a Graph easily with `fromEdgeRDD`. Besides,

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4254#issuecomment-71944902 [Test build #26261 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26261/consoleFull) for PR 4254 at commit

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4254#issuecomment-71944910 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-28 Thread javadba
Github user javadba commented on the pull request: https://github.com/apache/spark/pull/4254#issuecomment-71947549 I have moved out the Gaussian / Affinity matrix calculations. It is not clear where their new home / if they have a new home. Presently the testcases rely upon them -

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-28 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23728087 --- Diff: mllib/src/test/resources/log4j.mllib.properties --- @@ -0,0 +1,41 @@ +# --- End diff -- Remove this file from the PR. --- If your

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-28 Thread javadba
Github user javadba commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23728755 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PIClustering.scala --- @@ -0,0 +1,433 @@ +/* + * Licensed to the Apache Software

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-28 Thread javadba
Github user javadba commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23728797 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PIClustering.scala --- @@ -0,0 +1,433 @@ +/* + * Licensed to the Apache Software

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-28 Thread javadba
Github user javadba commented on the pull request: https://github.com/apache/spark/pull/4254#issuecomment-71929175 RE: Is it possible to do Gaussian similarity in another PR? It should be part of the feature transformation but not within PIC. It would be easier for code

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-28 Thread mengxr
Github user mengxr commented on the pull request: https://github.com/apache/spark/pull/4254#issuecomment-71925073 Is it possible to do Gaussian similarity in another PR? It should be part of the feature transformation but not within PIC. --- If your project is set up for it, you can

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-28 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23727672 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PIClustering.scala --- @@ -0,0 +1,433 @@ +/* + * Licensed to the Apache Software

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-28 Thread javadba
Github user javadba commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23728909 --- Diff: mllib/src/test/scala/org/apache/spark/mllib/clustering/PIClusteringSuite.scala --- @@ -0,0 +1,140 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-28 Thread javadba
Github user javadba commented on the pull request: https://github.com/apache/spark/pull/4254#issuecomment-71932032 Hi, We have suggestion here: to separate the creation/definition of the input graph from the PIC: val G = PIC.createGaussianAffinityMatrix(sc,

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4254#issuecomment-71936239 [Test build #26261 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26261/consoleFull) for PR 4254 at commit

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-28 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23727835 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PIClustering.scala --- @@ -0,0 +1,433 @@ +/* + * Licensed to the Apache Software

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-28 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23727970 --- Diff: mllib/src/test/scala/org/apache/spark/mllib/clustering/PIClusteringSuite.scala --- @@ -0,0 +1,140 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-28 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23728061 --- Diff: mllib/src/test/scala/org/apache/spark/mllib/clustering/PIClusteringSuite.scala --- @@ -0,0 +1,140 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-28 Thread javadba
Github user javadba commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23728804 --- Diff: mllib/src/test/scala/org/apache/spark/mllib/clustering/PIClusteringSuite.scala --- @@ -0,0 +1,140 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-28 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23727804 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PIClustering.scala --- @@ -0,0 +1,433 @@ +/* + * Licensed to the Apache Software

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-28 Thread javadba
Github user javadba commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23728761 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/clustering/PIClustering.scala --- @@ -0,0 +1,433 @@ +/* + * Licensed to the Apache Software

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-28 Thread javadba
Github user javadba commented on a diff in the pull request: https://github.com/apache/spark/pull/4254#discussion_r23728918 --- Diff: mllib/src/test/resources/log4j.mllib.properties --- @@ -0,0 +1,41 @@ +# --- End diff -- OK --- If your project is set up for it,

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4254#issuecomment-71965181 [Test build #26282 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26282/consoleFull) for PR 4254 at commit

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4254#issuecomment-71965185 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-4259][MLlib]: Add Power Iteration Clust...

2015-01-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4254#issuecomment-71959963 [Test build #26282 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/26282/consoleFull) for PR 4254 at commit