[GitHub] spark pull request: Spark-5854 personalized page rank

2015-05-01 Thread jegonzal
Github user jegonzal commented on a diff in the pull request:

https://github.com/apache/spark/pull/4774#discussion_r29521247
  
--- Diff: graphx/src/main/scala/org/apache/spark/graphx/lib/PageRank.scala 
---
@@ -103,8 +132,14 @@ object PageRank extends Logging {
   // that didn't receive a message. Requires a shuffle for 
broadcasting updated ranks to the
   // edge partitions.
   prevRankGraph = rankGraph
+  val rPrb = if (personalized) {
+(src: VertexId ,id: VertexId) = resetProb * delta(src,id)
+  } else {
+(src: VertexId, id: VertexId) = resetProb
+  }
+
   rankGraph = rankGraph.joinVertices(rankUpdates) {
-(id, oldRank, msgSum) = resetProb + (1.0 - resetProb) * msgSum
+(id, oldRank, msgSum) = rPrb(src,id) + (1.0 - resetProb) * msgSum
--- End diff --

This all looks correct but I have a minor concern that the extra function 
call and branching might increase overhead if the hotspot optimizations don't 
inline.  Do we have a sense as to the performance cost of this change?   An 
alternative, less elegant solution would be to have two code paths for lines 
141 and 142 depending on whether personalization is enabled.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-05-01 Thread jegonzal
Github user jegonzal commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-98196331
  
Overall this looks great!  I apologize for the delayed response.   I am 
going to go ahead and merge this now and then we can tune the performance in a 
later pull request. 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-05-01 Thread dwmclary
Github user dwmclary commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-98199444
  
Thanks Joey, I appreciate it.  I can see your concern w/r/t the branching.  
If I can get some HW and time, I'll see if I notice a performance regression 
with the change.

Cheers,
Dan

Sent from my iPhone

 On May 1, 2015, at 11:22 AM, Joey notificati...@github.com wrote:
 
 Overall this looks great! I apologize for the delayed response. I am 
going to go ahead and merge this now and then we can tune the performance in a 
later pull request.
 
 —
 Reply to this email directly or view it on GitHub.
 



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-05-01 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/spark/pull/4774


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-04-27 Thread dwmclary
Github user dwmclary commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-96816965
  
@jegonzal does this algorithm look correct to you?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-04-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-96769570
  
Can one of the admins verify this patch?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-04-22 Thread dwmclary
Github user dwmclary commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-95256308
  
@jegonzal does this algorithm look correct to you? 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-04-17 Thread dwmclary
Github user dwmclary commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-94025941
  
OK, I'll update w/r/t the comments today.  I'd appreciate it if someone
took a glance at the algorithm; it's as specified in the referred paper,
but another set of eyes would be very, very helpful.

On Wed, Apr 15, 2015 at 2:43 PM, Ankur Dave notificati...@github.com
wrote:

 This looks good aside from the comments. I didn't verify the algorithm; it
 would be good if @jegonzal https://github.com/jegonzal could take a
 look.

 —
 Reply to this email directly or view it on GitHub
 https://github.com/apache/spark/pull/4774#issuecomment-93577536.




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-04-17 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-94093081
  
  [Test build #30507 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30507/consoleFull)
 for   PR 4774 at commit 
[`d6cebac`](https://github.com/apache/spark/commit/d6cebac881756938bfad7b10bc9674083715a2c1).
 * This patch **does not merge cleanly**.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-04-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-94121865
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/30511/
Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-04-17 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-94121860
  
  [Test build #30511 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30511/consoleFull)
 for   PR 4774 at commit 
[`8b907db`](https://github.com/apache/spark/commit/8b907db18ddc2ae5c0d16380e824c15e76183567).
 * This patch **passes all tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.
 * This patch does not change any dependencies.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-04-17 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-94107018
  
  [Test build #30509 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30509/consoleFull)
 for   PR 4774 at commit 
[`2c20e5d`](https://github.com/apache/spark/commit/2c20e5d64749c533bc5fdf7304ed175cef686644).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-04-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-94107117
  
Test FAILed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/30509/
Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-04-17 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-94107110
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/30507/
Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-04-17 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-94107116
  
  [Test build #30509 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30509/consoleFull)
 for   PR 4774 at commit 
[`2c20e5d`](https://github.com/apache/spark/commit/2c20e5d64749c533bc5fdf7304ed175cef686644).
 * This patch **fails Scala style tests**.
 * This patch merges cleanly.
 * This patch adds the following public classes _(experimental)_:
  * `  case class Params(`
  * `sealed abstract class Node extends Serializable `
  * `sealed trait Split extends Serializable `
  * `final class CategoricalSplit(`
  * `final class ContinuousSplit(override val featureIndex: Int, val 
threshold: Double) extends Split `
  * `trait DecisionTreeModel `

 * This patch does not change any dependencies.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-04-17 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-94107104
  
  [Test build #30507 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30507/consoleFull)
 for   PR 4774 at commit 
[`d6cebac`](https://github.com/apache/spark/commit/d6cebac881756938bfad7b10bc9674083715a2c1).
 * This patch **passes all tests**.
 * This patch **does not merge cleanly**.
 * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-04-17 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-94112401
  
  [Test build #30511 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30511/consoleFull)
 for   PR 4774 at commit 
[`8b907db`](https://github.com/apache/spark/commit/8b907db18ddc2ae5c0d16380e824c15e76183567).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-04-15 Thread dwmclary
Github user dwmclary commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-93562171
  
Is this going to get merged at some point?

On Tue, Mar 31, 2015 at 10:51 AM, Yusup notificati...@github.com wrote:

 +1

 —
 Reply to this email directly or view it on GitHub
 https://github.com/apache/spark/pull/4774#issuecomment-88186888.




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-04-15 Thread srowen
Github user srowen commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-93562703
  
@dwmclary This doesn't merge, so would at least need a rebase. It's up to 
@ankurdave and @jegonzal et al.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-04-15 Thread ankurdave
Github user ankurdave commented on a diff in the pull request:

https://github.com/apache/spark/pull/4774#discussion_r28465870
  
--- Diff: graphx/src/main/scala/org/apache/spark/graphx/lib/PageRank.scala 
---
@@ -133,7 +167,29 @@ object PageRank extends Logging {
* containing the normalized weight.
*/
   def runUntilConvergence[VD: ClassTag, ED: ClassTag](
-  graph: Graph[VD, ED], tol: Double, resetProb: Double = 0.15): 
Graph[Double, Double] =
+ graph: Graph[VD, ED], tol: Double, resetProb: Double = 0.15): 
Graph[Double, Double] =
--- End diff --

style: indent parameter lists by 4 spaces


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-04-15 Thread ankurdave
Github user ankurdave commented on a diff in the pull request:

https://github.com/apache/spark/pull/4774#discussion_r28465105
  
--- Diff: graphx/src/main/scala/org/apache/spark/graphx/lib/PageRank.scala 
---
@@ -89,6 +114,10 @@ object PageRank extends Logging {
   // Set the vertex attributes to the initial pagerank values
   .mapVertices( (id, attr) = resetProb )
 
+val personalized = srcId isDefined
+val src:VertexId = srcId.getOrElse(-1L)
+def delta(u:VertexId, v:VertexId):Double = { if (u == v) 1.0 else 0.0 }
--- End diff --

Minor style comment: there should be a space after the colon here and on 
the line above.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-04-15 Thread ankurdave
Github user ankurdave commented on a diff in the pull request:

https://github.com/apache/spark/pull/4774#discussion_r28465270
  
--- Diff: graphx/src/main/scala/org/apache/spark/graphx/lib/PageRank.scala 
---
@@ -133,7 +167,29 @@ object PageRank extends Logging {
* containing the normalized weight.
*/
   def runUntilConvergence[VD: ClassTag, ED: ClassTag](
-  graph: Graph[VD, ED], tol: Double, resetProb: Double = 0.15): 
Graph[Double, Double] =
+ graph: Graph[VD, ED], tol: Double, resetProb: Double = 0.15): 
Graph[Double, Double] =
+  {
+  runUntilConvergenceWithOptions(graph, tol, resetProb)
+  }
+
+  /**
+   * Run a dynamic version of PageRank returning a graph with vertex 
attributes containing the
+   * PageRank and edge attributes containing the normalized edge weight.
+   *
+   * @tparam VD the original vertex attribute (not used)
+   * @tparam ED the original edge attribute (not used)
+   *
+   * @param graph the graph on which to compute PageRank
+   * @param tol the tolerance allowed at convergence (smaller = more 
accurate).
+   * @param resetProb the random reset probability (alpha)
+   * @param srcId the source vertex for a Personalized Page Rank (optional)
+   *
+   * @return the graph containing with each vertex containing the PageRank 
and each edge
+   * containing the normalized weight.
+   */
+  def runUntilConvergenceWithOptions[VD: ClassTag, ED: ClassTag](
+  graph: Graph[VD, ED], tol: Double, resetProb: Double = 0.15,
+   srcId: Option[VertexId] = None): Graph[Double, Double] =
--- End diff --

style: inconsistent indent


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-04-15 Thread ankurdave
Github user ankurdave commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-93577536
  
This looks good aside from the comments. I didn't verify the algorithm; it 
would be good if @jegonzal could take a look.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-04-15 Thread ankurdave
Github user ankurdave commented on a diff in the pull request:

https://github.com/apache/spark/pull/4774#discussion_r28465288
  
--- Diff: graphx/src/main/scala/org/apache/spark/graphx/lib/PageRank.scala 
---
@@ -148,6 +204,10 @@ object PageRank extends Logging {
   .mapVertices( (id, attr) = (0.0, 0.0) )
   .cache()
 
+val personalized = srcId isDefined
--- End diff --

style: prefer `srcId.isDefined`


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-04-15 Thread ankurdave
Github user ankurdave commented on a diff in the pull request:

https://github.com/apache/spark/pull/4774#discussion_r28465211
  
--- Diff: graphx/src/main/scala/org/apache/spark/graphx/lib/PageRank.scala 
---
@@ -104,7 +133,12 @@ object PageRank extends Logging {
   // edge partitions.
   prevRankGraph = rankGraph
   rankGraph = rankGraph.joinVertices(rankUpdates) {
+if (personalized) {
--- End diff --

It would be better to move the conditional outside the closure (i.e., `if 
(personalized) { rankGraph.joinVertices ... } else { ...} `) to avoid a branch 
at each vertex.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-04-15 Thread ankurdave
Github user ankurdave commented on a diff in the pull request:

https://github.com/apache/spark/pull/4774#discussion_r28465575
  
--- Diff: graphx/src/main/scala/org/apache/spark/graphx/lib/PageRank.scala 
---
@@ -170,8 +241,16 @@ object PageRank extends Logging {
 val initialMessage = resetProb / (1.0 - resetProb)
 
 // Execute a dynamic version of Pregel.
-Pregel(pagerankGraph, initialMessage, activeDirection = 
EdgeDirection.Out)(
-  vertexProgram, sendMessage, messageCombiner)
-  .mapVertices((vid, attr) = attr._1)
+if (personalized) {
+  Pregel(pagerankGraph, initialMessage, activeDirection = 
EdgeDirection.Out)(
+personalizedVertexProgram, sendMessage, messageCombiner)
+.mapVertices((vid, attr) = attr._1)
+} else {
+  Pregel(pagerankGraph, initialMessage, activeDirection = 
EdgeDirection.Out)(
+vertexProgram, sendMessage, messageCombiner)
+.mapVertices((vid, attr) = attr._1)
+}
--- End diff --

I think you could simplify this to something like
```scala
val vp = if (personalized) personalizedVertexProgram else vertexProgram
Pregel(pagerankGraph, initialMessage, activeDirection = EdgeDirection.Out)(
  vp, sendMessage, messageCombiner)
  .mapVertices((vid, attr) = attr._1)
```


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-03-31 Thread xperian
Github user xperian commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-88186888
  
+1 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-03-19 Thread dwmclary
Github user dwmclary commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-83669688
  
Good to merge?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-03-16 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-81538617
  
  [Test build #28641 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28641/consoleFull)
 for   PR 4774 at commit 
[`fba0edd`](https://github.com/apache/spark/commit/fba0eddf3f905b5485585785feaeb082de4b3eb2).
 * This patch **fails MiMa tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-03-16 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-81538624
  
Test FAILed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/28641/
Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-03-16 Thread dwmclary
Github user dwmclary commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-81829391
  
OK, that should fix the binary incompatibility on the vertexProgram.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-03-16 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-81831186
  
  [Test build #28661 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28661/consoleFull)
 for   PR 4774 at commit 
[`d711677`](https://github.com/apache/spark/commit/d71167712ff8da3707214751c07b0932bfc5bb72).
 * This patch merges cleanly.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-03-16 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-81847916
  
  [Test build #28664 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28664/consoleFull)
 for   PR 4774 at commit 
[`7d00c23`](https://github.com/apache/spark/commit/7d00c2386d643aa3d4fb248cfcbe5c7183bfaeba).
 * This patch merges cleanly.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-03-16 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-81831781
  
  [Test build #28661 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28661/consoleFull)
 for   PR 4774 at commit 
[`d711677`](https://github.com/apache/spark/commit/d71167712ff8da3707214751c07b0932bfc5bb72).
 * This patch **fails Scala style tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-03-16 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-81831805
  
Test FAILed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/28661/
Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-03-16 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-81886679
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/28664/
Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-03-16 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-81886657
  
  [Test build #28664 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28664/consoleFull)
 for   PR 4774 at commit 
[`7d00c23`](https://github.com/apache/spark/commit/7d00c2386d643aa3d4fb248cfcbe5c7183bfaeba).
 * This patch **passes all tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-03-16 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-81482802
  
  [Test build #28641 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28641/consoleFull)
 for   PR 4774 at commit 
[`fba0edd`](https://github.com/apache/spark/commit/fba0eddf3f905b5485585785feaeb082de4b3eb2).
 * This patch merges cleanly.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-03-16 Thread ankurdave
Github user ankurdave commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-81480270
  
ok to test


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-03-13 Thread dwmclary
Github user dwmclary commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-78824623
  
OK, got 'em.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-03-12 Thread rxin
Github user rxin commented on a diff in the pull request:

https://github.com/apache/spark/pull/4774#discussion_r26365904
  
--- Diff: graphx/src/main/scala/org/apache/spark/graphx/lib/PageRank.scala 
---
@@ -148,11 +204,20 @@ object PageRank extends Logging {
   .mapVertices( (id, attr) = (0.0, 0.0) )
   .cache()
 
+val personalized = if (srcId isDefined) true else false
+val src:VertexId = srcId.getOrElse(-1L)
--- End diff --

add a space after :


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-03-12 Thread rxin
Github user rxin commented on a diff in the pull request:

https://github.com/apache/spark/pull/4774#discussion_r26365902
  
--- Diff: graphx/src/main/scala/org/apache/spark/graphx/lib/PageRank.scala 
---
@@ -148,11 +204,20 @@ object PageRank extends Logging {
   .mapVertices( (id, attr) = (0.0, 0.0) )
   .cache()
 
+val personalized = if (srcId isDefined) true else false
--- End diff --

this is just 
```
val personalized = srcId.isDefined
```


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-03-12 Thread zsxwing
Github user zsxwing commented on a diff in the pull request:

https://github.com/apache/spark/pull/4774#discussion_r26362827
  
--- Diff: graphx/src/main/scala/org/apache/spark/graphx/lib/PageRank.scala 
---
@@ -74,10 +76,35 @@ object PageRank extends Logging {
*
* @return the graph containing with each vertex containing the PageRank 
and each edge
* containing the normalized weight.
+   */
+
+  def run[VD: ClassTag, ED: ClassTag](graph: Graph[VD, ED], numIter: Int,
+resetProb: Double = 0.15): Graph[Double, Double] =
+  {
+runWithOptions(graph, numIter, resetProb)
+  }
+
+  /**
+   * Run PageRank for a fixed number of iterations returning a graph
+   * with vertex attributes containing the PageRank and edge
+   * attributes the normalized edge weight.
+   *
+   * @tparam VD the original vertex attribute (not used)
+   * @tparam ED the original edge attribute (not used)
+   *
+   * @param graph the graph on which to compute PageRank
+   * @param numIter the number of iterations of PageRank to run
+   * @param resetProb the random reset probability (alpha)
+   * @param srcId the source vertex for a Personalized Page Rank (optional)
+   *
+   * @return the graph containing with each vertex containing the PageRank 
and each edge
+   * containing the normalized weight.
*
*/
-  def run[VD: ClassTag, ED: ClassTag](
-  graph: Graph[VD, ED], numIter: Int, resetProb: Double = 0.15): 
Graph[Double, Double] =
+
--- End diff --

The extra space line is redundant. Could you remove all extra spaces 
between comments and methods in this file?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-03-12 Thread dwmclary
Github user dwmclary commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-78804717
  
Whitespace removed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-03-12 Thread dwmclary
Github user dwmclary commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-78766347
  
OK, that should be a reasonable solution.  Thanks for the advice @rxin.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-03-12 Thread dwmclary
Github user dwmclary commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-78614976
  
I certainly agree that binary compatibility matters.  I think it's mainly a
question of which is more desirable: fewer repeated LOC or binary
compatibility.

I can write new methods for each of the approaches to PPR, but it would
require repeating a lot of the pregel boilerplate.  If you feel strongly
that's a better approach, I'm happy to make the change.

On Thu, Mar 12, 2015 at 1:40 PM, Sean Owen notificati...@github.com wrote:

 Yes but binary compatibility also matters. I can't see the Jenkins output
 anymore, but I recall thinking it was probably legitimate. Can you write
 this differently without optional args maybe?

 —
 Reply to this email directly or view it on GitHub
 https://github.com/apache/spark/pull/4774#issuecomment-78606657.




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-03-12 Thread rxin
Github user rxin commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-78616511
  
Also you don't necessarily need to duplicate a lot of code. You can have 
one Pregel function calling another.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-03-12 Thread rxin
Github user rxin commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-78616278
  
Sorry binary compatibility is not a preference. It is a requirement here 
... otherwise it is a huge hassle for people to upgrade, which leads to more 
fragmentation in versions.



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-03-12 Thread srowen
Github user srowen commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-78606657
  
Yes but binary compatibility also matters. I can't see the Jenkins output 
anymore, but I recall thinking it was probably legitimate. Can you write this 
differently without optional args maybe?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-03-12 Thread dwmclary
Github user dwmclary commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-78594489
  
Does anyone have a comment on this MiMa failure?  The fact that 
PageRankSuite passes illustrates that it's source compatible.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-03-05 Thread srowen
Github user srowen commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-77434500
  
I think this change actually introduces binary incompatability even though 
you have only added optional args. It may be source compatible. 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-03-05 Thread dwmclary
Github user dwmclary commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-77436963
  
OK, thanks Sean, that was my reading of it too.

On Thu, Mar 5, 2015 at 11:32 AM, Sean Owen notificati...@github.com wrote:

 I think this change actually introduces binary incompatability even though
 you have only added optional args. It may be source compatible.

 —
 Reply to this email directly or view it on GitHub
 https://github.com/apache/spark/pull/4774#issuecomment-77434500.




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-03-05 Thread dwmclary
Github user dwmclary commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-77433149
  
I'm not really sure what to do about this MiMa error.  Suggestions?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-03-05 Thread dwmclary
Github user dwmclary commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-77398055
  
test this please


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-03-05 Thread shaneknapp
Github user shaneknapp commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-77410205
  
jenkins, test this please


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-03-05 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-77410688
  
  [Test build #28301 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28301/consoleFull)
 for   PR 4774 at commit 
[`44ada8e`](https://github.com/apache/spark/commit/44ada8e02bd2d307a17ee5186e636877d25b3e03).
 * This patch merges cleanly.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-03-05 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-77427253
  
  [Test build #28301 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28301/consoleFull)
 for   PR 4774 at commit 
[`44ada8e`](https://github.com/apache/spark/commit/44ada8e02bd2d307a17ee5186e636877d25b3e03).
 * This patch **fails MiMa tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-03-05 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-77427282
  
Test FAILed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/28301/
Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-02-26 Thread ankurdave
Github user ankurdave commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-76252508
  
Cool!

ok to test


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-02-25 Thread dwmclary
GitHub user dwmclary opened a pull request:

https://github.com/apache/spark/pull/4774

Spark-5854 personalized page rank

Here's a modification to PageRank which does personalized PageRank.  The 
approach is basically similar to that outlined by Bahmani et al. from 2010 
(http://arxiv.org/pdf/1006.2880.pdf).

I'm sure this needs tuning up or other considerations, so let me know how I 
can improve this.

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/dwmclary/spark 
SPARK-5854-Personalized-PageRank

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/4774.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #4774


commit 6dc2c29b6d4ab6eb5ceb34b0f6e135843dd6cdd9
Author: Dan McClary dan.mccl...@gmail.com
Date:   2015-02-24T07:24:06Z

initial implementation of personalized page rank

commit a5609420b897df357079a7cbe3dfaf155f72f7b7
Author: Dan McClary dan.mccl...@gmail.com
Date:   2015-02-25T22:30:54Z

rolled PPR into pregel code for PageRank

commit b67ac693896e2d6c8d080f107e0e86343a522387
Author: Dan McClary dan.mccl...@gmail.com
Date:   2015-02-25T22:43:59Z

updated tolerance on chain PPR

commit 1ffed95b640c3933976ddfd2ecce1a7fa60c83a3
Author: Dan McClary dan.mccl...@gmail.com
Date:   2015-02-25T22:49:28Z

updated tolerance on chain PPR

commit 44ada8e02bd2d307a17ee5186e636877d25b3e03
Author: Dan McClary dan.mccl...@gmail.com
Date:   2015-02-25T22:51:40Z

updated tolerance on chain PPR




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-02-25 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-76086384
  
Can one of the admins verify this patch?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: Spark-5854 personalized page rank

2015-02-25 Thread rxin
Github user rxin commented on the pull request:

https://github.com/apache/spark/pull/4774#issuecomment-76120556
  
cc Ankur @ankurdave 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org