[
https://issues.apache.org/jira/browse/GIRAPH-584?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13615058#comment-13615058
]
Sebastian Schelter commented on GIRAPH-584:
-------------------------------------------
Looks like a very good idea. I have a few suggestions: AFAIK, LinkRank is a
slight modification of PageRank and also relies on doing a random walk on the
link graph. We already have some tooling for this (see GIRAPH-480 for the
latest developments) that should be leveraged.
Furthermore, it would be great to get some performance numbers on a large
graph. For example, you could use the WebBase dataset
(http://law.di.unimi.it/webdata/webbase-2001/), which has more than a billion
links.
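For reference, the random-walk formulation mentioned above can be sketched in a few lines of plain Python. This is the textbook PageRank power iteration, not Nutch's LinkRank and not the Giraph API; the function name, the damping factor of 0.85, the iteration count, and the toy graph are all assumptions made for illustration.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Power-iteration PageRank over a dict {page: [pages it links to]}."""
    # Collect every page, including ones that only appear as link targets.
    pages = set(links)
    for targets in links.values():
        pages.update(targets)
    n = len(pages)

    # Start the random walk from a uniform distribution.
    rank = {p: 1.0 / n for p in pages}

    for _ in range(iterations):
        # Rank held by pages without outlinks is spread evenly over all pages.
        dangling = sum(rank[p] for p in pages if not links.get(p))
        new_rank = {p: (1.0 - damping) / n + damping * dangling / n
                    for p in pages}
        # Each page passes an equal share of its rank along every outlink.
        for page, targets in links.items():
            if targets:
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
        rank = new_rank
    return rank


# Hypothetical four-page link graph, used only to exercise the function.
toy_graph = {"a": ["b", "c"], "b": ["c"], "c": ["a"], "d": ["c"]}
scores = pagerank(toy_graph)
```

In a vertex-centric system the inner loop would correspond to each vertex sending its share along its out-edges once per superstep, which is why the same tooling could plausibly serve both algorithms.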
> Giraph implementation of Nutch LinkRank Algorithm
> -------------------------------------------------
>
> Key: GIRAPH-584
> URL: https://issues.apache.org/jira/browse/GIRAPH-584
> Project: Giraph
> Issue Type: Task
> Components: graph
> Environment: Nutch trunk branch
> http://svn.apache.org/repos/asf/nutch/trunk/
> Reporter: Lewis John McGibbney
> Labels: gsoc2013
>
> This issue is initially aimed at the delegation of the Nutch page rank
> mechanism (called LinkRank) to Apache Giraph.
> The motivation for this is simple: we envisage that this would probably
> save us quite a bit of code and be more efficient.
> The idea is to attract and build interest around this issue and propose it
> for this year's Google Summer of Code.
> We would like to build links between the Giraph and Nutch communities and
> make both Nutch and Giraph better in the process.
> INTERESTED MENTORS
> ------------------
> Lewis John McGibbney - lewismc at apache dot org
> INTERESTED STUDENTS
> -------------------
> Student name - Student email
> Ahmet Emre Aladag - emre.aladag at agmlab dot com
> Related threads
> [0] http://s.apache.org/Sz9
> [1] http://s.apache.org/ssa
> [2] http://wiki.apache.org/nutch/CommandLineOptions#Webgraph_classes