No, please do! Link to the nutch as well, I can refer people to the JIRA to see what they think and if they might participate. Thanks!
On Sun, Mar 24, 2013 at 11:30 AM, Lewis John Mcgibbney < [email protected]> wrote: > Hi Eli, > Thanks for your response... and if your able to reach out to potential > candidates this would be excellent. > > I am working as PostDoc at Stanford so will be looking for students from > there as well. > I will keep this thread alive and would very much appreciate if you (and > others) are able to update it as well. > Giraph currently has no GSoC proposal, Nutch does. You can see all > proposals here (0). > I am tempted to log the issue right now. > It would be excellent if we could contribute something to both Giraph and > Nutch here. > Does anyone have an issue with me logging a Jira in Giraph? > Thank you > Lewis > (0) http://s.apache.org/0Xh > > On Sunday, March 24, 2013, Eli Reisman <[email protected]> wrote: > > Thanks, I have a couple folks in mind let me ask about it. They might not > > be previous Nutch or Giraph users as of now but would be very clever ;) > > > > > > On Fri, Mar 22, 2013 at 4:15 PM, Lewis John Mcgibbney < > > [email protected]> wrote: > > > >> Hi All, > >> > >> I have a huge apology to make on this one. > >> I thought that there was no interest in this proposal and therefore > dropped > >> it for the time being. > >> I reach out specifically to Claudio and Eli respectively here and > apologise > >> entirely for not getting back to you guys. > >> > >> So the questions (respectively) were the following > >> > >> 1. I do not see a direct connection between giraph and nutch, except > if > >> you want to run ranking/PageRank on the indexed stuff. But in that > case, > >> the integration is quite trivial and boils down to the inputformat. > >> Could > >> you develop your ideas further please? > >> 2. Like an internship project? I might know some people. Or did you > have > >> someone in mind? > >> > >> My answers are as follows > >> > >> 1. We anticipate the delegation of our LinkRank (PageRank > >> implementation) mechanism a graph library like Apache Giraph. This > could > >> remove a bit of code from Nutch and would hopefully be more > efficient. > >> I am > >> well aware of your contributions to Nutch Claudio :0) I think that > your > >> input here would be extremely helpful in helping us get this off of > the > >> ground. You can read a bit more about the justification behind this > >> here. > >> [0] > >> 2. The Google Summer of Code project runs every year and I have been > >> getting interested and involved in it these last few years. I was > >> looking > >> for the following > >> > >> > >> - Someone who is a student as of this May > >> - Someone who is interested in working with Giraph for graph > processing > >> (ideally an existing member of the Giraph community however this is > not > >> a > >> MUST). > >> - Someone who is interested in a Page Rank implementation within > Giraph > >> which could be utilised within Nutch (ideally also familiar with > graph > >> structures produced by a crawler such as Nutch... however again not a > >> MUST). > >> > >> Honestly if you have any potential student candidates in mind, please > reach > >> out to them. > >> > >> Additionally any feedback on the above would be excellent. I appreciate > >> that this reply is well, well overdue but I think the project would be a > >> great one to get off the ground and could be the beginning of forming a > >> nice bridge between out communities. > >> > >> Thank you very much in advance. > >> > >> Lewis > >> > >> [0] http://www.infoq.com/articles/nioche-apache-nutch2 > >> -- > >> *Lewis* > >> > > > > -- > *Lewis* >
