[Dbpedia-discussion] MapReduce expert needed to help DBpedia [as GSoC co-mentor]

2014-03-06 Thread Dimitris Kontokostas
Dear all,

We want to adapt the DBpedia extraction framework to work with a MapReduce
framework. [1]

We want to implement this idea through GSoC 14 and already have two
interested students [2] [3].
Unfortunately, we are not experienced in this field and our existing
contacts could not join. Thus, we are looking for someone to help us
mentor the technical aspects of this project.

About GSoC (http://en.wikipedia.org/wiki/GSoC)
The *Google Summer of Code* (*GSoC*) is an annual program, first held from
May to August 2005, in which Google awards stipends (of US$5,500, as of
2014) to all students who successfully complete a requested free and
open-source software coding project during the summer.
See some additional info on our page [4]

Best,
Dimitris

[1] http://wiki.dbpedia.org/gsoc2014/ideas/ExtractionwithMapReduce/
[2] student #1: http://sourceforge.net/mailarchive/forum.php?thread_name=CA%2Bu4%2Ba3g3dSd9L%3DM173hryYPp9HjwtNYgUU6Jcedy9MUAmzMVA%40mail.gmail.comforum_name=dbpedia-gsoc
[3] student #2: http://sourceforge.net/p/dbpedia/mailman/dbpedia-gsoc/thread/CAOk94WbB7%2BEzaWveP4OWCGeXvKdVUv790wAL%2BuRsoxTb1VEDeQ%40mail.gmail.com/#msg32063932
[4] http://wiki.dbpedia.org/gsoc2014?v=kx0#h358-6


-- 
Dimitris Kontokostas
Department of Computer Science, University of Leipzig
Research Group: http://aksw.org
Homepage: http://aksw.org/DimitrisKontokostas
--
Dbpedia-discussion mailing list
Dbpedia-discussion@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion


Re: [Dbpedia-discussion] MapReduce expert needed to help DBpedia [as GSoC co-mentor]

2014-03-06 Thread Nicolas Torzec
Great idea and much needed move ;)

Within the Hadoop platform, the MapReduce framework is focused on distributed 
batch processing.
Other frameworks are more focused on streaming…
=> Have you considered the pros and cons?

FYI, we are using the DBpedia Extraction framework at Yahoo Labs for some 
projects, and have been thinking about porting it to Hadoop for some time.
We may be able to help…

--
Nicolas Torzec
Yahoo Labs


From: Dimitris Kontokostas kontokos...@informatik.uni-leipzig.de
Date: Thursday, March 6, 2014 at 5:04 AM
To: semantic-...@w3.org, Linked Data community public-...@w3.org,
 DBpedia Discussions dbpedia-discussion@lists.sourceforge.net,
 DBpediaDevelopers dbpedia-develop...@lists.sourceforge.net,
 dbp-spotlight-us...@lists.sourceforge.net,
 DBpediaSpotlight Developers dbp-spotlight-develop...@lists.sourceforge.net
Subject: [Dbpedia-discussion] MapReduce expert needed to help DBpedia [as GSoC
 co-mentor]



Re: [Dbpedia-discussion] MapReduce expert needed to help DBpedia [as GSoC co-mentor]

2014-03-06 Thread Dimitris Kontokostas
On Thu, Mar 6, 2014 at 6:42 PM, Nicolas Torzec torz...@yahoo-inc.com wrote:

> Great idea and much needed move ;)

Really good to know that :) We don't have a direct use case for this idea;
we just thought it would increase DBpedia usage in big data pipelines.

> Within the Hadoop platform, the MapReduce framework is focused on
> distributed batch processing.
> Other frameworks are more focused on streaming…
> => Have you considered the pros and cons?

Actually no, this is one of the reasons we need the expert. We have a very
general idea of the existing frameworks but cannot make that decision with
confidence.

> FYI, we are using the DBpedia Extraction framework at Yahoo Labs for some
> projects, and have been thinking about porting it to Hadoop for some time.
> We may be able to help…


Since you (Yahoo Labs) can provide one of our use cases, it could all fit
very well.
However, we'd need some commitment. The application period starts next week
[1] and if we don't find anyone we'll have to drop this.

Regarding the workflow, we will provide the DBpedia know-how and the
expert will have two tasks:
1) ensure that the student's application is technically good and, if the
   student gets accepted,
2) periodically (weekly) check his progress during the coding period.

Best,
Dimitris

[1] https://www.google-melange.com/gsoc/events/google/gsoc2014



-- 
Dimitris Kontokostas
Department of Computer Science, University of Leipzig
Research Group: http://aksw.org
Homepage: http://aksw.org/DimitrisKontokostas