Hi Radhika,
first you might want to try to run the extraction from here: https://github.com/dbpedia/dbpedia-wiktionary Just to see how it works. Then move to the Scala source code. In the end you would need a Webservice + HTML GUI that gets a set of pages and an XML file (or maybe this can also be broken down in GUI ) and then puts out the triples.

The difficult thing with this project is to make it very easy to use for everybody. Also some sort of config management and user collaboration would be nice or getting the Wiktionary Live extraction running. Actually, you could also target other Wikis, but Wiktionary one of the most prominent targets.

Basically, I think you just have to start writing your proposal and read this here:
http://wiki.dbpedia.org/gsoc2013/apply?v=f4p

Scala is kind of hard to learn because of the syntax. I hope your Python knowledge helps...

All the best,
Sebastian

Am 22.04.2013 19:56, schrieb Radhika Gaonkar:

Dear Dbpedia developers,

I am Radhika Gaonkar, an undergraduate computer science student at BITS Pilani , Goa. I am planning to apply to DBpedia for the project Wikitionary 2 RDF extraction GUI.

I am about to complete a project on recommendation systems for user's bookmarks. I have used a lot of nlp and machine learning techniques for this. Specifically talking, tools such as nltk in python and maximum entropy classifiers.

Currently I am working on developing knowledge graphs using neo4j in java. Its a small project using wikipedia dumps. I recently started this and I am studying graph databases for the same. I read the master thesis paper by Brekle, Jonas. I would love to work in this field . I checked up on scala and it should take me 2-3 more days to get familiar with it

I am really late with this, but it will be great if you can help me get started. I have checked up on the dbpedia pages and I am aware of the work that you guys are looking for. How exactly should I approach the entire application process?


Best Regards,
--
Radhika Gaonkar
3rd year B.E. Hons Computer Science
BITS Pilani K. K . Birla Goa Campus
Contact no. || +91 9004753662




------------------------------------------------------------------------------
Precog is a next-generation analytics platform capable of advanced
analytics on semi-structured data. The platform includes APIs for building
apps and a phenomenal toolset for data science. Developers can use
our toolset for easy data analysis & visualization. Get a free account!
http://www2.precog.com/precogplatform/slashdotnewsletter


_______________________________________________
Dbpedia-gsoc mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-gsoc


--
Dipl. Inf. Sebastian Hellmann
Department of Computer Science, University of Leipzig
Projects: http://nlp2rdf.org , http://linguistics.okfn.org , http://dbpedia.org/Wiktionary , http://dbpedia.org
Homepage: http://bis.informatik.uni-leipzig.de/SebastianHellmann
Research Group: http://aksw.org
------------------------------------------------------------------------------
Precog is a next-generation analytics platform capable of advanced
analytics on semi-structured data. The platform includes APIs for building
apps and a phenomenal toolset for data science. Developers can use
our toolset for easy data analysis & visualization. Get a free account!
http://www2.precog.com/precogplatform/slashdotnewsletter
_______________________________________________
Dbpedia-gsoc mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-gsoc

Reply via email to