Hi Radhika,
first you might want to try to run the extraction from here:
https://github.com/dbpedia/dbpedia-wiktionary
Just to see how it works. Then move to the Scala source code. In the end
you would need a Webservice + HTML GUI that gets a set of pages and an
XML file (or maybe this can also be broken down in GUI ) and then puts
out the triples.
The difficult thing with this project is to make it very easy to use for
everybody. Also some sort of config management and user collaboration
would be nice or getting the Wiktionary Live extraction running.
Actually, you could also target other Wikis, but Wiktionary one of the
most prominent targets.
Basically, I think you just have to start writing your proposal and read
this here:
http://wiki.dbpedia.org/gsoc2013/apply?v=f4p
Scala is kind of hard to learn because of the syntax. I hope your Python
knowledge helps...
All the best,
Sebastian
Am 22.04.2013 19:56, schrieb Radhika Gaonkar:
Dear Dbpedia developers,
I am Radhika Gaonkar, an undergraduate computer science student at
BITS Pilani , Goa. I am planning to apply to DBpedia for the project
Wikitionary 2 RDF extraction GUI.
I am about to complete a project on recommendation systems for
user's bookmarks. I have used a lot of nlp and machine learning
techniques for this. Specifically talking, tools such as nltk in
python and maximum entropy classifiers.
Currently I am working on developing knowledge graphs using neo4j
in java. Its a small project using wikipedia dumps. I recently started
this and I am studying graph databases for the same. I read the master
thesis paper by Brekle, Jonas. I would love to work in this field . I
checked up on scala and it should take me 2-3 more days to get
familiar with it
I am really late with this, but it will be great if you can help
me get started. I have checked up on the dbpedia pages and I am aware
of the work that you guys are looking for. How exactly should I
approach the entire application process?
Best Regards,
--
Radhika Gaonkar
3rd year B.E. Hons Computer Science
BITS Pilani K. K . Birla Goa Campus
Contact no. || +91 9004753662
------------------------------------------------------------------------------
Precog is a next-generation analytics platform capable of advanced
analytics on semi-structured data. The platform includes APIs for building
apps and a phenomenal toolset for data science. Developers can use
our toolset for easy data analysis & visualization. Get a free account!
http://www2.precog.com/precogplatform/slashdotnewsletter
_______________________________________________
Dbpedia-gsoc mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-gsoc
--
Dipl. Inf. Sebastian Hellmann
Department of Computer Science, University of Leipzig
Projects: http://nlp2rdf.org , http://linguistics.okfn.org ,
http://dbpedia.org/Wiktionary , http://dbpedia.org
Homepage: http://bis.informatik.uni-leipzig.de/SebastianHellmann
Research Group: http://aksw.org
------------------------------------------------------------------------------
Precog is a next-generation analytics platform capable of advanced
analytics on semi-structured data. The platform includes APIs for building
apps and a phenomenal toolset for data science. Developers can use
our toolset for easy data analysis & visualization. Get a free account!
http://www2.precog.com/precogplatform/slashdotnewsletter
_______________________________________________
Dbpedia-gsoc mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-gsoc