Hello Huseyin, welcome to the project. Have a look at the Warm up tasks [1]. Best way to start for this task is setting up the extraction framework and get it to run. Go for the dump extraction as a starter. There is an extra module for DBpedia Live, but it demands some additional efforts (and credentials) to get updates from Wikipedia. Admittedly, the Documentation is not the very best, so we might need to explain things as you proceed.
Best regards Magnus [1] https://github.com/dbpedia/extraction-framework/wiki/Warm-up-tasks [2] https://github.com/dbpedia/extraction-framework/wiki/DBpedia-live Am 03.03.2015 um 18:31 schrieb Hüseyin Zengin <[email protected]>: > Hi, > I am a computer engineering student, 3rd year. I have experience with Scala, > C++, Python, PHP, Java. I also have experience with MongoDB, Elastic Search > etc. > I am interested in contributing DBpedia under GSoC program. > The idea "5.8. DBpedia Live scaling & new interface" really fits to my skills > and past works. > > Can you help me to dive into DBpedia with the this project? > > BTW, It is really good to see Scala projects at this scale on GSoC > -- > Huseyin ZENGIN > ------------------------------------------------------------------------------ > Dive into the World of Parallel Programming The Go Parallel Website, sponsored > by Intel and developed in partnership with Slashdot Media, is your hub for all > things parallel software development, from weekly thought leadership blogs to > news, videos, case studies, tutorials and more. Take a look and join the > conversation now. > http://goparallel.sourceforge.net/_______________________________________________ > Dbpedia-gsoc mailing list > [email protected] > https://lists.sourceforge.net/lists/listinfo/dbpedia-gsoc -- Magnus Knuth Hasso-Plattner-Institut für Softwaresystemtechnik GmbH Prof.-Dr.-Helmert-Str. 2-3 14482 Potsdam Amtsgericht Potsdam, HRB 12184 Geschäftsführung: Prof. Dr. Christoph Meinel tel: +49 331 5509 547 email: [email protected] web: http://www.hpi.de/ webID: http://magnus.13mm.de/ ------------------------------------------------------------------------------ Dive into the World of Parallel Programming The Go Parallel Website, sponsored by Intel and developed in partnership with Slashdot Media, is your hub for all things parallel software development, from weekly thought leadership blogs to news, videos, case studies, tutorials and more. Take a look and join the conversation now. http://goparallel.sourceforge.net/ _______________________________________________ Dbpedia-gsoc mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/dbpedia-gsoc
