Dear Mentors,
This is an updated report of my idea and plan on project "5.10 DBpedia Metadata
Datasets".
DBpedia extracts information from Wikipedia by inforboxes, then, add semantic
annotations to each page by using RDF data model and ontologies. RDF statements
use classes and properties to describe resources, and these classes and
properties are defined in some given ontologies. However, in the early versions
of DBpedia's extractor, it did not make significant use of RDF, and some even
extracted without using any ontology while parsing the infobox.
The plan of my work could be divided into two major parts.
Part A,
Firstly, I will collect information about different annotation models, such as
RDF, OWL, Wikidata model, RDF* (defined in Foundations of an Alternative
Approach to Reification in RDF) and so on. Then, I will compare and analyze the
pros and cons of each annotations.
Part B,
Build an extension for DBpedia extraction framework. The purpose of this
extension is to allow DBpedia extraction framework to extract metadata by the
selected annotation models (from RDF, RDF*, OWL and so on.)
I cloned the git repo from https://github.com/dbpedia/extraction-framework.git,
ran the extractor on the local server, and started inspecting the code. The
goal of project looks clear and intuitive. I do work fine with XML, Scala, Java
and PHP.
If you have any further questions, please let me know. Thank you.
Best,
Mingzhe Du
------------------------------------------------------------------------------
One dashboard for servers and applications across Physical-Virtual-Cloud
Widest out-of-the-box monitoring support with 50+ applications
Performance metrics, stats and reports that give you Actionable Insights
Deep dive visibility with transaction tracing using APM Insight.
http://ad.doubleclick.net/ddm/clk/290420510;117567292;y
_______________________________________________
Dbpedia-gsoc mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-gsoc