(GSoC 2015: MARMOTTA-593) Project Midterm Report

Junyue Wang Sat, 27 Jun 2015 07:18:23 -0700

Hi Peter, Sergio,

I'm here to summarize the status for the first-half part of the GSoC
project:


1. Test data preparation
It's useful to have test data of hdt files prepared for testing the new
parser. But the dataset from [1] are too big for small tests. So I borrowed
some examples from W3C RDF documentation [2]. I used HDT java
implementation to transform example02.rdf~20.rdf into test02.hdt~20.hdt in
the code base [3]

2. HDT RDF parser based on HDT java implementation
I'm sorry that the project goal was misunderstood during the project
proposal period. In the first few weeks of the project, I was devoted to
code the HDT RDF parser based on HDT java implementation. I also sent email
to legal-discuss@, for clarifying the licence issue, but no response showed
up until now. Anyway, I committed the code [4], in case it may be useful in
future.

3. HDT RDF parser from scratch
I've began to code the HDT RDF parser from scratch. Now the new parser can
parse the Global Information of the hdt files [5]. I'll continue in this
way for the next half-part of the project.

yours,
Junyue

[1] http://www.rdfhdt.org/datasets/
[2] https://dvcs.w3.org/hg/rdf/raw-file/default/rdf-xml/index.html
[3]
https://github.com/junyuew/marmotta/tree/MARMOTTA-593/commons/marmotta-sesame-tools/marmotta-rio-rdfhdt/src/test/resources/org/apache/marmotta/commons/sesame/rio/rdfhdt
[4]
https://github.com/junyuew/marmotta/commit/e4b5d7492f102711c1227f592a36e26353f33812
[5]
https://github.com/junyuew/marmotta/commit/a7711b8338aafda9d812f0f2bb98cbde53a7cefa

(GSoC 2015: MARMOTTA-593) Project Midterm Report

Reply via email to