Hi Peter, Sergio,

I'm here to summarize the status for the first-half part of the GSoC
project:

1. Test data preparation
It's useful to have test data of hdt files prepared for testing the new
parser. But the dataset from [1] are too big for small tests. So I borrowed
some examples from W3C RDF documentation [2]. I used HDT java
implementation to transform example02.rdf~20.rdf into test02.hdt~20.hdt in
the code base [3]

2. HDT RDF parser based on HDT java implementation
I'm sorry that the project goal was misunderstood during the project
proposal period. In the first few weeks of the project, I was devoted to
code the HDT RDF parser based on HDT java implementation. I also sent email
to legal-discuss@, for clarifying the licence issue, but no response showed
up until now. Anyway, I committed the code [4], in case it may be useful in
future.

3. HDT RDF parser from scratch
I've began to code the HDT RDF parser from scratch. Now the new parser can
parse the Global Information of the hdt files [5]. I'll continue in this
way for the next half-part of the project.

yours,
Junyue

[1] http://www.rdfhdt.org/datasets/
[2] https://dvcs.w3.org/hg/rdf/raw-file/default/rdf-xml/index.html
[3]
https://github.com/junyuew/marmotta/tree/MARMOTTA-593/commons/marmotta-sesame-tools/marmotta-rio-rdfhdt/src/test/resources/org/apache/marmotta/commons/sesame/rio/rdfhdt
[4]
https://github.com/junyuew/marmotta/commit/e4b5d7492f102711c1227f592a36e26353f33812
[5]
https://github.com/junyuew/marmotta/commit/a7711b8338aafda9d812f0f2bb98cbde53a7cefa

Reply via email to