Hi Peter, Sergio,

Here is a summary of the status for the first half of the GSoC project:

1. Test data preparation

It is useful to have HDT test files prepared for testing the new parser, but the datasets from [1] are too big for small tests. So I borrowed some examples from the W3C RDF documentation [2] and used the HDT Java implementation to transform example02.rdf to example20.rdf into test02.hdt to test20.hdt, which are now in the code base [3].

2. HDT RDF parser based on the HDT Java implementation

I'm sorry that the project goal was misunderstood during the project proposal period. For the first few weeks of the project, I worked on an HDT RDF parser based on the HDT Java implementation. I also sent an email to legal-discuss@ to clarify the licence issue, but have received no response so far. Anyway, I committed the code [4] in case it is useful in the future.

3. HDT RDF parser from scratch

I've begun to code the HDT RDF parser from scratch. The new parser can now parse the Global Information of HDT files [5]. I'll continue in this direction for the second half of the project.

Yours,
Junyue

[1] http://www.rdfhdt.org/datasets/
[2] https://dvcs.w3.org/hg/rdf/raw-file/default/rdf-xml/index.html
[3] https://github.com/junyuew/marmotta/tree/MARMOTTA-593/commons/marmotta-sesame-tools/marmotta-rio-rdfhdt/src/test/resources/org/apache/marmotta/commons/sesame/rio/rdfhdt
[4] https://github.com/junyuew/marmotta/commit/e4b5d7492f102711c1227f592a36e26353f33812
[5] https://github.com/junyuew/marmotta/commit/a7711b8338aafda9d812f0f2bb98cbde53a7cefa
