[ 
https://issues.apache.org/jira/browse/MARMOTTA-593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Peter Ansell updated MARMOTTA-593:
----------------------------------
    Description: 
[RDF HDT|http://www.rdfhdt.org] is a compact data structure and binary 
serialization format for RDF that keeps big datasets compressed to save space 
while maintaining search and browse operations without prior decompression. 
This makes it an ideal format for storing and sharing RDF datasets on the Web.

Currently the [Java 
Implementation|http://www.rdfhdt.org/manual-of-the-java-hdt-library/] only 
provides bindings for jena RIOT, with a license that does not enable it to be 
integrated into the main Sesame codebase, or any Apache codebase.

The idea consist on implementing an Apache licensed implementation of RDF HDT 
from scratch and support the [Sesame 
RIO|http://rdf4j.org/sesame/2.8/apidocs/org/openrdf/rio/Rio.html] 
infrastructure (RDFParser/RDFWriter/RDFHandler). 

The implementation would require to have good knowledge of Java programming, 
plus some basic understanding of parsers concepts and the RDF and HDT data 
models.

  was:
[RDF HDT|http://www.rdfhdt.org] is a compact data structure and binary 
serialization format for RDF that keeps big datasets compressed to save space 
while maintaining search and browse operations without prior decompression. 
This makes it an ideal format for storing and sharing RDF datasets on the Web.

Currently the [Java 
Implementation|http://www.rdfhdt.org/manual-of-the-java-hdt-library/] only 
provides bindings for jena RIOT, with a very restrictive license (GPL).

The idea consist on implementing from scratch the [Sesame 
RIO|http://rdf4j.org/sesame/2.8/apidocs/org/openrdf/rio/Rio.html] 
infrastructure (RDFParser/RDFWriter/RDFHandler) for RDF HDT. 

The implementation would require to have good knowledge of Java programming, 
plus some basic understanding of parsers concepts and the RDF data model.


> RDF HDT implementation for Sesame RIO
> -------------------------------------
>
>                 Key: MARMOTTA-593
>                 URL: https://issues.apache.org/jira/browse/MARMOTTA-593
>             Project: Marmotta
>          Issue Type: Task
>          Components: KiWi Triple Store
>            Reporter: Sergio Fernández
>              Labels: gsoc, gsoc2015, hdt, java, linkeddata, rdf, sesame
>   Original Estimate: 480h
>  Remaining Estimate: 480h
>
> [RDF HDT|http://www.rdfhdt.org] is a compact data structure and binary 
> serialization format for RDF that keeps big datasets compressed to save space 
> while maintaining search and browse operations without prior decompression. 
> This makes it an ideal format for storing and sharing RDF datasets on the Web.
> Currently the [Java 
> Implementation|http://www.rdfhdt.org/manual-of-the-java-hdt-library/] only 
> provides bindings for jena RIOT, with a license that does not enable it to be 
> integrated into the main Sesame codebase, or any Apache codebase.
> The idea consist on implementing an Apache licensed implementation of RDF HDT 
> from scratch and support the [Sesame 
> RIO|http://rdf4j.org/sesame/2.8/apidocs/org/openrdf/rio/Rio.html] 
> infrastructure (RDFParser/RDFWriter/RDFHandler). 
> The implementation would require to have good knowledge of Java programming, 
> plus some basic understanding of parsers concepts and the RDF and HDT data 
> models.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to