[ 
https://issues.apache.org/jira/browse/JENA-329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13716373#comment-13716373
 ] 

Andy Seaborne commented on JENA-329:
------------------------------------

The need for this came up recently when an app was doing a large 
transformation of RDF using CONSTRUCT, with a result larger than the 
available memory.  DistinctDataNet is one possible implementation (it is 
effectively a sliding window; by "chunked" I just mean the scope of 
uniqueness), and a direct stream is another.
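
To illustrate what a sliding-window scope of uniqueness means, here is a 
minimal sketch in plain Java.  It is not Jena's DistinctDataNet and makes no 
claim about how that class works; WindowedDistinct, windowSize and toList are 
hypothetical names chosen for this example, and strings stand in for triples.  
A repeat is suppressed only while the earlier copy is still inside the window, 
so memory stays bounded at the cost of strict uniqueness.

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.NoSuchElementException;

/**
 * Hypothetical helper (not part of Jena): wraps an iterator and drops any item
 * that is still among the last windowSize distinct items already emitted.
 */
class WindowedDistinct<T> implements Iterator<T> {
    private final Iterator<T> source;
    private final Map<T, Boolean> window;
    private T nextItem;
    private boolean hasNextItem;

    WindowedDistinct(Iterator<T> source, final int windowSize) {
        this.source = source;
        // Insertion-ordered map that evicts its oldest entry once it grows
        // beyond windowSize, which bounds the memory used for de-duplication.
        this.window = new LinkedHashMap<T, Boolean>() {
            @Override
            protected boolean removeEldestEntry(Map.Entry<T, Boolean> eldest) {
                return size() > windowSize;
            }
        };
        advance(); // look ahead one item so hasNext() is cheap
    }

    /** Reads from the source until an item not currently in the window is found. */
    private void advance() {
        hasNextItem = false;
        while (source.hasNext()) {
            T candidate = source.next();
            // put() returns null only when the item was not in the window.
            if (window.put(candidate, Boolean.TRUE) == null) {
                nextItem = candidate;
                hasNextItem = true;
                return;
            }
        }
    }

    @Override
    public boolean hasNext() {
        return hasNextItem;
    }

    @Override
    public T next() {
        if (!hasNextItem) {
            throw new NoSuchElementException();
        }
        T result = nextItem;
        advance();
        return result;
    }

    /** Convenience for small inputs: drains an iterator into a list. */
    static <T> List<T> toList(Iterator<T> it) {
        List<T> out = new ArrayList<T>();
        while (it.hasNext()) {
            out.add(it.next());
        }
        return out;
    }
}
```

With a window of 2, the input a, b, a, c, b, a lets the final "a" through 
again because its earlier copy has already been evicted; with a window at 
least as large as the number of distinct items, the output is fully distinct.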

I prefer to treat this as a weakening of how the data is delivered rather 
than a change to the query itself (cf. QueryExecution.execConstructTriples).  
While an RDF graph is a set of triples, there is no strict need to make the 
returned data triple-unique: it is not uncommon for serialized RDF syntax to 
contain repeated triples.
                
> Add streaming CONSTRUCT results to Fuseki
> -----------------------------------------
>
>                 Key: JENA-329
>                 URL: https://issues.apache.org/jira/browse/JENA-329
>             Project: Apache Jena
>          Issue Type: Improvement
>          Components: Fuseki
>            Reporter: Stephen Allen
>
> As a result of JENA-205, streaming results are now available for CONSTRUCT 
> queries.  However there can be duplicate triples in the iterator.  This task 
> is to allow Fuseki to stream back results, while at the same time performing 
> a distinct operation.
> The fix would be to modify SPARQL_Query to use 
> QueryExecution.execConstructTriples() and filter the results through a 
> DistinctDataNet<Triple> as they are being streamed back to the client.
> This also requires RDFWriter implementations that can accept Iterator<Triple> 
> instead of Model.

