Re: RDFConnection

Andy Seaborne Sun, 09 Aug 2015 08:55:22 -0700

Here is a summary/draft to try to pull the discussions together:

There would be three main interfaces: one application facing and two forthe two SPARQL protocols.


(all names provisional!)

== RDFConnection

* The application facing API

*  RDFConnectionFactory (name?) to make the things.

* It builds on the two SPARQL protocols.

* Autocommit provided (client-side)

* No mention of QueryExecution

Results processed in a style that means RDFConnection gets to manage theresult set. Probably also one operation to execute and copy the resultsbecause this is a recurring support area. Tryi to get smart and passaroudn a stream uis

* Composed of SPARQLProtocolConnection (query+update) andSPARQLGraphStoreProtocol

* It is easier to add operations than remove them esp from aapplication=facing API like RDFConnection so start cautious. There couldbe useful compound operations or ones applicable only sometimes, but fornow, roughly a 1-1 match to a SPARQL operation.


= SPARQLProtocol
* The operations of Query, Update

* Explicit transactions only (client-side)

= SPARQLGraphStoreProtocol

* DatasetAccessor with renaming to make it clear how the operationsrefer to HTTP operations - we might as well call them gspGET, gspPOST,gspPUT, gspDELETE or something like that and have RDFConnection havetask focused names (e.g. loadFile, addModel, ...)


* Deprecate DatasetAccessor (and DatasetGraphAccessor?)

* Explicit transactions only (client-side)

== Notes/Questions:

* loadFile can be done two ways - GSP and INSERT DATA (GSP is moreefficient - but if no GSP handlers is available, it can switch to SPARQlProtocol means.

* Is bulk delete a major requirement (more a question of how much todesign for it - not whether to have it or not e.g. may be only inSPARQLProtocol).

* QueryExecution (or indeed the QueryStatement version) is going to be aproblem because calling the operation only sets it up, not actuallyexecutes it. And I would like to avoid, at least to the applicationgetting into excessive nesting of try-with-resources. That is valuablefor RDFConnection itself.


JDBC has statement objects for some reasons that don't apply to Jena:

Prepared statements (server side) and parameterised queries (Jena hasdifferent mechanisms - may show in RDFConnection and happen client side)



* non-HTTP remote connection in the future.

Always tricky to plan for the unknown! I think we should be aware thismay happen but not worry too much now (I used to put in designs for allpossibilities but looking back, they never end up right and do createlegacy baggage all too easily.)


== QueryExecution

(For SPARQLProtocol, not RDFConnection)

I'd like to refactor this and not create a new, separate interface thatthat does the same thing. So at a minimum, a super interface for exec*

With more change, one option is to remove (via deprecation cycle)get/set initial binding and require providing it at the factory step.

The getDataset and getQuery operations are more convenience - we couldremove or continue to document they may return null. As the querycarries prefixes information for presentation, retaining getQuery makessense to me. Remove getDataset?


        Andy

Re: RDFConnection

Reply via email to