Re: SDB to TDB transition

Rob Vesse Thu, 10 Apr 2014 15:57:28 -0700

Hi David

So the first think to point out which may make things tricky for you is
that TDB can only be accessed from a single JVM at a single time.  This is
due to the fact that  it uses memory mapped files, journaling and caching
so if you try and use it from multiple JVMs you are almost guaranteed to
corrupt your data.

However the workaround for this is to introduce Fuseki into your
architecture as your database server.  Fuseki can serve up TDB datasets
and clients then access them over HTTP.

If you use the standard APIs for remote query
(QueryExecutionFactory.sparqlService()), remote update
(UpdateExecutionFactory.createRemote()) and remote graph access
(DatasetAccessorFactory.createHTTP()) then your application can actually
be refactored to be agnostic of the backend and you can always switch out
Fuseki+TDB for another SPARQL compliant server if necessary.

With Fuseki+TDB being able to query the union graph can either be done at
the server configuration level or by specifying a magic graph URI in your
queries - <urn:x-arq:UnionGraph> though continuing to use this feature
will make it harder to migrate off Fuseki+TDB should you ever need to
since this feature goes beyond standard SPARQL.

I don¹t see why you can¹t keep your interaction roughly the same as you
have it now you would merely need to change the underlying implementation.
 Of course if your operations are mostly coarse-grained I.e. in terms of
graphs then the DatasetAccessor API may actually cover much of what you
need which would make your implementation fairly trivial.

Hope this is enough to get you started, please feel free to ask further
questions or for clarifications as you explore this,

Cheers,

Rob

On 10/04/2014 12:09, "Lebling, David (US SSA)"
<[email protected]> wrote:

>I am looking at finally biting the bullet and transitioning from SDB to
>TDB. The first step is to come up with a level-of-effort estimate to see
>if this fits in our budget.
>
>We are using Jena 2.11.0 and SDB 1.4.0. We have a set of five web
>services (which can be in separate JVMs) that use SDB as an OWL storage
>device. The items  stored are named graphs. These are read, modified in
>memory (sometimes through inference, sometimes though pure Java code
>adding and removing and modifying statements), and written back out. We
>also use SPARQL queries on the union graph to find graphs of interest.
>Although currently there are a fairly small number of these named graphs,
>we want to be able to expand the system to hold a much larger number. One
>of the stumbling blocks with SDB was a bug in its multi-JVM concurrency
>code that wasn't fixed due to lack of SDB support.
>
>All interactions with the SDB database are through a single class which
>implements an interface with read, write, delete, find, etc. on named
>graphs and open and close on the database itself.
>
>Any advice on how to go about architecting and implementing a TDB version
>of the above would be appreciated. More details can be supplied if
>needed, of course.
>
>Thanks,
>
>Dave Lebling
>

Re: SDB to TDB transition

Reply via email to