Re: SPARQL vs Jena rules

baran . ha Thu, 31 Aug 2017 06:23:49 -0700

On Thu, 31 Aug 2017 09:48:51 +0200, Dave Reynolds<[email protected]> wrote:

On 30/08/17 15:10, [email protected] wrote:
PS: I wonder why Dave doesn't comment in this thread. Perhaps becausehe thinks, Lorenz is ok, i myself cannot stand the low-level-knowledgeof the users in this thread or no matter what you do, by some heavydata-input an app with InfModel would hang anyway? Lorenz is ofcourseok, but i 'guess' Jena users are also very curious about Dave'scomments...
I didn't comment on this thread because, as Andy has already pointedout, this seems to be a repeat of a recent similar thread (on which Idid comment). That in turn was a near repeat of another similar thread.All from the same group.
Also I think Lorenz has covered it all, with admirable patience.
However, in an attempt to clarify the trade-offs in more depth maybe thefollowing would be helpful:
When comparing a rule system against a set of SPARQL Update queriesthere are several factors that affect the trade-offs including (1) thespecific nature of the rules/queries, (2) the data flow and, (3)preferences on syntax and machinery.
1a. For a single (forward) Jena rule then you can always achieve thesame with a SPARQL Update. For a single set of input data then SPARQLhas the benefit of being a standard [1] and offering better performanceover a store like TDB. Conversely SPARQL is much richer than Jena rulesso there are things that you could achieve with a single SPARQL Updatequery using, say, property paths that would require multiple Jena rules.
1b. If you have a set of rules, but they don't create loops/recursion,then you can "stratify" them into groups of rules than can be run oneafter the other. In that case, for a single set of input data, thenagain you can implement it as a sequence of SPARQL Updates with similarbenefits.
1c. If your rules can't be stratified, i.e. one rule can indirectlytrigger itself, then it's more complex. In that case you would have toe.g. run the set of SPARQL Updates repeatedly until nothing new isdeduced. Depending on the specifics of the rules and the data that maybe quite expensive and you would be better off with something Jenarules. However, in some cases you may be able to use things like SPARQLproperty paths to achieve the desired effect without have to recurse.
2. If you have a single data set and just want to run your rules on itthen the above applies. If you are repeatedly adding new data and wantto keep your deductions up to date then the Jena forward rules enginehas the advantage that it keeps all the partial matches around. Soaddition of one more triple may cause a rule to fire without it havingto search for all the other triples in the body. This is also why"recursive" rules work relatively efficiently.
This doesn't apply if you delete data. In that case Jena rules have tostart over and can't reuse state across data deletions.
If you keeping changing your data but very rarely ask questions of it,and then only limited questions, then Jena back rules have advantages.The backward engine will only run the rules needed for the specificquery. If that's a lot fewer than the overall rules then that should becheaper than running a full forward deduction using SPARQL Updates. Inthis situation it may be possible to achieve the same effects throughSPARQL query (not update) by query rewriting but that's a wholedifferent ball game.
3. With Jena rules you have some prebuilt machinery for running therules (InfGraphs and all that) and some support for externalizing therules in separate files. With SPARQL you have to create all that (thoughit's easy) and you have a nicer syntax.
So fundamentally, like all "X vs Y" questions it depends on thespecifics of what you are trying to do.
Dave
[1] There is a standard for rules, RIF, but it is not aimed atparticularly RDF processing and post-dates Jena rules.


************

Hello, thanks for the response, i hope very much some other Jena.usershave also had their conclusions.


My personal project-dependent view to

'Jena rules vs SPARQL queries'

was as described in my first posting to this thread:

'Offline', with Jena Rules, to prepare the input data for tdbloder so,that i can 'simplfy' some queries of my Query-UI with a better performancefor my Fuseki-endusers:

A Jena-app with InfModel + Rules-List -> output RDF -> TDB/Fuseki -> AQuery-UI

-------------------------------------------------------------------------------

Further assume Rules-List contains forward rules and only for thispurpose...


And i don't update TDB data with any triples, it remains constant.

And now i see:

1a. For a single (forward) Jena rule then you can always achieve thesame with a SPARQL Update.

This, in this theory-clearness, is really new to me, thanks again, in mybackhead i have had it vice versa. So i can can give now the alternativedev-scenerio:


A Jena-app with SPARQL Updates -> output RDF -> TDB/Fuseki -> A Query-UI
------------------------------------------------------------------------

It remaines developer-dependent, sometimes someone can formulate somesimple rules without making thoughts about harebrained SPARQL Updates...

Ok, i think this 2 dev-scenarios together can eventually be accepted togive a try, hmm, as i understand Dave, or?

And now, in offline-prepairing of the tdbloader-input for betterenduser-queries, our 'secondary' problem, using Jena-Rules OR SPARQLUpdate?


As a simple example to close this with a concrete thing:

@prefix skos: <http://www.w3.org/2004/02/skos/core#> .

[myrules5: (?x skos:subject ?y), (?y skos:broader ?z) -> (?x skos:subject?z)]


What would you give as the equivalent SPARQL Update?

Thank you very much,

baran.

--
Using Opera's mail client: http://www.opera.com/mail/

Re: SPARQL vs Jena rules

Reply via email to