Inline:

On 04/04/2020 12:23, Dave Reynolds wrote:
Hi,

On 03/04/2020 15:38, Benjamin Geer wrote:
I’ve been reading the documentation and list archives about Fuseki assembler configurations with TDB and reasoners, and I’m trying to figure out whether the setup I’d like to use is possible. I have three questions:

1. I’d like to use a forward-chaining reasoner to improve query performance with a large TDB dataset by inferring some frequently queried relations. To avoid having to recompute all the inferred triples every time Fuseki is started (which could take a long time), I’d like to persist the inferred triples in TDB as well. Is that possible? I looked for this scenario in the Jena documentation but didn’t find it.

Basically this isn't supported, sorry.

Benjamin,

What complexity of reasoning are you doing?

There is a tradeoff between complexity, performance at scale, and the effort needed.
And the "effort needed" for complexity at scale can be huge.

But an RDFS+ level of complexity could take a different approach from the current rules: essentially, backward chaining (which sees updates) combined with materialized transitive properties.

It might be possible to go a bit further than that. Rules that generate a single triple from a BGP+FILTER(+BIND), together with transitive properties (not written as rules), might be possible.
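For what it's worth, here is a minimal standalone sketch of what such a rule could look like with the generic rule reasoner (in-memory model, invented property names; not a Fuseki configuration, and no claim that this scales). Transitive properties would be handled separately, e.g. by the TransitiveReasoner, rather than written as rules.

import java.util.List;

import org.apache.jena.rdf.model.InfModel;
import org.apache.jena.rdf.model.Model;
import org.apache.jena.rdf.model.ModelFactory;
import org.apache.jena.rdf.model.Property;
import org.apache.jena.rdf.model.RDFNode;
import org.apache.jena.rdf.model.Resource;
import org.apache.jena.reasoner.rulesys.GenericRuleReasoner;
import org.apache.jena.reasoner.rulesys.Rule;

public class SingleTripleRuleSketch {
    public static void main(String[] args) {
        String ns = "http://example.org/#";   // hypothetical vocabulary

        // One rule: two triple patterns plus a builtin test (the "filter"),
        // deriving a single new triple. Written in backward (<-) form.
        String ruleSrc =
            "[adultEmployee: (?p <" + ns + "employedAdult> 'true') <- " +
            "  (?p <" + ns + "age> ?a), (?p <" + ns + "worksFor> ?org), " +
            "  greaterThan(?a, 17) ]";

        List<Rule> rules = Rule.parseRules(ruleSrc);
        GenericRuleReasoner reasoner = new GenericRuleReasoner(rules);
        reasoner.setMode(GenericRuleReasoner.BACKWARD);  // goal-directed, sees updates

        Model base = ModelFactory.createDefaultModel();
        Property age = base.createProperty(ns + "age");
        Property worksFor = base.createProperty(ns + "worksFor");
        Property employedAdult = base.createProperty(ns + "employedAdult");

        Resource alice = base.createResource(ns + "alice");
        alice.addProperty(age, base.createTypedLiteral(30));
        alice.addProperty(worksFor, base.createResource(ns + "acme"));

        // Asking for the derived triple triggers the backward engine.
        InfModel inf = ModelFactory.createInfModel(reasoner, base);
        inf.listStatements(alice, employedAdult, (RDFNode) null)
           .forEachRemaining(System.out::println);
    }
}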

The forward-chaining engine keeps a *lot* of state in memory in the RETE-like network, which means that unless you have very selective patterns in your rules, you can end up with large parts of the data in memory. In the worst cases you can have multiple copies.

This has several implications:

First, it means that it's not scalable. If you have a very large TDB dataset then the reasoner is likely to run out of memory. In addition, the internal format is really not optimised for large-scale data, so inference speed will take a hit.

Second, it means that there's no point persisting the inference results on their own, unless they are static. If, as in your case, you want to continue to add new data and get incremental inferencing then you would need some way to preserve and restore the intermediate state in the engine, which is not supported.

So, given this, there's little point in supporting storage of the deductions graph in TDB, because that doesn't solve the problems of scaling and restart.

2. For queries, I’d like a default graph containing the union of all named graphs plus the inferred statements. Can this be done along with (1)?

The first part can be done manually but not along with (1).

It's possible to use some offline process to generate a static set of inferences (whether using the rule engine or e.g. SPARQL CONSTRUCT queries) into one named graph, keep the base data in another graph, and then have the default graph be the union of them.
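A rough sketch of that offline route at the API level (the TDB path, graph names and the CONSTRUCT query are invented for illustration; in a Fuseki assembler the union would come from tdb:unionDefaultGraph rather than code):

import org.apache.jena.query.Dataset;
import org.apache.jena.query.QueryExecution;
import org.apache.jena.query.QueryExecutionFactory;
import org.apache.jena.query.ReadWrite;
import org.apache.jena.rdf.model.Model;
import org.apache.jena.tdb.TDBFactory;

public class MaterializeInferencesSketch {
    public static void main(String[] args) {
        // Hypothetical TDB location and graph names.
        Dataset ds = TDBFactory.createDataset("/path/to/tdb");

        String construct =
            "PREFIX ex: <http://example.org/#> " +
            "CONSTRUCT { ?x ex:grandparent ?z } " +
            "WHERE { GRAPH <urn:x-arq:UnionGraph> " +
            "  { ?x ex:parent ?y . ?y ex:parent ?z } }";

        // Offline step: compute the inferences and store them in their own named graph.
        ds.begin(ReadWrite.WRITE);
        try {
            Model inferred;
            try (QueryExecution qe = QueryExecutionFactory.create(construct, ds)) {
                inferred = qe.execConstruct();
            }
            ds.getNamedModel("http://example.org/graphs/inferred").add(inferred);
            ds.commit();
        } finally {
            ds.end();
        }

        // Query time: the union of all named graphs now contains the base data
        // plus the materialized inferences.
        ds.begin(ReadWrite.READ);
        try {
            String ask = "ASK { GRAPH <urn:x-arq:UnionGraph> "
                       + "{ ?x <http://example.org/#grandparent> ?z } }";
            try (QueryExecution qe = QueryExecutionFactory.create(ask, ds)) {
                System.out.println(qe.execAsk());
            }
        } finally {
            ds.end();
        }
    }
}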

However, your data isn't static so this doesn't help.

3. The named graphs in the base model need to be continually updated (always using SPARQL quad patterns), and I’d like the reasoner to update its inferences when that happens. After reading some old messages on this list, I think this might not be possible, because if I understand correctly, the only way to update the base model would be via a separate Fuseki service that updates the underlying TDB dataset directly, and in that case, the reasoner won’t see those updates until Fuseki is restarted. Did I understand that correctly, and if so, is it still true?

I thought you could configure Fuseki to have a reasoner as the source model, so that updates go to the reasoner rather than to a base graph. However, given that none of the rest of what you need to do is supported, the point is moot.

Yes, but not for the union default graph. That only exists for query (and the WHERE clause in SPARQL Update); it isn't updatable. If you update the named graphs, the union sees the change, but that bypasses the reasoner on the graph.
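To illustrate that distinction at the model API level (a minimal in-memory sketch with made-up data, not a Fuseki setup): updates made through the inference model are seen by the reasoner, while writes made directly to the underlying graph may only show up after the inference model reconsults its data.

import org.apache.jena.rdf.model.InfModel;
import org.apache.jena.rdf.model.Model;
import org.apache.jena.rdf.model.ModelFactory;
import org.apache.jena.rdf.model.Resource;
import org.apache.jena.vocabulary.RDF;
import org.apache.jena.vocabulary.RDFS;

public class ReasonerUpdateSketch {
    public static void main(String[] args) {
        String ns = "http://example.org/#";
        Model base = ModelFactory.createDefaultModel();
        Resource animal = base.createResource(ns + "Animal");
        Resource dog = base.createResource(ns + "Dog");
        base.add(dog, RDFS.subClassOf, animal);

        // Wrap the base model with the built-in RDFS reasoner.
        InfModel inf = ModelFactory.createRDFSModel(base);

        // Update *through* the inference model: the reasoner sees it.
        Resource rex = inf.createResource(ns + "rex");
        inf.add(rex, RDF.type, dog);
        System.out.println(inf.contains(rex, RDF.type, animal)); // true

        // Update the underlying base model directly: changes may not be
        // picked up until the inference model reconsults its data.
        Resource fido = base.createResource(ns + "fido");
        base.add(fido, RDF.type, dog);
        inf.rebind();
        System.out.println(inf.contains(fido, RDF.type, animal)); // true after rebind()
    }
}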


Sorry to not be able to support your use case.

Dave
