Re: huge number of rules for a few RDF statements ?

Dave Reynolds Tue, 10 Sep 2013 00:45:25 -0700

Hi,

On 09/09/13 23:25, [email protected] wrote:

Hi,


I'm considering the Jena Rules as a rule-based programming model
where rules are being discovered and accumulated to grow tens of
thousand, while the fact for inferring new info is only a few
RDF statements. In this case, the rule engine may have to check
each and every rule for the fact to find out the one matching
the statements - which may imply a scaling issue.

Or, should the rules be organized into a set of category, and
the statement is classified first to select the matching rule
set to reduce the rule processing time ?

Will appreciate your insights,

In theory the primary scaling issue in this case should be the number ofdistinct patterns in the rules rather than the number of rules. In RETEthe rules are implemented as a pattern matching network and facts aredropped in.

However, in practice the Jena rules implementation is crude and hasn'tbeen designed or tested on huge numbers of rules. So the network itproduces may be suboptimal (especially if grown incrementally) and thereis no indexing in the cases where one node fans out to a very largenumber of child nodes. Given the simplicity of the Jena implementationthen at least putting the more discriminating patterns at the start ofthe rules is likely to help.

The only way to check if Jena could cope with this would be to run somerepresentative tests.


Dave

Re: huge number of rules for a few RDF statements ?

Reply via email to