Re: listIndividual method taking too much time to return

Dave Reynolds Sun, 13 Sep 2015 02:57:08 -0700

On 12/09/15 19:33, Maria Clementina Latanzi wrote:

Hi all,


I'm working with Jena. I have an ontology with no more than 50
individuals, and I use Jena to
get individuals from Ontology by
calling*listIndividual*(/*com.hp.hpl.jena.ontology.OntModel.listIndividuals*/).
When I call this method, it's taking a lot of time up to 20 seconds.
When debugging, it takes more than 1 minute to return. Other methods
like *listClass *return instantly.

I found a stackoverflow question relevant to my problem but there was no
answer. This makes me think that the problem might be general for other
people and not just an issue with my ontology. Here's the link:

http://stackoverflow.com/questions/27645110/method-listindividual-takes-more-than-15-mins-with-dbpedia-2014-owl-2mb-siz#

I'm wondering if you could help me figure out why does it take so long
to run. Any ideas? Is this normal and expected behavior because of the
size of the ontology?


Short answer:

Use an explicit OntModelSpec when you create the ontology and use eitherOWL_MEM (if you don't need any inference) or OWL_MEM_MICRO_RULE_INF (ifyou do). On your example I get:


    No OntModelSpec:        1800ms
    OWL_MEM                   11ms
    OWL_MEM_MICRO_RULE_INF:    1ms

[I'm running on Java 7, with Jena2.]

Long answer:

listIndividuals is quite tricky for a tool like Jena which is quite RDFcentric but does support OWL.

If we know the ontology is supposed to be OWL *and* we are working witha sufficiently capable reasoner then we can just list the things of typeowl:Thing and trust the reasoner to figure things out. That's the placewhere listIndividuals makes sense.

If we are or might be in RDFS land then testing whether a resource is aindividual is tricky because there is no such separation. In order to beuseful listIndividuals/isIndividual tries to eliminate things that areexplicitly classes or properties, and then looks for things which areinstances of known types. That's a useful heuristic approach whichpeople find useful though it has no formal justification - in RDFSeverything is an individual. However, it's expensive. There was a timewhen listIndividuals tried to improve things but it proved too costly tomaintain so these days listIndividuals takes the brute force approach ofapplying the isIndividual test to every resource.

The problem is that the default OntModelSpec if you create an ontologywith no explicit spec has RDFS inference. That is a good default choicefor a wide range of uses but a real problem for listIndividuals. Ittells us we can't rely on there being an owl:Thing-aware reasoner, so wehave to use the expensive heuristic. But is also means there is anactual reasoner there which slows things down.

Hence the more efficient alternatives are to either have no reasoner orto have an OWL reasoner. Or, if you really want RDFS, then don't uselistIndividuals in the first place!

[Aside: this is probably a question for the users list rather that thedev list.]


Dave

Re: listIndividual method taking too much time to return

Reply via email to