Re: [Owlim-discussion] OWLIM-SE 4.3 - first trials - repository reasoner and hard disk directory size questions

Barry Bishop Tue, 20 Mar 2012 07:22:16 -0700

Hello Fabian,

Sorry for the delay, we have been very busy getting ready for OWLIM 5.0.


Let me answer your questions inline:

On 15/03/12 08:42, Fabian Cretton wrote:

Hello,
I've been moving from OWLIM Lite 4.3 to OWLIM-SE 4.3 lately, and started to play with the WordNet ontology. I am doing those first trials on my desktop computer, Windows 7, 8 GB RAM, 3 GB allowed to Tomcat, hoping this small configuration is OK for tests.
About repository configuration:
When doing some pretty heavy SPARQL queries on an ontology such as WordNet, with a ?p variable for properties which could be any of the WordNet properties, is the "Use predicate indices" option of any help? The documentation says "One should consider using this index for datasets that contain a very large number (~1000) different predicates." WordNet has fewer than 100 predicates.
If it does not help, could it make things worse?

Not really, the only downside will be a slight increase in load time and an increase in storage space required. I suggest you try with predicate lists turned on to see if this helps. It will make the most difference when you have queries that have a single triple pattern with an unbound predicate, e.g. SELECT * { ?s ?p "some_object_value" }
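
A minimal sketch of building such a query for an arbitrary object value, handy when timing it with the predicate lists switched on and off. The helper name is hypothetical and the escaping only covers backslashes and double quotes.

```python
def unbound_predicate_query(object_value: str) -> str:
    """Return a SPARQL query with a single triple pattern whose predicate is unbound."""
    # Escape backslashes and double quotes so the value stays a valid SPARQL literal.
    escaped = object_value.replace("\\", "\\\\").replace('"', '\\"')
    return 'SELECT * WHERE { ?s ?p "%s" }' % escaped


print(unbound_predicate_query("some_object_value"))
# prints: SELECT * WHERE { ?s ?p "some_object_value" }
```

Running the same query against two otherwise identical repositories, one with predicate lists enabled and one without, is probably the simplest way to see whether the option matters for your data.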

About the rule-set:
I first created the repository with no rule-set. I did load WordNet with, of course, no inference. Then, using SPARQL Update, I changed the repository rule-set to "owl-horst-optimized". As expected, nothing happened, as the inferences are carried out at load time. Then I reloaded all the files but couldn't find any inferred triples. I had to delete and recreate the repository with the rule-set from the start, and then the inferences were done. Any idea if I might have done something wrong, or if there is something to be careful about?

Changing the rule-set is always troublesome and I recommend not doing so if at all possible. A problem with using a Tomcat/HTTP repository is that dropping a repository does not actually delete the storage files. So if you then change the configuration and reload with the same repository id, it will confuse the inferencer, because it will see that each statement is already loaded and assume that no inference needs to be done. I suspect this is what is happening in your case, so an easy check would be to drop the repository, recreate it with the new config and then immediately clear it. After that I would expect things to work correctly.
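
The "immediately clear it" step could be sketched as a SPARQL Update sent over HTTP. This assumes the usual Sesame 2 convention of posting an `update` form parameter to the repository's `statements` resource. The server URL and repository id below are hypothetical placeholders, and the request is only built here, not sent.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Hypothetical values: adjust to your own Tomcat/Sesame deployment.
SERVER = "http://localhost:8080/openrdf-sesame"
REPOSITORY_ID = "wordnet"


def clear_request(server: str, repository_id: str) -> Request:
    """Build (but do not send) a SPARQL Update request that empties a repository."""
    body = urlencode({"update": "CLEAR ALL"}).encode("utf-8")
    return Request(
        url=f"{server}/repositories/{repository_id}/statements",
        data=body,
        headers={"Content-Type": "application/x-www-form-urlencoded"},
        method="POST",
    )


request = clear_request(SERVER, REPOSITORY_ID)
print(request.get_method(), request.full_url)
# To actually run it: urllib.request.urlopen(request)
```

Run it right after recreating the repository with the new rule-set and before loading any data, so that the reload starts from empty storage.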

About inferred triples:
If I remember well, OWLIM 3.5 had an 'implicit' context where I could find all inferred triples. Is it correct that this no longer exists in version 4.3? (I saw in the user guide how to query the explicit/implicit triples; it is just a question for understanding.)


This feature is still present in 4.3:

http://owlim.ontotext.com/display/OWLIMv43/OWLIM-SE+Query+Behaviour#OWLIM-SEQueryBehaviour-ManagingExplicitandImplicitStatements

In reality, inferred statements belong to the database's default graph. However, they can be filtered by using some special graph names. We call these pseudo-graphs, because they are not really graphs, rather just a means to switch on/off certain query answering behaviour.
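
A rough sketch of counting only explicit or only inferred statements through the pseudo-graphs. The two graph names below are written from memory of the page linked above and should be checked against it before use; the helper name is hypothetical.

```python
# Assumed pseudo-graph names; verify them against the documentation page.
EXPLICIT = "http://www.ontotext.com/explicit"
IMPLICIT = "http://www.ontotext.com/implicit"


def count_query(pseudo_graph: str) -> str:
    """Return a SPARQL query counting the statements visible through one pseudo-graph."""
    return (
        "SELECT (COUNT(*) AS ?n)\n"
        f"FROM <{pseudo_graph}>\n"
        "WHERE { ?s ?p ?o }"
    )


print(count_query(IMPLICIT))
```

Comparing the two counts before and after a reload is a quick way to confirm whether the rule-set actually produced any inferences.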

Then I have a question about what happens on disk:
I had my repository loaded with WordNet T-Box + A-Box. Then I changed the ontology with Protégé, adding a new property as sub-property of all the WordNet properties. Then, to update my T-Box, I simply loaded the new file in the same context, without first removing the old one. Is that the correct way to do it?

Probably best to delete the T-Box first, otherwise there will be some overlap and possible consistency problems.
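
A minimal sketch of that "delete first, then load" sequence as one SPARQL Update request. It assumes the T-Box lives in its own named graph, separate from the A-Box (otherwise the CLEAR would remove the A-Box too). The graph IRI and file URL are hypothetical.

```python
def replace_tbox_update(graph_iri: str, tbox_url: str) -> str:
    """Return a SPARQL Update that empties the T-Box graph and reloads it from a file."""
    return (
        f"CLEAR SILENT GRAPH <{graph_iri}> ;\n"
        f"LOAD <{tbox_url}> INTO GRAPH <{graph_iri}>"
    )


# Hypothetical names, for illustration only.
print(replace_tbox_update("http://example.org/wordnet/tbox",
                          "file:///data/wordnet-tbox.rdf"))
```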

My size on disk did of course increase. My triple count rose from about 5 million to 6 million. Then I removed the T-Box, which took 2 hours on my desktop computer (I guess that is to be expected), and the triple count dropped from 6 million to 2 million (no more inference). But what is unexpected here is that I would have thought the hard drive storage folder would also come back to the original size without inference; on the contrary, it still increased a little. So a repository with no inference was about 150 MB. With inferences it is around 600 MB. And after doing those manipulations and removing the inferences it was around 900 MB (but I was expecting around 150 MB). At first glance, I think "pso" and "pos" were much bigger than the "clean" ones.

I think in this case you just have a lot of unused pages in your index files. I'm not sure if there is any advantage for OWLIM to offer a clean-up function, i.e. compact pages in use to the beginning of the index files and truncate the rest.


At least it is not something that anyone has asked for before.
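
To illustrate the idea only, here is a toy model of such a clean-up, with an index file represented as a list of pages. It is not OWLIM code and says nothing about how the real index files are laid out. A real implementation would also have to rewrite any pointers between pages.

```python
from typing import List, Optional

Page = Optional[bytes]  # None marks a page that is allocated on disk but unused


def compact(pages: List[Page]) -> List[Page]:
    """Move pages still in use to the front and drop the unused ones."""
    return [page for page in pages if page is not None]


index_file = [b"A", None, b"B", None, None, b"C"]
compacted = compact(index_file)
print(len(index_file), "->", len(compacted))
# prints: 6 -> 3
```

Without such a step, deleting statements only frees pages inside the file, which is why the folder does not shrink after the inferences are removed.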

Thank you once again for any help
Fabian


I hope that helps. All the best,
barry

_______________________________________________
Owlim-discussion mailing list
[email protected]
http://ontomail.semdata.org/cgi-bin/mailman/listinfo/owlim-discussion
