> > From what I've seen so far, yes. But it could depend on your engine > configuration using a richer set of rules.
Same thing happens when we use the default rules set (seo_rules.sem) from SVN. We did not customize any other part of the installation with the exception of loading a local DBpedia index in sling/datafiles. David On Fri, Mar 16, 2012 at 12:27 PM, Alessandro Adamou <[email protected]>wrote: > On 3/16/12 11:16 AM, David Riccitelli wrote: > >> Is this issue happening to us only? >> > > From what I've seen so far, yes. But it could depend on your engine > configuration using a richer set of rules. > > Alessandro > > On Fri, Mar 16, 2012 at 12:12 PM, Alessandro Adamou<[email protected]>** >> wrote: >> >> One thing that it would be great to do is to detect the ontology ID >>> *before* creating the TripleCollection in Clerezza, so any mappings could >>> be done before storing. >>> >>> But I don't know how this can be done with not so much code. >>> >>> Perhaps creating an IndexedGraph, exploring its content, then creating >>> the >>> Graph in the TcManager with the same content and the right graph name, >>> then >>> finally clearing the IndexedGraph could work. >>> >>> But it still means having twice the resource usage (disk+memory) for a >>> period. >>> >>> Alessandro >>> >>> >>> >>> On 3/16/12 10:56 AM, Alessandro Adamou wrote: >>> >>> Hi David, >>>> >>>> well, I guess that depends pretty much on how heavy the usage of OntoNet >>>> is in your Stanbol installation. >>>> >>>> Those are graphs created when OntoNet has to load an ontology from its >>>> content rather than from a Web URI, so it cannot know the ontology ID >>>> earlier. >>>> >>>> This happens e.g. by POSTing the ontology as the payload or by passing a >>>> GraphContentInputSource to the Java API. >>>> >>>> Now I do not know why these graphs are created (perhaps the refactor >>>> engine could be loading some), but I do know that a Clerezza graph in >>>> Jena >>>> TDB occupies a LOT of disk space. >>>> >>>> Suffice it to say that my bundled had stored nine graphs of<100 triples >>>> each. Their disk space was about 1.8 GB, but when I tried to make a >>>> zipfile >>>> out of it, it came out as about 2MB! >>>> >>>> Alessandro >>>> >>>> >>>> On 3/16/12 10:30 AM, David Riccitelli wrote: >>>> >>>> Dears, >>>>> >>>>> As I ran into disk issues, I found that this folder: >>>>> sling/felix/bundleXXX/data/****tdb-data/mgraph >>>>> >>>>> >>>>> where XX is the bundle of: >>>>> Clerezza - SCB Jena TDB Storage Provider >>>>> org.apache.clerezza.rdf.jena.****tdb.storage >>>>> >>>>> >>>>> took almost 70 gbytes of disk space (then the disk space has been >>>>> exhausted). >>>>> >>>>> These are some of the files I found inside: >>>>> 193M ./ontonet%3A%3Ainputstream%****3Aontology889 >>>>> 193M ./ontonet%3A%3Ainputstream%****3Aontology1041 >>>>> 193M ./ontonet%3A%3Ainputstream%****3Aontology395 >>>>> 193M ./ontonet%3A%3Ainputstream%****3Aontology363 >>>>> 193M ./ontonet%3A%3Ainputstream%****3Aontology661 >>>>> 193M ./ontonet%3A%3Ainputstream%****3Aontology786 >>>>> 193M ./ontonet%3A%3Ainputstream%****3Aontology608 >>>>> 193M ./ontonet%3A%3Ainputstream%****3Aontology213 >>>>> 193M ./ontonet%3A%3Ainputstream%****3Aontology188 >>>>> 193M ./ontonet%3A%3Ainputstream%****3Aontology602 >>>>> >>>>> >>>>> Any clues? >>>>> >>>>> Thanks, >>>>> David Riccitelli >>>>> >>>>> **************************************************************** >>>>> ************************ >>>>> >>>>> >>>>> InsideOut10 s.r.l. >>>>> P.IVA: IT-11381771002 >>>>> Fax: +39 0110708239 >>>>> --- >>>>> LinkedIn: >>>>> http://it.linkedin.com/in/****riccitelli<http://it.linkedin.com/in/**riccitelli> >>>>> <http://it.linkedin.**com/in/riccitelli<http://it.linkedin.com/in/riccitelli> >>>>> > >>>>> Twitter: ziodave >>>>> --- >>>>> Layar Partner Network<http://www.layar.com/****<http://www.layar.com/**> >>>>> publishing/developers/list/?****page=1&country=&city=&keyword=**** >>>>> insideout10&lpn=1<http://www.**layar.com/publishing/** >>>>> developers/list/?page=1&**country=&city=&keyword=**insideout10&lpn=1<http://www.layar.com/publishing/developers/list/?page=1&country=&city=&keyword=insideout10&lpn=1> >>>>> > >>>>> **************************************************************** >>>>> ************************ >>>>> >>>>> >>>>> >>>>> >>>> -- >>> M.Sc. Alessandro Adamou >>> >>> Alma Mater Studiorum - Università di Bologna >>> Department of Computer Science >>> Mura Anteo Zamboni 7, 40127 Bologna - Italy >>> >>> Semantic Technology Laboratory (STLab) >>> Institute for Cognitive Science and Technology (ISTC) >>> National Research Council (CNR) >>> Via Nomentana 56, 00161 Rome - Italy >>> >>> >>> "I will give you everything, so long as you do not demand anything." >>> (Ettore Petrolini, 1930) >>> >>> Not sent from my iSnobTechDevice >>> >>> >>> >> > > -- > M.Sc. Alessandro Adamou > > Alma Mater Studiorum - Università di Bologna > Department of Computer Science > Mura Anteo Zamboni 7, 40127 Bologna - Italy > > Semantic Technology Laboratory (STLab) > Institute for Cognitive Science and Technology (ISTC) > National Research Council (CNR) > Via Nomentana 56, 00161 Rome - Italy > > > "I will give you everything, so long as you do not demand anything." > (Ettore Petrolini, 1930) > > Not sent from my iSnobTechDevice > > -- David Riccitelli ******************************************************************************** InsideOut10 s.r.l. P.IVA: IT-11381771002 Fax: +39 0110708239 --- LinkedIn: http://it.linkedin.com/in/riccitelli Twitter: ziodave --- Layar Partner Network<http://www.layar.com/publishing/developers/list/?page=1&country=&city=&keyword=insideout10&lpn=1> ********************************************************************************
