Re: [topbraid-users] SM Script Runs slower and slower and slower

Gokhan Soydan Fri, 16 Mar 2012 12:46:43 -0700

Tim,

If you are using an sml:IterateWhile module, then at the end of each iteration, an ApplyByConstruct module with the query:


CONSTRUCT { ?s ?p ?o } WHERE { ?s ?p ?o }

and with the "sml:replace" value set to "true" might be useful. This flattens the ever-deepening nesting of Jena graph objects by copying the triples from the nested graph objects into a single graph object.
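
To illustrate why flattening can help, here is a toy Python model of the idea. It is only a sketch under stated assumptions: the `Union` class, `flatten` and `run` are hypothetical names made up for this illustration, and the model does not reproduce how Jena or SPARQLMotion actually implement nested graphs. It assumes each loop iteration wraps the previous result in one more layer, so a lookup may have to walk through every layer unless the triples are copied into a single one.

```python
class Union:
    """Toy stand-in for a nested graph: a local triple set layered over a base.

    Hypothetical class for illustration only; not a Jena API.
    """

    def __init__(self, base=None):
        self.base = base      # another Union, or None for the innermost layer
        self.local = set()    # triples added at this layer

    def add(self, triple):
        self.local.add(triple)

    def contains(self, triple):
        """Return (found, layers_visited) for a lookup."""
        layer, hops = self, 0
        while layer is not None:
            hops += 1
            if triple in layer.local:
                return True, hops
            layer = layer.base
        return False, hops

    def triples(self):
        """Yield the triples of every layer, outermost first."""
        layer = self
        while layer is not None:
            yield from layer.local
            layer = layer.base


def flatten(graph):
    """Copy all triples into one fresh layer with no base.

    Conceptually what CONSTRUCT { ?s ?p ?o } WHERE { ?s ?p ?o } with
    replace=true is described as doing in this message.
    """
    flat = Union()
    for triple in graph.triples():
        flat.add(triple)
    return flat


def run(iterations, flatten_each_time):
    """Simulate a loop that wraps the previous result on every iteration."""
    first = ("ex:first", "ex:p", "ex:o")
    graph = Union()
    graph.add(first)
    for i in range(iterations):
        graph = Union(base=graph)                  # one more level of nesting
        graph.add((f"ex:s{i}", "ex:p", "ex:o"))
        if flatten_each_time:
            graph = flatten(graph)                 # back to a single layer
    return graph.contains(first)


if __name__ == "__main__":
    print(run(20, flatten_each_time=False))  # lookup walks all nested layers
    print(run(20, flatten_each_time=True))   # lookup touches one layer
```

In this toy model the number of layers visited grows with the iteration count when nothing is flattened, and stays constant when the result is flattened at the end of each iteration.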

If you are using sml:IterateOverSelect, you probably won't need this module, because it flattens the nested graph objects in each loop automatically, but you can try.

You may also try this module in other places - for example just before entering the loop.


Gokhan


On 3/16/2012 12:35 PM, Tim Smith wrote:

Hi,
I'm attempting to process ~250 XML files into RDF. I created a schema for the files using XMLSpy and imported the schema into TBC using the XSD importer. This created two .ttl files.
I created an SM script that iterates over the files using tops:files via a bind by select module. Prior to the Bind by Select, I import the schema ontologies and my target ontology. In the body, I import each XML file, convert it to RDF and then run a series of CONSTRUCT queries to map each file into the target ontology. The combination of all triples generated is then saved to disk.
The script works fine if I only run through a small number of files. However, if I try to hit all 250 at once, it just runs slower and slower and slower... The slow part seems to be the CONSTRUCT queries. They run fast initially but slow significantly after 10-20 files. For every file that I have manually tested by running the CONSTRUCT query in the SPARQL view, the query has always run very fast so I do not know why performance is so poor running as an SM script.
Any suggestions? Are there things I can do to speed this along? Is there data that I can collect to better inform you?
My current work around is to process each directory individually but even that hits the problem because some directories have 10's of files (not to mention the obvious hassle of changing the script - file names, base URIs, etc... for each directory)
I'm using 3.6B on win7/64 with 5G allocated to the JVM.

Thanks,

Tim

--
You received this message because you are subscribed to the Google
Group "TopBraid Suite Users", the topics of which include Enterprise Vocabulary Network (EVN), TopBraid Composer,
TopBraid Live, TopBraid Ensemble, SPARQLMotion and SPIN.
To post to this group, send email to
[email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/topbraid-users?hl=en


