>> Here is a simple query that is slow: > PREFIX text: <http://jena.apache.org/text#> > SELECT DISTINCT ?thing ?COUNT WHERE { > graph ?g { > ?thing text:query ( 'Jean-Marc' ) . > } > } ORDER BY DESC(?COUNT) > LIMIT 10 > > > ?COUNT is not bound here
On 05.12.2017 09:20, Jean-Marc Vanel wrote: > 2017-12-04 10:21 GMT+01:00 Andy Seaborne <[email protected]>: > >> Hi - your messages loose the indication of who wrote what. >> > Sorry, I edited all , putting "AS:" and "JMV:" prefixes, > and removing the less important stuff. > > >> AS: >> > >> Maybe it's this part of the query that causes some or all of the costs >> >>> graph ?g1 { >>>> ?thing a ?CLASS . >>>> >>> Unnecessary? > JMV: Yes, but removing it does not change the problem. > > AS: > >> ?g1 and ?CLASS aren't used so this is only some kind of existence test. It >> is a partial cross product on the first part. With the DISTINCT in the >> SELECT there may a very large number of results yet only a few DISTINCT. >> >> JMV: >> >> >> Here is a simple query that is slow: > PREFIX text: <http://jena.apache.org/text#> > SELECT DISTINCT ?thing ?COUNT WHERE { > graph ?g { > ?thing text:query ( 'Jean-Marc' ) . > } > } ORDER BY DESC(?COUNT) > LIMIT 10 > > > >> AS: >> >> >> Remove the "graph ?g {". >>> (and then remove the DISTINCT) >>> >>> *JMV:* >> >> When I do this, the result is empty. >>> The way the text index is initialized must be wrong: >>> https://github.com/jmvanel/semantic_forms/blob/master/scala/ >>> forms/src/main/scala/deductions/runtime/jena/lucene/LuceneIndex.scala >>> > In fact, the getting the graph where where a literal triple belongs is not > relevant in this application. > So, reading again the doc. > https://jena.apache.org/documentation/query/text-query.html#graph-specific-indexing > > I understand that I must deactivate "graph-specific indexing" that is not > not needed, and slows the query. > The doc. does not say so, but I think rebuilding the index is appropriate. > > > > > >>>> JMV: >>> >> Run it with: >>>> time wget -O semantic-forms.cc_select-ui.txt >>>> http://semantic-forms.cc:9112/select-ui?query=PREFIX+text%3A >>>> +%3Chttp%3A%2F%2Fjena.apache.org%2Ftext%23%3E+%0D%0ASELECT+ >>>> DISTINCT+%3Fthing+%3FCOUNT+WHERE+%7B%0D%0A++graph+%3Fg+% >>>> 7B%0D%0A++++%3Fthing+text%3Aquery+%28+%27Jean-Marc%27+% >>>> 29+.%0D%0A++%7D%0D%0A%7D%0D%0AORDER+BY+DESC%28%3FCOUNT%29%0D%0ALIMIT+10 >>>> >>>> Or, if you want to use YasGUI , the endpoint is >>>> http://semantic-forms.cc:9112/sparql >>>> >>>> *Statistics on the database* >>>> >>>> 268 graphs and 588 864 triples. >>>> >>>> # Count graphs and triples >>>> SELECT (COUNT(?s) AS ?trc) (COUNT(?GR) AS ?grc) >>>> WHERE { >>>> { GRAPH ?GR { } } >>>> UNION >>>> { GRAPH ?GR1 { ?s ?p ?o . } } >>>> } >>>> >>>> Result: 2 rows >>>> "grc" "trc" >>>> "268"^^http://www.w3.org/2001/XMLSchema#integer "588864"^^ >>>> http://www.w3.org/2001/XMLSchema#integer >>>> >>>> (I'm not sure this the right way to count, but it gives figures :) ) >>>> >>>> You can reproduce the query with this UI : >>>> >>>> http://semantic-forms.cc:9112/select-ui?query=%23+Count+grap >>>> hs%0D%0ASELECT+%28COUNT%28%3Fs%29+AS+%3Ftrc%29+%28COUNT% >>>> 28%3FGR%29+AS+%3Fgrc%29%0D%0A++++WHERE+%7B%0D%0A+++++%7B+ >>>> GRAPH+%3FGR+%7B+%7D+%7D%0D%0A++UNION%0D%0A++++%7B+GRAPH+% >>>> 3FGR1+%7B+%3Fs+%3Fp+%3Fo+.+%7D+%7D%0D%0A%7D >>>> >>>> This is using Jena 3.5.0. with TDB 1 . >>>> >> Here is a stack made with kill -3 when the app. was working hard; >>>> I put in bold a suspect line. >>>> >>>> "application-akka.actor.default-dispatcher-1351" #2683 prio=5 os_prio=0 >>>> tid=0x00007f07a801c000 nid=0x9b7 runnable [0x00007f06f25ab000] >>>> java.lang.Thread.State: RUNNABLE >>>> at sun.nio.ch.NativeThread.current(Native Method) >>>> at sun.nio.ch.NativeThreadSet.add(NativeThreadSet.java:46) >>>> at sun.nio.ch.FileChannelImpl.readInternal(FileChannelImpl.java:737) >>>> at sun.nio.ch.FileChannelImpl.read(FileChannelImpl.java:727) >>>> at >>>> org.apache.lucene.store.NIOFSDirectory$NIOFSIndexInput. >>>> readInternal(NIOFSDirectory.java:179) >>>> at >>>> org.apache.lucene.store.BufferedIndexInput.refill(BufferedIn >>>> dexInput.java:342) >>>> at >>>> org.apache.lucene.store.BufferedIndexInput.readByte(Buffered >>>> IndexInput.java:54) >>>> at org.apache.lucene.store.DataInput.readInt(DataInput.java:101) >>>> at >>>> org.apache.lucene.store.BufferedIndexInput.readInt(BufferedI >>>> ndexInput.java:183) >>>> at org.apache.lucene.codecs.CodecUtil.checkHeader(CodecUtil.java:194) >>>> at org.apache.lucene.util.fst.FST.<init>(FST.java:327) >>>> at org.apache.lucene.util.fst.FST.<init>(FST.java:313) >>>> at >>>> org.apache.lucene.codecs.blocktree.FieldReader.<init>(FieldR >>>> eader.java:91) >>>> at >>>> org.apache.lucene.codecs.blocktree.BlockTreeTermsReader.< >>>> init>(BlockTreeTermsReader.java:234) >>>> at >>>> org.apache.lucene.codecs.lucene50.Lucene50PostingsFormat.fie >>>> ldsProducer(Lucene50PostingsFormat.java:445) >>>> at >>>> org.apache.lucene.codecs.perfield.PerFieldPostingsFormat$Fie >>>> ldsReader.<init>(PerFieldPostingsFormat.java:292) >>>> at >>>> org.apache.lucene.codecs.perfield.PerFieldPostingsFormat.fie >>>> ldsProducer(PerFieldPostingsFormat.java:372) >>>> at >>>> org.apache.lucene.index.SegmentCoreReaders.<init>(SegmentCor >>>> eReaders.java:112) >>>> at org.apache.lucene.index.SegmentReader.<init>(SegmentReader.java:74) >>>> at >>>> org.apache.lucene.index.StandardDirectoryReader$1.doBody(Sta >>>> ndardDirectoryReader.java:62) >>>> at >>>> org.apache.lucene.index.StandardDirectoryReader$1.doBody(Sta >>>> ndardDirectoryReader.java:54) >>>> at >>>> org.apache.lucene.index.SegmentInfos$FindSegmentsFile.run( >>>> SegmentInfos.java:692) >>>> at >>>> org.apache.lucene.index.StandardDirectoryReader.open(Standar >>>> dDirectoryReader.java:77) >>>> at org.apache.lucene.index.DirectoryReader.open(DirectoryReader.java:63) >>>> at >>>> org.apache.jena.query.text.TextIndexLucene.query(TextIndexLu >>>> cene.java:370) >>>> at org.apache.jena.query.text.TextQueryPF.performQuery(TextQuer >>>> yPF.java:290) >>>> at >>>> org.apache.jena.query.text.TextQueryPF.lambda$query$1(TextQu >>>> eryPF.java:267) >>>> at >>>> org.apache.jena.query.text.TextQueryPF$$Lambda$66/2108167189 >>>> .call(Unknown >>>> Source) >>>> at >>>> org.apache.jena.ext.com.google.common.cache.LocalCache$ >>>> LocalManualCache$1.load(LocalCache.java:5065) >>>> at >>>> org.apache.jena.ext.com.google.common.cache.LocalCache$Loadi >>>> ngValueReference.loadFuture(LocalCache.java:3716) >>>> at >>>> org.apache.jena.ext.com.google.common.cache.LocalCache$ >>>> Segment.loadSync(LocalCache.java:2424) >>>> at >>>> org.apache.jena.ext.com.google.common.cache.LocalCache$ >>>> Segment.lockedGetOrLoad(LocalCache.java:2298) >>>> * - locked <0x00000000ef9d34f8> (a >>>> org.apache.jena.ext.com.google.common.cache.LocalCache$ >>>> StrongAccessEntry)* >>>> >>>> at >>>> org.apache.jena.ext.com.google.common.cache.LocalCache$ >>>> Segment.get(LocalCache.java:2211) >>>> at >>>> org.apache.jena.ext.com.google.common.cache.LocalCache.get( >>>> LocalCache.java:4154) >>>> at >>>> org.apache.jena.ext.com.google.common.cache.LocalCache$ >>>> LocalManualCache.get(LocalCache.java:5060) >>>> at org.apache.jena.atlas.lib.cache.CacheGuava.getOrFill(CacheGu >>>> ava.java:58) >>>> at org.apache.jena.query.text.TextQueryPF.query(TextQueryPF.java:267) >>>> at >>>> org.apache.jena.query.text.TextQueryPF.variableSubject(TextQ >>>> ueryPF.java:227) >>>> at org.apache.jena.query.text.TextQueryPF.exec(TextQueryPF.java:196) >>>> at >>>> org.apache.jena.sparql.pfunction.PropertyFunctionBase$Repeat >>>> ApplyIteratorPF.nextStage(PropertyFunctionBase.java:106) >>>> at >>>> org.apache.jena.sparql.engine.iterator.QueryIterRepeatApply. >>>> makeNextStage(QueryIterRepeatApply.java:108) >>>> at >>>> org.apache.jena.sparql.engine.iterator.QueryIterRepeatApply. >>>> hasNextBinding(QueryIterRepeatApply.java:65) >>>> at >>>> org.apache.jena.sparql.engine.iterator.QueryIteratorBase.has >>>> Next(QueryIteratorBase.java:114) >>>> at >>>> org.apache.jena.sparql.engine.iterator.QueryIterProcedure.ha >>>> sNextBinding(QueryIterProcedure.java:73) >>>> at >>>> org.apache.jena.sparql.engine.iterator.QueryIteratorBase.has >>>> Next(QueryIteratorBase.java:114) >>>> at >>>> org.apache.jena.sparql.engine.iterator.QueryIterProcessBindi >>>> ng.hasNextBinding(QueryIterProcessBinding.java:66) >>>> at >>>> org.apache.jena.sparql.engine.iterator.QueryIteratorBase.has >>>> Next(QueryIteratorBase.java:114) >>>> at >>>> org.apache.jena.sparql.engine.main.iterator.QueryIterGraph$Q >>>> ueryIterGraphInner.hasNextBinding(QueryIterGraph.java:121) >>>> at >>>> org.apache.jena.sparql.engine.iterator.QueryIteratorBase.has >>>> Next(QueryIteratorBase.java:114) >>>> at >>>> org.apache.jena.sparql.engine.iterator.QueryIterRepeatApply. >>>> hasNextBinding(QueryIterRepeatApply.java:74) >>>> at >>>> org.apache.jena.sparql.engine.iterator.QueryIteratorBase.has >>>> Next(QueryIteratorBase.java:114) >>>> at org.apache.jena.atlas.iterator.Iter$2.hasNext(Iter.java:265) >>>> at >>>> org.apache.jena.atlas.iterator.RepeatApplyIterator.hasNext( >>>> RepeatApplyIterator.java:45) >>>> at >>>> org.apache.jena.tdb.solver.SolverLib$IterAbortable.hasNext( >>>> SolverLib.java:195) >>>> at org.apache.jena.atlas.iterator.Iter$2.hasNext(Iter.java:265) >>>> at >>>> org.apache.jena.sparql.engine.iterator.QueryIterPlainWrapper >>>> .hasNextBinding(QueryIterPlainWrapper.java:53) >>>> at >>>> org.apache.jena.sparql.engine.iterator.QueryIteratorBase.has >>>> Next(QueryIteratorBase.java:114) >>>> at >>>> org.apache.jena.sparql.engine.iterator.QueryIterRepeatApply. >>>> makeNextStage(QueryIterRepeatApply.java:101) >>>> at >>>> org.apache.jena.sparql.engine.iterator.QueryIterRepeatApply. >>>> hasNextBinding(QueryIterRepeatApply.java:65) >>>> at >>>> org.apache.jena.sparql.engine.iterator.QueryIteratorBase.has >>>> Next(QueryIteratorBase.java:114) >>>> at >>>> org.apache.jena.sparql.engine.iterator.QueryIterConvert.hasN >>>> extBinding(QueryIterConvert.java:58) >>>> at >>>> org.apache.jena.sparql.engine.iterator.QueryIteratorBase.has >>>> Next(QueryIteratorBase.java:114) >>>> at >>>> org.apache.jena.sparql.engine.iterator.QueryIterTopN$1.initi >>>> alizeIterator(QueryIterTopN.java:98) >>>> at >>>> org.apache.jena.atlas.iterator.IteratorDelayedInitialization >>>> .init(IteratorDelayedInitialization.java:40) >>>> at >>>> org.apache.jena.atlas.iterator.IteratorDelayedInitialization >>>> .hasNext(IteratorDelayedInitialization.java:50) >>>> at >>>> org.apache.jena.sparql.engine.iterator.QueryIterPlainWrapper >>>> .hasNextBinding(QueryIterPlainWrapper.java:53) >>>> at >>>> org.apache.jena.sparql.engine.iterator.QueryIteratorBase.has >>>> Next(QueryIteratorBase.java:114) >>>> at >>>> org.apache.jena.sparql.engine.iterator.QueryIteratorWrapper. >>>> hasNextBinding(QueryIteratorWrapper.java:39) >>>> at >>>> org.apache.jena.sparql.engine.iterator.QueryIteratorBase.has >>>> Next(QueryIteratorBase.java:114) >>>> at >>>> org.apache.jena.sparql.engine.iterator.QueryIteratorWrapper. >>>> hasNextBinding(QueryIteratorWrapper.java:39) >>>> at >>>> org.apache.jena.sparql.engine.iterator.QueryIteratorBase.has >>>> Next(QueryIteratorBase.java:114) >>>> at >>>> org.apache.jena.sparql.engine.iterator.QueryIteratorWrapper. >>>> hasNextBinding(QueryIteratorWrapper.java:39) >>>> at >>>> org.apache.jena.sparql.engine.iterator.QueryIteratorBase.has >>>> Next(QueryIteratorBase.java:114) >>>> at >>>> org.apache.jena.sparql.engine.ResultSetStream.hasNext(Result >>>> SetStream.java:74) >>>> at >>>> org.apache.jena.sparql.engine.ResultSetCheckCondition.hasNex >>>> t(ResultSetCheckCondition.java:55) >>>> at >>>> scala.collection.convert.Wrappers$JIteratorWrapper.hasNext( >>>> Wrappers.scala:42) >>>> at scala.collection.Iterator$class.toStream(Iterator.scala:1320) >>>> at scala.collection.AbstractIterator.toStream(Iterator.scala:1334) >>>> at >>>> scala.collection.TraversableOnce$class.toIterable( >>>> TraversableOnce.scala:296) >>>> at scala.collection.AbstractIterator.toIterable(Iterator.scala:1334) >>>> at >>>> deductions.runtime.sparql_cache.SPARQLHelpers$$anonfun$sparq >>>> lSelectQueryVariablesNT$1.apply(SPARQLHelpers.scala:382) >>>> at >>>> deductions.runtime.sparql_cache.SPARQLHelpers$$anonfun$sparq >>>> lSelectQueryVariablesNT$1.apply(SPARQLHelpers.scala:370) >>>> at deductions.runtime.utils.Timer$class.time(Timer.scala:18) >>>> at controllers.Application$.time(Application.scala:8) >>>> at >>>> deductions.runtime.sparql_cache.SPARQLHelpers$class.sparqlSe >>>> lectQueryVariablesNT(SPARQLHelpers.scala:370) >>>> at >>>> controllers.Application$.sparqlSelectQueryVariablesNT(Applic >>>> ation.scala:8) >>>> at >>>> deductions.runtime.sparql_cache.SPARQLHelpers$$anonfun$8. >>>> apply(SPARQLHelpers.scala:361) >>>> at >>>> deductions.runtime.sparql_cache.SPARQLHelpers$$anonfun$8. >>>> apply(SPARQLHelpers.scala:361) >>>> at >>>> org.w3.banana.jena.JenaDatasetStore$$anonfun$r$1.apply( >>>> JenaDatasetStore.scala:17) >>>> at scala.util.Try$.apply(Try.scala:192) >>>> at org.w3.banana.jena.JenaDatasetStore.r(JenaDatasetStore.scala:14) >>>> at org.w3.banana.jena.JenaDatasetStore.r(JenaDatasetStore.scala:10) >>>> at >>>> deductions.runtime.sparql_cache.SPARQLHelpers$class.sparqlSe >>>> lectQueryVariables(SPARQLHelpers.scala:359) >>>> at controllers.Application$.sparqlSelectQueryVariables(Applicat >>>> ion.scala:8) >>>> at >>>> deductions.runtime.services.Lookup$class.searchStringOrClass >>>> (Lookup.scala:76) >>>> at deductions.runtime.services.Lookup$class.lookup(Lookup.scala:44) >>>> at controllers.Application$.lookup(Application.scala:8) >>>> at controllers.Services$$anonfun$lookupService$1.apply(Services >>>> .scala:199) >>>> at controllers.Services$$anonfun$lookupService$1.apply(Services >>>> .scala:193) >>>> at play.api.mvc.ActionBuilder$$anonfun$apply$13.apply(Action.scala:371) >>>> at play.api.mvc.ActionBuilder$$anonfun$apply$13.apply(Action.scala:370) >>>> at play.api.mvc.Action$.invokeBlock(Action.scala:498) >>>> at play.api.mvc.Action$.invokeBlock(Action.scala:495) >>>> at play.api.mvc.ActionBuilder$$anon$2.apply(Action.scala:458) >>>> at >>>> play.api.mvc.Action$$anonfun$apply$2$$anonfun$apply$5$$anonf >>>> un$apply$6.apply(Action.scala:112) >>>> at >>>> play.api.mvc.Action$$anonfun$apply$2$$anonfun$apply$5$$anonf >>>> un$apply$6.apply(Action.scala:112) >>>> at play.utils.Threads$.withContextClassLoader(Threads.scala:21) >>>> at >>>> play.api.mvc.Action$$anonfun$apply$2$$anonfun$apply$5.apply( >>>> Action.scala:111) >>>> at >>>> play.api.mvc.Action$$anonfun$apply$2$$anonfun$apply$5.apply( >>>> Action.scala:110) >>>> at scala.Option.map(Option.scala:146) >>>> at play.api.mvc.Action$$anonfun$apply$2.apply(Action.scala:110) >>>> at play.api.mvc.Action$$anonfun$apply$2.apply(Action.scala:103) >>>> at scala.concurrent.Future$$anonfun$flatMap$1.apply(Future.scala:253) >>>> at scala.concurrent.Future$$anonfun$flatMap$1.apply(Future.scala:251) >>>> at scala.concurrent.impl.CallbackRunnable.run(Promise.scala:36) >>>> at >>>> akka.dispatch.BatchingExecutor$AbstractBatch.processBatch(Ba >>>> tchingExecutor.scala:55) >>>> at >>>> akka.dispatch.BatchingExecutor$BlockableBatch$$anonfun$run$ >>>> 1.apply$mcV$sp( >>>> BatchingExecutor.scala:91) >>>> at >>>> akka.dispatch.BatchingExecutor$BlockableBatch$$anonfun$run$ >>>> 1.apply(BatchingExecutor.scala:91) >>>> at >>>> akka.dispatch.BatchingExecutor$BlockableBatch$$anonfun$run$ >>>> 1.apply(BatchingExecutor.scala:91) >>>> at scala.concurrent.BlockContext$.withBlockContext(BlockContext >>>> .scala:72) >>>> at >>>> akka.dispatch.BatchingExecutor$BlockableBatch.run(BatchingEx >>>> ecutor.scala:90) >>>> at akka.dispatch.TaskInvocation.run(AbstractDispatcher.scala:39) >>>> at >>>> akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinTask. >>>> exec(AbstractDispatcher.scala:415) >>>> at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260) >>>> at >>>> scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask( >>>> ForkJoinPool.java:1339) >>>> at scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPoo >>>> l.java:1979) >>>> at >>>> scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinW >>>> orkerThread.java:107) >>>> >>>> >>>> >
