2017-12-04 10:21 GMT+01:00 Andy Seaborne <[email protected]>:
> Hi - your messages lose the indication of who wrote what.
>
Sorry, I edited everything, adding "AS:" and "JMV:" prefixes
and removing the less important parts.
> AS:
>
> Maybe it's this part of the query that causes some or all of the costs
>
> > graph ?g1 {
> >> ?thing a ?CLASS .
> >>
> >
> > Unnecessary?
>
JMV: Yes, but removing it does not change the problem.
AS:
>
> ?g1 and ?CLASS aren't used so this is only some kind of existence test. It
> is a partial cross product on the first part. With the DISTINCT in the
> SELECT there may be a very large number of results yet only a few distinct ones.
>
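To make the point about DISTINCT concrete, here is a toy model in Python. It is only a sketch: the names and counts are invented for illustration and are not taken from the database. Each ?thing is joined with every (?g1, ?CLASS) pair that types it, so the intermediate result is large even though few distinct ?thing values survive.

```python
# Toy model of the join "?thing text:query ..." with "graph ?g1 { ?thing a ?CLASS }".
# All names and sizes below are invented for illustration.
things = ["thing1", "thing2", "thing3"]

# Assume each thing is typed 200 times across graphs and classes.
type_bindings = {
    thing: [(f"g{i}", f"Class{i % 10}") for i in range(200)]
    for thing in things
}

# The join produces one row per (thing, g1, CLASS) combination.
rows = [
    (thing, g1, cls)
    for thing in things
    for (g1, cls) in type_bindings[thing]
]

# DISTINCT over ?thing alone collapses the rows again,
# but only after all of them have been produced.
distinct_things = {thing for thing, _, _ in rows}

print(len(rows), "rows before DISTINCT,", len(distinct_things), "after")
```

The work done is proportional to the number of rows before DISTINCT, not to the number of results returned.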
> JMV:
>
>
> Here is a simple query that is slow:
>>
>
PREFIX text: <http://jena.apache.org/text#>
SELECT DISTINCT ?thing ?COUNT WHERE {
  graph ?g {
    ?thing text:query ( 'Jean-Marc' ) .
  }
}
ORDER BY DESC(?COUNT)
LIMIT 10
> AS:
>
>
> Remove the "graph ?g {".
>>
>> (and then remove the DISTINCT)
>>
>> *JMV:*
>
>
> When I do this, the result is empty.
>>
>> The way the text index is initialized must be wrong:
>> https://github.com/jmvanel/semantic_forms/blob/master/scala/forms/src/main/scala/deductions/runtime/jena/lucene/LuceneIndex.scala
>>
>
In fact, knowing which graph a literal triple belongs to is not
relevant in this application.
So, reading the doc again:
https://jena.apache.org/documentation/query/text-query.html#graph-specific-indexing
I understand that I must deactivate "graph-specific indexing", which is
not needed and slows the query.
The doc does not say so, but I think rebuilding the index is appropriate.
>>> JMV:
>>
>>
> Run it with:
>>> time wget -O semantic-forms.cc_select-ui.txt 'http://semantic-forms.cc:9112/select-ui?query=PREFIX+text%3A+%3Chttp%3A%2F%2Fjena.apache.org%2Ftext%23%3E+%0D%0ASELECT+DISTINCT+%3Fthing+%3FCOUNT+WHERE+%7B%0D%0A++graph+%3Fg+%7B%0D%0A++++%3Fthing+text%3Aquery+%28+%27Jean-Marc%27+%29+.%0D%0A++%7D%0D%0A%7D%0D%0AORDER+BY+DESC%28%3FCOUNT%29%0D%0ALIMIT+10'
>>>
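If the long URL gets mangled by mail line wrapping, it can be rebuilt from the readable query. The following is a minimal Python sketch using only the standard library; the endpoint and the "query" parameter name come from the URL above, while the helper name build_url is just an illustration.

```python
from urllib.parse import urlencode, urlsplit, parse_qs

# Endpoint taken from the wget command; the query is the slow one from this thread.
ENDPOINT = "http://semantic-forms.cc:9112/select-ui"

QUERY = "\r\n".join([
    "PREFIX text: <http://jena.apache.org/text#>",
    "SELECT DISTINCT ?thing ?COUNT WHERE {",
    "  graph ?g {",
    "    ?thing text:query ( 'Jean-Marc' ) .",
    "  }",
    "}",
    "ORDER BY DESC(?COUNT)",
    "LIMIT 10",
])


def build_url(endpoint: str, sparql: str) -> str:
    """Hypothetical helper: form-encode the query into a GET URL."""
    return endpoint + "?" + urlencode({"query": sparql})


url = build_url(ENDPOINT, QUERY)

# Round trip: decoding the URL should give back the original query text.
decoded = parse_qs(urlsplit(url).query)["query"][0]

print(url)
```

The resulting URL can be passed, quoted, to wget or curl; whitespace may differ slightly from the URL above, which should not matter to the SPARQL parser.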
>>> Or, if you want to use YasGUI , the endpoint is
>>> http://semantic-forms.cc:9112/sparql
>>>
>>> *Statistics on the database*
>>>
>>> 268 graphs and 588 864 triples.
>>>
>>> # Count graphs and triples
>>> SELECT (COUNT(?s) AS ?trc) (COUNT(?GR) AS ?grc)
>>> WHERE {
>>> { GRAPH ?GR { } }
>>> UNION
>>> { GRAPH ?GR1 { ?s ?p ?o . } }
>>> }
>>>
>>> Result: 2 rows
>>> "grc" "trc"
>>> "268"^^http://www.w3.org/2001/XMLSchema#integer "588864"^^http://www.w3.org/2001/XMLSchema#integer
>>>
>>> (I'm not sure this is the right way to count, but it gives figures :) )
>>>
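On whether this is a valid way to count: as far as I understand SPARQL aggregates, COUNT(?x) only counts solutions where ?x is bound, so the two UNION branches do not interfere with each other. A small Python model of that reasoning, with invented quads (not data from the store):

```python
# Toy dataset of (graph, subject, predicate, object) quads; invented for illustration.
quads = [
    ("g1", "s1", "p", "o1"),
    ("g1", "s2", "p", "o2"),
    ("g2", "s3", "p", "o3"),
]

# Branch 1: GRAPH ?GR { }  -> one solution per named graph, binding only ?GR.
graph_rows = [{"GR": g} for g in sorted({q[0] for q in quads})]

# Branch 2: GRAPH ?GR1 { ?s ?p ?o }  -> one solution per quad, binding ?GR1 ?s ?p ?o.
triple_rows = [{"GR1": g, "s": s, "p": p, "o": o} for (g, s, p, o) in quads]

# UNION concatenates both solution sequences.
rows = graph_rows + triple_rows

# COUNT(?s) and COUNT(?GR) skip solutions where the variable is unbound.
trc = sum(1 for row in rows if "s" in row)
grc = sum(1 for row in rows if "GR" in row)

print("grc =", grc, "trc =", trc)
```

Under that assumption, ?trc is the number of triples in named graphs and ?grc the number of named graphs.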
>>> You can reproduce the query with this UI :
>>>
>>> http://semantic-forms.cc:9112/select-ui?query=%23+Count+graphs%0D%0ASELECT+%28COUNT%28%3Fs%29+AS+%3Ftrc%29+%28COUNT%28%3FGR%29+AS+%3Fgrc%29%0D%0A++++WHERE+%7B%0D%0A+++++%7B+GRAPH+%3FGR+%7B+%7D+%7D%0D%0A++UNION%0D%0A++++%7B+GRAPH+%3FGR1+%7B+%3Fs+%3Fp+%3Fo+.+%7D+%7D%0D%0A%7D
>>>
>>> This is using Jena 3.5.0 with TDB 1.
>>>
>>
> Here is a stack trace taken with kill -3 while the app was working hard;
>>> I put a suspect line in bold.
>>>
>>> "application-akka.actor.default-dispatcher-1351" #2683 prio=5 os_prio=0 tid=0x00007f07a801c000 nid=0x9b7 runnable [0x00007f06f25ab000]
>>> java.lang.Thread.State: RUNNABLE
>>> at sun.nio.ch.NativeThread.current(Native Method)
>>> at sun.nio.ch.NativeThreadSet.add(NativeThreadSet.java:46)
>>> at sun.nio.ch.FileChannelImpl.readInternal(FileChannelImpl.java:737)
>>> at sun.nio.ch.FileChannelImpl.read(FileChannelImpl.java:727)
>>> at org.apache.lucene.store.NIOFSDirectory$NIOFSIndexInput.readInternal(NIOFSDirectory.java:179)
>>> at org.apache.lucene.store.BufferedIndexInput.refill(BufferedIndexInput.java:342)
>>> at org.apache.lucene.store.BufferedIndexInput.readByte(BufferedIndexInput.java:54)
>>> at org.apache.lucene.store.DataInput.readInt(DataInput.java:101)
>>> at org.apache.lucene.store.BufferedIndexInput.readInt(BufferedIndexInput.java:183)
>>> at org.apache.lucene.codecs.CodecUtil.checkHeader(CodecUtil.java:194)
>>> at org.apache.lucene.util.fst.FST.<init>(FST.java:327)
>>> at org.apache.lucene.util.fst.FST.<init>(FST.java:313)
>>> at org.apache.lucene.codecs.blocktree.FieldReader.<init>(FieldReader.java:91)
>>> at org.apache.lucene.codecs.blocktree.BlockTreeTermsReader.<init>(BlockTreeTermsReader.java:234)
>>> at org.apache.lucene.codecs.lucene50.Lucene50PostingsFormat.fieldsProducer(Lucene50PostingsFormat.java:445)
>>> at org.apache.lucene.codecs.perfield.PerFieldPostingsFormat$FieldsReader.<init>(PerFieldPostingsFormat.java:292)
>>> at org.apache.lucene.codecs.perfield.PerFieldPostingsFormat.fieldsProducer(PerFieldPostingsFormat.java:372)
>>> at org.apache.lucene.index.SegmentCoreReaders.<init>(SegmentCoreReaders.java:112)
>>> at org.apache.lucene.index.SegmentReader.<init>(SegmentReader.java:74)
>>> at org.apache.lucene.index.StandardDirectoryReader$1.doBody(StandardDirectoryReader.java:62)
>>> at org.apache.lucene.index.StandardDirectoryReader$1.doBody(StandardDirectoryReader.java:54)
>>> at org.apache.lucene.index.SegmentInfos$FindSegmentsFile.run(SegmentInfos.java:692)
>>> at org.apache.lucene.index.StandardDirectoryReader.open(StandardDirectoryReader.java:77)
>>> at org.apache.lucene.index.DirectoryReader.open(DirectoryReader.java:63)
>>> at org.apache.jena.query.text.TextIndexLucene.query(TextIndexLucene.java:370)
>>> at org.apache.jena.query.text.TextQueryPF.performQuery(TextQueryPF.java:290)
>>> at org.apache.jena.query.text.TextQueryPF.lambda$query$1(TextQueryPF.java:267)
>>> at org.apache.jena.query.text.TextQueryPF$$Lambda$66/2108167189.call(Unknown Source)
>>> at org.apache.jena.ext.com.google.common.cache.LocalCache$LocalManualCache$1.load(LocalCache.java:5065)
>>> at org.apache.jena.ext.com.google.common.cache.LocalCache$LoadingValueReference.loadFuture(LocalCache.java:3716)
>>> at org.apache.jena.ext.com.google.common.cache.LocalCache$Segment.loadSync(LocalCache.java:2424)
>>> at org.apache.jena.ext.com.google.common.cache.LocalCache$Segment.lockedGetOrLoad(LocalCache.java:2298)
>>> * - locked <0x00000000ef9d34f8> (a org.apache.jena.ext.com.google.common.cache.LocalCache$StrongAccessEntry)*
>>> at org.apache.jena.ext.com.google.common.cache.LocalCache$Segment.get(LocalCache.java:2211)
>>> at org.apache.jena.ext.com.google.common.cache.LocalCache.get(LocalCache.java:4154)
>>> at org.apache.jena.ext.com.google.common.cache.LocalCache$LocalManualCache.get(LocalCache.java:5060)
>>> at org.apache.jena.atlas.lib.cache.CacheGuava.getOrFill(CacheGuava.java:58)
>>> at org.apache.jena.query.text.TextQueryPF.query(TextQueryPF.java:267)
>>> at org.apache.jena.query.text.TextQueryPF.variableSubject(TextQueryPF.java:227)
>>> at org.apache.jena.query.text.TextQueryPF.exec(TextQueryPF.java:196)
>>> at org.apache.jena.sparql.pfunction.PropertyFunctionBase$RepeatApplyIteratorPF.nextStage(PropertyFunctionBase.java:106)
>>> at org.apache.jena.sparql.engine.iterator.QueryIterRepeatApply.makeNextStage(QueryIterRepeatApply.java:108)
>>> at org.apache.jena.sparql.engine.iterator.QueryIterRepeatApply.hasNextBinding(QueryIterRepeatApply.java:65)
>>> at org.apache.jena.sparql.engine.iterator.QueryIteratorBase.hasNext(QueryIteratorBase.java:114)
>>> at org.apache.jena.sparql.engine.iterator.QueryIterProcedure.hasNextBinding(QueryIterProcedure.java:73)
>>> at org.apache.jena.sparql.engine.iterator.QueryIteratorBase.hasNext(QueryIteratorBase.java:114)
>>> at org.apache.jena.sparql.engine.iterator.QueryIterProcessBinding.hasNextBinding(QueryIterProcessBinding.java:66)
>>> at org.apache.jena.sparql.engine.iterator.QueryIteratorBase.hasNext(QueryIteratorBase.java:114)
>>> at org.apache.jena.sparql.engine.main.iterator.QueryIterGraph$QueryIterGraphInner.hasNextBinding(QueryIterGraph.java:121)
>>> at org.apache.jena.sparql.engine.iterator.QueryIteratorBase.hasNext(QueryIteratorBase.java:114)
>>> at org.apache.jena.sparql.engine.iterator.QueryIterRepeatApply.hasNextBinding(QueryIterRepeatApply.java:74)
>>> at org.apache.jena.sparql.engine.iterator.QueryIteratorBase.hasNext(QueryIteratorBase.java:114)
>>> at org.apache.jena.atlas.iterator.Iter$2.hasNext(Iter.java:265)
>>> at org.apache.jena.atlas.iterator.RepeatApplyIterator.hasNext(RepeatApplyIterator.java:45)
>>> at org.apache.jena.tdb.solver.SolverLib$IterAbortable.hasNext(SolverLib.java:195)
>>> at org.apache.jena.atlas.iterator.Iter$2.hasNext(Iter.java:265)
>>> at org.apache.jena.sparql.engine.iterator.QueryIterPlainWrapper.hasNextBinding(QueryIterPlainWrapper.java:53)
>>> at org.apache.jena.sparql.engine.iterator.QueryIteratorBase.hasNext(QueryIteratorBase.java:114)
>>> at org.apache.jena.sparql.engine.iterator.QueryIterRepeatApply.makeNextStage(QueryIterRepeatApply.java:101)
>>> at org.apache.jena.sparql.engine.iterator.QueryIterRepeatApply.hasNextBinding(QueryIterRepeatApply.java:65)
>>> at org.apache.jena.sparql.engine.iterator.QueryIteratorBase.hasNext(QueryIteratorBase.java:114)
>>> at org.apache.jena.sparql.engine.iterator.QueryIterConvert.hasNextBinding(QueryIterConvert.java:58)
>>> at org.apache.jena.sparql.engine.iterator.QueryIteratorBase.hasNext(QueryIteratorBase.java:114)
>>> at org.apache.jena.sparql.engine.iterator.QueryIterTopN$1.initializeIterator(QueryIterTopN.java:98)
>>> at org.apache.jena.atlas.iterator.IteratorDelayedInitialization.init(IteratorDelayedInitialization.java:40)
>>> at org.apache.jena.atlas.iterator.IteratorDelayedInitialization.hasNext(IteratorDelayedInitialization.java:50)
>>> at org.apache.jena.sparql.engine.iterator.QueryIterPlainWrapper.hasNextBinding(QueryIterPlainWrapper.java:53)
>>> at org.apache.jena.sparql.engine.iterator.QueryIteratorBase.hasNext(QueryIteratorBase.java:114)
>>> at org.apache.jena.sparql.engine.iterator.QueryIteratorWrapper.hasNextBinding(QueryIteratorWrapper.java:39)
>>> at org.apache.jena.sparql.engine.iterator.QueryIteratorBase.hasNext(QueryIteratorBase.java:114)
>>> at org.apache.jena.sparql.engine.iterator.QueryIteratorWrapper.hasNextBinding(QueryIteratorWrapper.java:39)
>>> at org.apache.jena.sparql.engine.iterator.QueryIteratorBase.hasNext(QueryIteratorBase.java:114)
>>> at org.apache.jena.sparql.engine.iterator.QueryIteratorWrapper.hasNextBinding(QueryIteratorWrapper.java:39)
>>> at org.apache.jena.sparql.engine.iterator.QueryIteratorBase.hasNext(QueryIteratorBase.java:114)
>>> at org.apache.jena.sparql.engine.ResultSetStream.hasNext(ResultSetStream.java:74)
>>> at org.apache.jena.sparql.engine.ResultSetCheckCondition.hasNext(ResultSetCheckCondition.java:55)
>>> at scala.collection.convert.Wrappers$JIteratorWrapper.hasNext(Wrappers.scala:42)
>>> at scala.collection.Iterator$class.toStream(Iterator.scala:1320)
>>> at scala.collection.AbstractIterator.toStream(Iterator.scala:1334)
>>> at scala.collection.TraversableOnce$class.toIterable(TraversableOnce.scala:296)
>>> at scala.collection.AbstractIterator.toIterable(Iterator.scala:1334)
>>> at deductions.runtime.sparql_cache.SPARQLHelpers$$anonfun$sparqlSelectQueryVariablesNT$1.apply(SPARQLHelpers.scala:382)
>>> at deductions.runtime.sparql_cache.SPARQLHelpers$$anonfun$sparqlSelectQueryVariablesNT$1.apply(SPARQLHelpers.scala:370)
>>> at deductions.runtime.utils.Timer$class.time(Timer.scala:18)
>>> at controllers.Application$.time(Application.scala:8)
>>> at deductions.runtime.sparql_cache.SPARQLHelpers$class.sparqlSelectQueryVariablesNT(SPARQLHelpers.scala:370)
>>> at controllers.Application$.sparqlSelectQueryVariablesNT(Application.scala:8)
>>> at deductions.runtime.sparql_cache.SPARQLHelpers$$anonfun$8.apply(SPARQLHelpers.scala:361)
>>> at deductions.runtime.sparql_cache.SPARQLHelpers$$anonfun$8.apply(SPARQLHelpers.scala:361)
>>> at org.w3.banana.jena.JenaDatasetStore$$anonfun$r$1.apply(JenaDatasetStore.scala:17)
>>> at scala.util.Try$.apply(Try.scala:192)
>>> at org.w3.banana.jena.JenaDatasetStore.r(JenaDatasetStore.scala:14)
>>> at org.w3.banana.jena.JenaDatasetStore.r(JenaDatasetStore.scala:10)
>>> at deductions.runtime.sparql_cache.SPARQLHelpers$class.sparqlSelectQueryVariables(SPARQLHelpers.scala:359)
>>> at controllers.Application$.sparqlSelectQueryVariables(Application.scala:8)
>>> at deductions.runtime.services.Lookup$class.searchStringOrClass(Lookup.scala:76)
>>> at deductions.runtime.services.Lookup$class.lookup(Lookup.scala:44)
>>> at controllers.Application$.lookup(Application.scala:8)
>>> at controllers.Services$$anonfun$lookupService$1.apply(Services.scala:199)
>>> at controllers.Services$$anonfun$lookupService$1.apply(Services.scala:193)
>>> at play.api.mvc.ActionBuilder$$anonfun$apply$13.apply(Action.scala:371)
>>> at play.api.mvc.ActionBuilder$$anonfun$apply$13.apply(Action.scala:370)
>>> at play.api.mvc.Action$.invokeBlock(Action.scala:498)
>>> at play.api.mvc.Action$.invokeBlock(Action.scala:495)
>>> at play.api.mvc.ActionBuilder$$anon$2.apply(Action.scala:458)
>>> at play.api.mvc.Action$$anonfun$apply$2$$anonfun$apply$5$$anonfun$apply$6.apply(Action.scala:112)
>>> at play.api.mvc.Action$$anonfun$apply$2$$anonfun$apply$5$$anonfun$apply$6.apply(Action.scala:112)
>>> at play.utils.Threads$.withContextClassLoader(Threads.scala:21)
>>> at play.api.mvc.Action$$anonfun$apply$2$$anonfun$apply$5.apply(Action.scala:111)
>>> at play.api.mvc.Action$$anonfun$apply$2$$anonfun$apply$5.apply(Action.scala:110)
>>> at scala.Option.map(Option.scala:146)
>>> at play.api.mvc.Action$$anonfun$apply$2.apply(Action.scala:110)
>>> at play.api.mvc.Action$$anonfun$apply$2.apply(Action.scala:103)
>>> at scala.concurrent.Future$$anonfun$flatMap$1.apply(Future.scala:253)
>>> at scala.concurrent.Future$$anonfun$flatMap$1.apply(Future.scala:251)
>>> at scala.concurrent.impl.CallbackRunnable.run(Promise.scala:36)
>>> at akka.dispatch.BatchingExecutor$AbstractBatch.processBatch(BatchingExecutor.scala:55)
>>> at akka.dispatch.BatchingExecutor$BlockableBatch$$anonfun$run$1.apply$mcV$sp(BatchingExecutor.scala:91)
>>> at akka.dispatch.BatchingExecutor$BlockableBatch$$anonfun$run$1.apply(BatchingExecutor.scala:91)
>>> at akka.dispatch.BatchingExecutor$BlockableBatch$$anonfun$run$1.apply(BatchingExecutor.scala:91)
>>> at scala.concurrent.BlockContext$.withBlockContext(BlockContext.scala:72)
>>> at akka.dispatch.BatchingExecutor$BlockableBatch.run(BatchingExecutor.scala:90)
>>> at akka.dispatch.TaskInvocation.run(AbstractDispatcher.scala:39)
>>> at akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinTask.exec(AbstractDispatcher.scala:415)
>>> at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
>>> at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339)
>>> at scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
>>> at scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)
>>>
>>>
>>>
>>
--
Jean-Marc Vanel
http://www.semantic-forms.cc:9111/display?displayuri=http://jmvanel.free.fr/jmv.rdf%23me#subject
<http://www.semantic-forms.cc:9111/display?displayuri=http://jmvanel.free.fr/jmv.rdf%23me>
Déductions SARL - Consulting, services, training,
Rule-based programming, Semantic Web
+33 (0)6 89 16 29 52
Twitter: @jmvanel , @jmvanel_fr ; chat: irc://irc.freenode.net#eulergui