Forgot to say that I'm using Jena 3.3.0 on Ubuntu 17.04 , and
java -version
java version "1.8.0_121"
Java(TM) SE Runtime Environment (build 1.8.0_121-b13)
Java HotSpot(TM) 64-Bit Server VM (build 25.121-b13, mixed mode)

The semantic_forms sandbox is up-to-date with the source code and the
scenario above:
http://semantic-forms.cc:9111/




2017-07-28 13:14 GMT+02:00 Jean-Marc Vanel <[email protected]>:

> Hi
>
> I've checked lots of things for 2 days.
>
> I have this scenario in semantic_forms:
>
>     - on fresh TDB and LUCENE directories
>     - load rdfs: (the ontology)
>     - create instance of class bli:bli (sic !)
>     - enter rdfs:comment bli
>     - search bli => NOTHING !!! :(
>
> I wrote a  self-contained test with no semantic_forms nor Banana
> dependency, that reproduces the same scenario by theAPI:
> https://github.com/jmvanel/semantic_forms/blob/master/
> scala/forms/src/main/scala/deductions/runtime/jena/
> lucene/TestTextIndex2.scala
>
> But it succeds !!!
>
> So I wrote another test that runs on the TDB that was prepared in the
> above scenario in semantic_forms:
> https://github.com/jmvanel/semantic_forms/blob/master/
> scala/forms/src/main/scala/deductions/runtime/jena/
> lucene/QueryTextIndex.scala
>
> The indexing seems normal on Lucene + Jena side, but NOT the SPARQL search
> with text:query .
>
> runMain deductions.runtime.jena.lucene.QueryTextIndex bli TDB
> ...
> [info] search with Lucene: bli
> [info] Doc: 30
> [info]   1 stored,indexed,tokenized,indexOptions=DOCS<uri:http://
> localhost:9000/ldp/1501237821055-8217451390491>
> [info]   uri = http://localhost:9000/ldp/1501237821055-8217451390491
> [info]   2 stored,indexed,tokenized,omitNorms,indexOptions=DOCS<lang:fr>
> [info]   lang = fr
> [info]   3 stored,indexed,tokenized,omitNorms,indexOptions=DOCS<uid:
> f1e70540a1cd751b78e29b31b4ae57c5520b71a728f8e1c7b24c698e8cd85e83>
> [info]   uid = f1e70540a1cd751b78e29b31b4ae57
> c5520b71a728f8e1c7b24c698e8cd85e83
> [info]   4 stored,indexed,tokenized,omitNorms,indexOptions=DOCS<lang:fr>
> [info]   lang = fr
> [info]   5 stored,indexed,tokenized,omitNorms,indexOptions=DOCS<uid:
> 435b1578a796765c441ad43a9147e1952abbc44facfa5aebab3d6cb67e98f844>
> [info]   uid = 435b1578a796765c441ad43a9147e1
> 952abbc44facfa5aebab3d6cb67e98f844
> [info] query
> [info]     PREFIX text: <http://jena.apache.org/text#>
> [info]     PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
> [info]     SELECT * WHERE {
> [info]     graph ?g {
> [info]     # ?thing text:query (rdfs:label  "bli" ) .
> [info]     ?thing text:query 'bli' .
> [info]     ?thing ?p ?o .
> [info]   }
> [info] } LIMIT 22
> [info]
> [info] ---------------------
> [info] | thing | p | o | g |
> [info] =====================
> [info] ---------------------
>
> The URI in Lucene dump is correct. I'm surprised that field "lang" appears
> 2 times, and "graph" not at all .
>
> I've looked in the Jena code, and the member fields in EntityDefinition
> https://github.com/apache/jena/blob/master/jena-text/
> src/main/java/org/apache/jena/query/text/EntityDefinition.java#L39
> looks as if it is not always updated.
> fields is initialized once from fieldToPredicate, and I'm not sure that
> fieldToPredicate is initialized before;
> moreover it is modified by method
> void set(String field, Node predicate)
> https://github.com/apache/jena/blob/master/jena-text/
> src/main/java/org/apache/jena/query/text/EntityDefinition.java#L126
>
> --
> Jean-Marc Vanel
> http://www.semantic-forms.cc:9111/display?displayuri=http:/
> /jmvanel.free.fr/jmv.rdf%23me
> Déductions SARL - Consulting, services, training,
> Rule-based programming, Semantic Web
> +33 (0)6 89 16 29 52 <+33%206%2089%2016%2029%2052>
> Twitter: @jmvanel , @jmvanel_fr ; chat: irc://irc.freenode.net#eulergui
>



-- 
Jean-Marc Vanel
http://www.semantic-forms.cc:9111/display?displayuri=http://jmvanel.free.fr/jmv.rdf%23me
Déductions SARL - Consulting, services, training,
Rule-based programming, Semantic Web
+33 (0)6 89 16 29 52
Twitter: @jmvanel , @jmvanel_fr ; chat: irc://irc.freenode.net#eulergui

Reply via email to