Hi Chris,

Our previous text index build was done with 3.8.0, not 3.9.0; sorry for the misinformation. I sent the assembler file and the packet capture file to your private email address (users@jena doesn't accept messages larger than 1 MB). I tried to enable debug logging in /log4j.properties/ with *log4j.logger.org.apache.jena.query.text.es=DEBUG*, but nothing appeared in the log file.

Regards,
Sorin


On 27.02.2019 at 20:01, Chris Tomlinson wrote:
Hi Sorin,

Please provide the assembler file for Elasticsearch that has the problematic 
entity map definitions.

There haven’t been any changes in over a year to textindexer since well before 
3.9. I don’t see any relevant changes to the handling of entity maps either so 
I can’t pursue the issue further without seeing your current 
assembler file.

I don't have any experience with Elasticsearch or with using jena-text-es 
beyond a simple change to TextIndexES.java to change 
org.elasticsearch.common.transport.InetSocketTransportAddress to 
org.elasticsearch.common.transport.TransportAddress as part of the upgrade to 
Lucene 7.4.0 and Elasticsearch 6.4.2.

Regards,
Chris


On Feb 25, 2019, at 2:37 AM, Sorin Gheorghiu <[email protected]> wrote:

Correction: only the *last field* in the /text:map/ list contains a value.

To reformulate:

* if there are 3 fields in /text:map/, then during indexing the first
   two are empty (let's name them 'text1' and 'text2') and the last
   field contains data (let's name it 'text3')
* if on the next attempt the field 'text3' is commented out, then
   'text1' is empty and 'text2' contains data


On 22.02.2019 at 15:01, Sorin Gheorghiu wrote:
In addition:

  * if there are 3 fields in /text:map/, then during indexing one
    contains data (let's name it 'text1'), the others are empty (let's
    name them 'text2' and 'text3'),
  * if on the next attempt the field 'text1' is commented out, then
    'text2' contains data and 'text3' is empty



-------- Forwarded Message --------
Subject:        Text Index build with empty fields
Date:   Fri, 22 Feb 2019 14:01:18 +0100
From:   Sorin Gheorghiu <[email protected]>
Reply-To:       [email protected]
To:     [email protected]



Hi,

When building the text index with the /jena.textindexer/ tool in Jena 3.10 for 
an external full-text search engine (Elasticsearch of course) and having 
multiple fields with different names in /text:map/, just *one field is indexed* 
(more precisely, one field contains data and the others are empty). This does not 
appear to be an issue with Elasticsearch: in the logs generated during indexing, 
all fields but one are already missing their values. The same setup worked in 
Jena 3.9. Changing the Java version from 8 to 9 or 11 didn't change anything.

Could changes in the new release have affected this tool, so that we are dealing 
with a bug?

--
Sorin Gheorghiu             Tel: +49 7531 88-3198
Universität Konstanz        Raum: B705
78464 [email protected]

- KIM: Abteilung Contentdienste -
