Hi Alexandre,
I don't have time for a call, but to give you some pointers, Lucene does
the following that may be related to natural language processing:
- Word segmentation via the `Tokenizer` class. It is rather simple for
western languages (including French, see StandardTokenizer), but less for
Hi Satyan,
You need to add a LowercaseTokenFilter to your analysis chain. The way to
do it depends on how you are building your analyzer today (pre-built
analyzer, extending Analyzer or using CustomAnalyzer). This will preserve
the original case in field values because lowercasing will only be app
Hi,
I can't find any way to perform case insensitive search on StringField. Please
help me out.
i.e If the field value is "Flying Robots", then the phrases "flying robots",
"fLying RObots" etc should match the value.
I also need the original case of the field value to be preserved
Good afternoon everyone,
I am working for a French company and in the scope of my work I am collecting
information on open source NLP tools available on the "market" worldwide.
I was looking for such intel on the internet and by reading some users'
comments but I figured, why not contact the per