Re: Looking for more information about Lucene

2018-05-22 Thread Adrien Grand
Hi Alexandre, I don't have time for a call, but to give you some pointers, Lucene does the following that may be related to natural language processing: - Word segmentation via the `Tokenizer` class. It is rather simple for Western languages (including French, see `StandardTokenizer`), but less for

Re: Case Insensitive Search for StringField

2018-05-22 Thread Adrien Grand
Hi Satyan, You need to add a `LowerCaseFilter` to your analysis chain. How to do it depends on how you build your analyzer today (pre-built analyzer, extending `Analyzer`, or using `CustomAnalyzer`). This will preserve the original case in field values because lowercasing will only be app
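
To illustrate the idea in the reply above, here is a minimal sketch in plain Java. It is not Lucene's API: the class `CaseInsensitiveIndex` and its methods are hypothetical names made up for this example. It only shows the principle that lowercasing is applied to the indexed term and to the query, while the stored value keeps its original case. In Lucene terms this would roughly correspond to treating the whole value as a single token and passing it through `LowerCaseFilter`, but the exact analyzer setup depends on your code.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Toy in-memory index (hypothetical, NOT Lucene's API).
// Only the lookup key is lowercased; the stored value is kept verbatim.
public class CaseInsensitiveIndex {
    // Maps the normalized (lowercased) term to the original stored values.
    private final Map<String, List<String>> postings = new HashMap<>();

    // Normalization applied identically at index time and at query time.
    private static String normalize(String value) {
        return value.toLowerCase(Locale.ROOT);
    }

    // Index a value: the key is lowercased, the stored copy is untouched.
    public void add(String storedValue) {
        postings.computeIfAbsent(normalize(storedValue), k -> new ArrayList<>())
                .add(storedValue);
    }

    // Look up a query regardless of its case; returns the original values.
    public List<String> search(String query) {
        return postings.getOrDefault(normalize(query), List.of());
    }

    public static void main(String[] args) {
        CaseInsensitiveIndex index = new CaseInsensitiveIndex();
        index.add("Flying Robots");
        // Prints [Flying Robots]: the match ignores case, the stored case survives.
        System.out.println(index.search("fLying RObots"));
    }
}
```

The key design point is that the same normalization runs on both sides; if only the indexed value or only the query were lowercased, mixed-case queries would stop matching.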

Case Insensitive Search for StringField

2018-05-22 Thread Chellasamy G
Hi, I can't find any way to perform a case-insensitive search on a StringField. Please help me out. For example, if the field value is "Flying Robots", then the phrases "flying robots", "fLying RObots", etc. should match the value. I also need the original case of the field value to be preserved

Looking for more information about Lucene

2018-05-22 Thread BABAUD Alexandre
Good afternoon everyone, I am working for a French company and in the scope of my work I am collecting information on open source NLP tools available on the "market" worldwide. I was looking for such intel on the internet and by reading some users' comments but I figured, why not contact the per