Re: Problem with solr suggester in case of non-ASCII characters

2019-07-31 Thread Szűcs Roland
Hi Erick, Thanks your advice. I already removed it from the field definition used by the suggester and it works great. I will consider to took it from the entire processing of the other fields. I have only 7000 docs with index size of 18MB so far, so the memory footprint is not a key issue for me

Re: Problem with solr suggester in case of non-ASCII characters

2019-07-31 Thread Erick Erickson
Roland: Have you considered just not using stopwords anywhere? Largely they’re a holdover from a long time ago when every byte counted. Plus using stopwords has “interesting” issues with things like highlighting and phrase queries and the like. Sure, not using stopwords will make your index lar

Re: Problem with solr suggester in case of non-ASCII characters

2019-07-30 Thread Szűcs Roland
Hi Furkan, Thanks the suggestion, I always forget the most effective debugging tool the analysis page. It turned out that "Jó" was a stop word and it was eliminated during the text analysis. What I will do is to create a new field type but without stop word removal and I will use it like this: sh

Re: Problem with solr suggester in case of non-ASCII characters

2019-07-30 Thread Furkan KAMACI
Hi Roland, Could you check Analysis tab ( https://lucene.apache.org/solr/guide/8_1/analysis-screen.html) and tell how the term is analyzed for both query and index? Kind Regards, Furkan KAMACI On Tue, Jul 30, 2019 at 4:50 PM Szűcs Roland wrote: > Hi All, > > I have an author suggester (searchc

Problem with solr suggester in case of non-ASCII characters

2019-07-30 Thread Szűcs Roland
Hi All, I have an author suggester (searchcomponent and the related request handler) defined in solrconfig: > author AnalyzingInfixLookupFactory DocumentDictionaryFactory BOOK_productAuthor short_text_hu suggester_infix_author false false