Re: Lowercase-ing everything but acronyms

2020-09-09 Thread Stavros Macrakis
I can't help you on the implementation issues, but... You may want to do something a little different than keep all-uppercase tokens in upper case. You may want simply to special-case all-uppercase stopwords, so that they are not ignored. The poster boy for that is IT, which in my last search appl

Lowercase-ing everything but acronyms

2020-09-09 Thread Dunham-Wilkie, Mike CITZ:EX
Hi SOLR list, I'm currently using the White Space tokenizer and the Lower Case filter with SOLR 7.3. I'd like to modify the logic to keep any tokens that are entirely upper case as upper case, and just apply the Lower Case filter (or something equivalent) to the remaining tokens. Is there a w