Re: About solr.HyphenatedWordsFilter

2020-08-26 Thread Erick Erickson
Another option is to suggest from a copyField with a very simple analysis chain. Say:
- PatternReplaceCharFilterFactory - to remove everything you don’t want to keep
- WhitespaceTokenizerFactory
- LowerCaseFilterFactory - maybe
And I think you missed Shawn’s point about the exclamation point. If you
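
To make the effect of that chain concrete, here is a rough sketch in plain Python that imitates the three steps (character filtering, whitespace tokenization, lowercasing). It is not Solr code and does not use any Solr API; the `UNWANTED` pattern and the `analyze` function are hypothetical names chosen for illustration, and the pattern assumes that only letters, digits, hyphens, and whitespace should be kept.

```python
import re

# Hypothetical pattern (an assumption, not from the thread):
# drop everything except ASCII letters, digits, hyphens, and whitespace.
UNWANTED = re.compile(r"[^A-Za-z0-9\-\s]")


def analyze(text: str) -> list:
    """Imitate the suggested chain: char filter, whitespace tokenizer, lowercase."""
    stripped = UNWANTED.sub("", text)    # stands in for the pattern-replace char filter
    tokens = stripped.split()            # stands in for whitespace tokenization
    return [t.lower() for t in tokens]   # stands in for the lowercase filter


print(analyze("This is a new abc-edg and xyz-abc is coming soon!"))
```

With this pattern the hyphenated IDs survive as single tokens, while the trailing exclamation point is removed rather than emitted as its own token.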

Re: About solr.HyphenatedWordsFilter

2020-08-26 Thread Kayak28
Hello, Shawn. Thank you for your response. Yes, I am sure that I need to preserve "-" in the words. What I want to do is not actually search; it is suggestion. "abc-efg" is a dummy sample of our product ID. There are several product IDs, such as abc-efg, abc-hij, abc-klm, and so on. When

Re: About solr.HyphenatedWordsFilter

2020-08-26 Thread Shawn Heisey
On 8/26/2020 12:05 AM, Kayak28 wrote: I would like to tokenize the following sentence. I want tokens that retain their hyphens. So, for example, original text: This is a new abc-edg and xyz-abc is coming soon! Desired output tokens: this/is/a/new/abc-edg/and/xyz-abc/is/coming/soon/! Is there

About solr.HyphenatedWordsFilter

2020-08-26 Thread Kayak28
Hello, Solr community: I would like to tokenize the following sentence. I want tokens that retain their hyphens. So, for example, original text: This is a new abc-edg and xyz-abc is coming soon! Desired output tokens: this/is/a/new/abc-edg/and/xyz-abc/is/coming/soon/! Is there any way that I do
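
As a way to pin down the target behavior, the desired output can be reproduced outside Solr with a small regular-expression tokenizer. This is only a sketch in plain Python, not a Solr configuration; the `TOKEN` pattern and the `tokenize` function are hypothetical, and the pattern assumes that product IDs consist of ASCII letters and digits joined by single hyphens.

```python
import re

# Hypothetical token pattern (an assumption, not from the thread):
#   1. runs of letters/digits, optionally joined by single hyphens (abc-edg)
#   2. otherwise, any single non-space, non-alphanumeric character (the "!")
TOKEN = re.compile(r"[A-Za-z0-9]+(?:-[A-Za-z0-9]+)*|[^\sA-Za-z0-9]")


def tokenize(text: str) -> list:
    """Return lowercased tokens, keeping inner hyphens and isolating punctuation."""
    return [m.group(0).lower() for m in TOKEN.finditer(text)]


print("/".join(tokenize("This is a new abc-edg and xyz-abc is coming soon!")))
```

Unlike a plain whitespace split, which would produce "soon!" as one token, this pattern emits the exclamation point separately, matching the desired output in the question.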