Hi Markus,
thanks for the quick answer.
"sound stage" was just an example. We are looking for a generic solution ...
Is it "ok" to apply an NGRamFilter for query-analyzing?
<analyzer type="query">
<tokenizer class="solr.WhitespaceTokenizerFactory" />
<filter class="solr.LowerCaseFilterFactory" />
<filter class="solr.NGramFilterFactory" minGramSize="3"
maxGramSize="15" />
</analyzer>
I guess (besides the performance impact) this reduces search results accuracy?
-Clemens
-----Ursprüngliche Nachricht-----
Von: Markus Jelsma <[email protected]>
Gesendet: Freitag, 3. August 2018 12:43
An: [email protected]
Betreff: RE: indexing two words, searching single word
Hello,
If your case is English you could use synonyms to work around the problem of
the few compound words of the language. However, would you be dealing with a
Germanic compound language, the HyphenationCompoundWordTokenFilter [1] or
DictionaryCompoundWordTokenFilter are a better choice. The former is much more
flexible but has its drawbacks.
Regards,
Markus
https://lucene.apache.org/core/7_4_0/analyzers-common/org/apache/lucene/analysis/compound/HyphenationCompoundWordTokenFilterFactory.html
-----Original message-----
> From:Clemens Wyss DEV <[email protected]>
> Sent: Friday 3rd August 2018 12:22
> To: [email protected]
> Subject: indexing two words, searching single word
>
> Sounds like a rather simple issue:
> if I index "sound stage" and search for "soundstage" I get no hits
>
> What am I doing wrong
> a) when indexing
> b) when searching
> ?
>
> Thx in advance
> - Clemens
>