[
https://issues.apache.org/jira/browse/SOLR-10263?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16079987#comment-16079987
]
Abhishek Kumar Singh commented on SOLR-10263:
---------------------------------------------
After PR #218, the _solrconfig.xml_ entry for *WordBreakSolrSpellChecker*
(and, later, for all the other components) can be configured like this:
{code:xml}
<lst name="spellchecker">
  <str name="name">wordbreakspellcheck</str>
  <str name="classname">solr.WordBreakSolrSpellChecker</str>
  <str name="field">fieldspell</str>
  <str name="combineWords">true</str>
  <str name="breakWords">true</str>
  <str name="usePhraseQFrequency">true</str>
  <int name="maxChanges">5</int>
  <int name="minPrefix">0</int>
  <str name="wordBreakSuggestMode">SUGGEST_WHEN_NOT_IN_INDEX</str>
  <str name="joinWordsSuggestMode">SUGGEST_ALWAYS</str>
</lst>
{code}
Or simply, with a single {{suggestMode}}:
{code:xml}
<lst name="spellchecker">
  <str name="name">spellcheckword</str>
  <str name="classname">solr.WordBreakSolrSpellChecker</str>
  <str name="field">fieldspell</str>
  <str name="combineWords">true</str>
  <str name="breakWords">true</str>
  <int name="maxChanges">5</int>
  <int name="minPrefix">0</int>
  <str name="suggestMode">SUGGEST_WHEN_NOT_IN_INDEX</str>
</lst>
{code}
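For the use case in the issue description (one checker always suggesting, the other suggesting only for tokens missing from the index), the combined configuration might look roughly like the sketch below. This is only an illustration under assumptions: the {{suggestMode}} entry on {{DirectSolrSpellChecker}} is the per-checker behaviour proposed in this issue, not something confirmed here as merged, and the checker name {{default}} and the enclosing {{searchComponent}} name are placeholders.
{code:xml}
<searchComponent name="spellcheck" class="solr.SpellCheckComponent">
  <!-- Assumption: per-checker suggestMode is the behaviour proposed in SOLR-10263 -->
  <lst name="spellchecker">
    <str name="name">default</str>
    <str name="classname">solr.DirectSolrSpellChecker</str>
    <str name="field">fieldspell</str>
    <str name="suggestMode">SUGGEST_ALWAYS</str>
  </lst>
  <!-- Word break and word join use separate modes, as in the first example above -->
  <lst name="spellchecker">
    <str name="name">wordbreakspellcheck</str>
    <str name="classname">solr.WordBreakSolrSpellChecker</str>
    <str name="field">fieldspell</str>
    <str name="combineWords">true</str>
    <str name="breakWords">true</str>
    <str name="wordBreakSuggestMode">SUGGEST_WHEN_NOT_IN_INDEX</str>
    <str name="joinWordsSuggestMode">SUGGEST_ALWAYS</str>
  </lst>
</searchComponent>
{code}
With such a setup, a query like *gold mine sunglasses* could still yield the joined suggestion *goldmine sunglasses*, while word breaks would be offered only for tokens that are not in the index.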
> Different SpellcheckComponents should have their own suggestMode
> ----------------------------------------------------------------
>
> Key: SOLR-10263
> URL: https://issues.apache.org/jira/browse/SOLR-10263
> Project: Solr
> Issue Type: Wish
> Security Level: Public(Default Security Level. Issues are Public)
> Components: spellchecker
> Reporter: Abhishek Kumar Singh
> Priority: Minor
>
> As of now, the common spellcheck options are applied to all the
> SpellCheckComponents.
> This can create a problem in the following case:
> We may want *DirectSolrSpellChecker* to always return spellcheck
> suggestions (SUGGEST_ALWAYS),
> but want *WordBreakSolrSpellChecker* to suggest only if the token is not
> in the index (SUGGEST_WHEN_NOT_IN_INDEX).
> *Update:* Recently, we also figured out that, within
> {{WordBreakSolrSpellChecker}}, {{WordBreak}} and {{WordJoin}}
> should also have different suggestModes.
> We faced this problem in our case, where most of the WordJoin cases are
> those in which the words are individually valid tokens, but what the users
> are looking for is actually a combination (wordjoin) of the two tokens.
> For example:
> *gold mine sunglasses*: Here, both *gold* and *mine* are valid tokens. But
> the actual product being looked for is *goldmine sunglasses*, where
> *goldmine* is a brand.
> In such cases, we should recommend {{didYouMean:goldmine sunglasses}}. But
> this won't be possible because we had set {{SUGGEST_WHEN_NOT_IN_INDEX}} for
> {{WordBreakSolrSpellChecker}} (of which WordJoin is a part).
> For this, we should have separate suggestModes for {{wordJoin}} and
> {{wordBreak}}.
> The related changes have been made in the latest PR:
> https://github.com/apache/lucene-solr/pull/218