[
https://issues.apache.org/jira/browse/SOLR-6441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ramana updated SOLR-6441:
-------------------------
Attachment: SOLR-6441.patch
Attached is the path. Now, MoreLikeThisHandler implements SolrCoreAware
interface and in the inform method, loading the existing stopwords_en file and
converting to set of stopwords. For each mlt request, setting the stopwords to
Lucene "MoreLikeThis" API. With this, In the "addTermFrequencies" method of
"MoreLikeThis" all the stop words will be ignored while building the
termFreqMap.
> MoreLikeThis support for stopwords as in Lucene
> -----------------------------------------------
>
> Key: SOLR-6441
> URL: https://issues.apache.org/jira/browse/SOLR-6441
> Project: Solr
> Issue Type: Improvement
> Components: MoreLikeThis
> Affects Versions: 4.9
> Reporter: Jeroen Steggink
> Priority: Minor
> Labels: difficulty-easy, impact-low, workaround-exists
> Fix For: 4.10, 4.11
>
> Attachments: SOLR-6441.patch
>
>
> In the Lucene implementation of MoreLikeThis, it's possible to add a list of
> stopwords which are considered "uninteresting" and are ignored.
> It would be a great addition to the MoreLikeThisHandler to be able to specify
> a list of stopwords.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]