[ 
https://issues.apache.org/jira/browse/SOLR-6441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ramana updated SOLR-6441:
-------------------------
    Attachment: SOLR-6441.patch

Attached is the patch. MoreLikeThisHandler now implements the SolrCoreAware 
interface; its inform method loads the existing stopwords_en file and 
converts it to a set of stopwords. For each MLT request, the handler sets 
these stopwords on the Lucene "MoreLikeThis" API. With this change, the 
"addTermFrequencies" method of "MoreLikeThis" ignores all stop words while 
building the termFreqMap.
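
To illustrate the idea, here is a minimal standalone sketch of the two steps 
described above: turning the lines of a stopword file into a set, and 
skipping those words while counting term frequencies. It is not the code in 
the patch and does not use Solr or Lucene classes. The class name 
StopwordSketch, both method names, and the assumed file format (one word per 
line, '#' for comments) are hypothetical and chosen only for this example. 
In the real handler, the set would presumably be passed to Lucene's 
MoreLikeThis.setStopWords.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

// Hypothetical illustration only; not the code in SOLR-6441.patch.
class StopwordSketch {

    // Turn the lines of a stopword file into a set.
    // Assumption: one word per line, blank lines and '#' comments are skipped.
    static Set<String> parseStopwords(List<String> lines) {
        Set<String> stopwords = new HashSet<>();
        for (String line : lines) {
            String word = line.trim().toLowerCase(Locale.ROOT);
            if (!word.isEmpty() && !word.startsWith("#")) {
                stopwords.add(word);
            }
        }
        return stopwords;
    }

    // Count term frequencies, ignoring any token found in the stopword set.
    static Map<String, Integer> termFrequencies(List<String> tokens, Set<String> stopwords) {
        Map<String, Integer> termFreqMap = new HashMap<>();
        for (String token : tokens) {
            String term = token.toLowerCase(Locale.ROOT);
            if (stopwords.contains(term)) {
                continue; // stop words never reach the map
            }
            termFreqMap.merge(term, 1, Integer::sum);
        }
        return termFreqMap;
    }

    public static void main(String[] args) {
        // Stands in for the contents of a stopword file loaded once at startup.
        Set<String> stopwords = parseStopwords(Arrays.asList("# sample list", "the", "and", "of"));
        List<String> tokens = Arrays.asList("the", "history", "of", "search", "and", "search", "engines");
        System.out.println(termFrequencies(tokens, stopwords));
    }
}
```

Building the set once (as the inform method does in the patch) and reusing it 
for every request avoids re-reading the file on each MLT query.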

> MoreLikeThis support for stopwords as in Lucene
> -----------------------------------------------
>
>                 Key: SOLR-6441
>                 URL: https://issues.apache.org/jira/browse/SOLR-6441
>             Project: Solr
>          Issue Type: Improvement
>          Components: MoreLikeThis
>    Affects Versions: 4.9
>            Reporter: Jeroen Steggink
>            Priority: Minor
>              Labels: difficulty-easy, impact-low, workaround-exists
>             Fix For: 4.10, 4.11
>
>         Attachments: SOLR-6441.patch
>
>
> In the Lucene implementation of MoreLikeThis, it's possible to add a list of 
> stopwords which are considered "uninteresting" and are ignored.
> It would be a great addition to the MoreLikeThisHandler to be able to specify 
> a list of stopwords.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

