[
https://issues.apache.org/jira/browse/SOLR-6441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14134285#comment-14134285
]
Steve Rowe edited comment on SOLR-6441 at 9/15/14 6:41 PM:
-----------------------------------------------------------
[~RamanaOpenSource], your patch changes the MLT Handler to always use the
stopword list in {{lang/stopwords_en.txt}}, but that won't work a) for people
who don't want to use any stopwords with the MLT handler; or b) for those who
want to use a different stopword list; or c) for those who don't include the
{{lang/}} directory in their configset.
This needs to be configurable in the handler, and the default should be to not
load stopwords at all.
Also, before this can be committed, there needs to be tests demonstrating that
the new functionality works.
was (Author: steve_rowe):
[~RamanaOpenSource], your changes the MLT Handler to always use the stopword
list in {{lang/stopwords_en.txt}}, but that won't work a) for people who don't
want to use any stopwords with the MLT handler; or b) for those who want to use
a different stopword list; or c) for those who don't include the {{lang/}}
directory in their configset.
This needs to be configurable in the handler, and the default should be to not
load stopwords at all.
Also, before this can be committed, there needs to be tests demonstrating that
the new functionality works.
> MoreLikeThis support for stopwords as in Lucene
> -----------------------------------------------
>
> Key: SOLR-6441
> URL: https://issues.apache.org/jira/browse/SOLR-6441
> Project: Solr
> Issue Type: Improvement
> Components: MoreLikeThis
> Affects Versions: 4.9
> Reporter: Jeroen Steggink
> Priority: Minor
> Labels: difficulty-easy, impact-low, workaround-exists
> Fix For: 4.10, 4.11
>
> Attachments: SOLR-6441.patch
>
>
> In the Lucene implementation of MoreLikeThis, it's possible to add a list of
> stopwords which are considered "uninteresting" and are ignored.
> It would be a great addition to the MoreLikeThisHandler to be able to specify
> a list of stopwords.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]