[
https://issues.apache.org/jira/browse/SOLR-15779?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17441231#comment-17441231
]
Walter Underwood commented on SOLR-15779:
-----------------------------------------
The stopword filter should be removed from all example configs.
At Netflix in 2007, stopword removal reduced each of these movie titles to an
empty string:
* [Being There|http://www.netflix.com/Movie/Being_There/60004552] (this is the
first one I noticed)
* [To Be and To Have (Être et
Avoir)|http://www.netflix.com/Movie/To_Be_and_To_Have/60031224]
* [To Have and Have
Not|http://www.netflix.com/Movie/To_Have_and_Have_Not/60027940]
* [Once and
Again|http://www.netflix.com/Movie/Once_and_Again_Season_1/60030531]
* [To Be or Not To Be
(1942)|http://www.netflix.com/Movie/To_Be_or_Not_To_Be/60011755] (OK, it isn’t
just a quote from Hamlet)
* [To Be or Not To Be
(1983)|http://www.netflix.com/Movie/To_Be_or_Not_To_Be/70046292]
* [Now and Then, Here and
There|http://www.netflix.com/Movie/Now_and_Then_Here_and_There_Vol._1_Discord_and_Doom/60022187]
* [Be with Me|http://www.netflix.com/Movie/Be_with_Me/70058486]
* [I’ll Be There|http://www.netflix.com/Movie/I_ll_Be_There/60026116]
* [It Had to Be You|http://www.netflix.com/Movie/It_Had_to_Be_You/60024124]
* [You Should Not Be
Here|https://www.netflix.com/Movie/You_Should_Not_Be_Here/70002914]
* [You Are Here|http://www.netflix.com/Movie/You_Are_Here/60022152]
https://observer.wunderwood.org/2007/05/31/do-all-stopword-queries-matter/
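A minimal sketch of the failure mode, for illustration only. The stopword list and the {{analyze}} helper below are hypothetical: they are not Solr's stopwords.txt, not the list Netflix used, and not Solr's analysis chain. They only show how a title made up entirely of common words ends up with no tokens left to index or match.

```python
import re

# Hypothetical stopword list, chosen for illustration. It is NOT Solr's
# stopwords.txt and NOT the list Netflix used in 2007.
STOPWORDS = {
    "a", "again", "and", "are", "be", "being", "had", "have", "here",
    "i", "it", "ll", "me", "not", "now", "once", "or", "should", "the",
    "then", "there", "to", "with", "you",
}


def analyze(text):
    """Lowercase, split on non-alphanumerics, then drop stopwords."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    return [t for t in tokens if t not in STOPWORDS]


titles = [
    "Being There",
    "To Be and To Have",
    "To Have and Have Not",
    "Once and Again",
    "To Be or Not To Be",
    "Now and Then, Here and There",
    "Be with Me",
    "I’ll Be There",
    "It Had to Be You",
    "You Should Not Be Here",
    "You Are Here",
]

for title in titles:
    # With this list, every title above analyzes to an empty token list.
    print(f"{title!r} -> {analyze(title)}")
```

With a list like this, a title such as "The Matrix" still keeps one token, so the problem stays invisible until someone searches for a title built entirely from stopwords.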
> Remove Stopwords from sample_techproducts_config?
> -------------------------------------------------
>
> Key: SOLR-15779
> URL: https://issues.apache.org/jira/browse/SOLR-15779
> Project: Solr
> Issue Type: Improvement
> Security Level: Public (Default Security Level. Issues are Public)
> Components: examples
> Affects Versions: main (9.0)
> Reporter: David Eric Pugh
> Priority: Minor
>
> A recent thread on the user mailing list drove home to me that folks use the
> {{sample_techproducts_config}} as a guide to good practices...
> https://lists.apache.org/thread/xoq5omk6om2z4tx71k1c8v50hkmv1jvp
> I know I've looked at it plenty over the years to figure out things!
> Thoughts on removing the stopwords filter from the example? Thoughts on
> maybe even tweaking the text
> https://solr.apache.org/guide/8_10/filter-descriptions.html#stop-filter to
> highlight some of the challenges in using stop words? or even, gasp, remove
> it?
--
This message was sent by Atlassian Jira
(v8.20.1#820001)