Jeroen Steggink created LUCENE-6493:
---------------------------------------

             Summary: MoreLikeThis minimum should match query
                 Key: LUCENE-6493
                 URL: https://issues.apache.org/jira/browse/LUCENE-6493
             Project: Lucene - Core
          Issue Type: Improvement
          Components: core/search
    Affects Versions: 5.x
            Reporter: Jeroen Steggink
            Priority: Minor


When performing MoreLikeThis queries we have a lot of parameters at hand to
fine-tune the results. However, one thing I personally miss is the ability to
set a minimum number of matching terms.

The advantage of this parameter is that Lucene will retrieve only the documents
that contain at least the given number of matching terms. You won't get results
that match on just one or two keywords when you are looking for documents that
share more than a few interesting keywords.

An example:

doc1: lucene
doc2: lucene release
doc3: apache
doc4: apache lucene
doc5: apache lucene release

Let's say the query is "apache lucene release" and you set the minimum number 
of matching interesting terms to 2.

MoreLikeThis will return:
doc2, doc4 and doc5

If you set it to 3, it would return only doc5.
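
The intended semantics of the example above can be sketched in plain Java. This
is only an illustration of the proposed behaviour, not Lucene code: the class
MinShouldMatchDemo and its methods matching and exampleDocs are hypothetical
names, terms are split on whitespace, and every query term is assumed to count
as an "interesting" term.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical illustration only; this does not use or mirror the Lucene API.
public class MinShouldMatchDemo {

    // Returns the ids of the documents that contain at least minMatch
    // distinct terms from the query, in the order the documents were added.
    static List<String> matching(Map<String, String> docs, String query, int minMatch) {
        Set<String> queryTerms = new HashSet<>(Arrays.asList(query.split("\\s+")));
        List<String> hits = new ArrayList<>();
        for (Map.Entry<String, String> doc : docs.entrySet()) {
            Set<String> shared = new HashSet<>(Arrays.asList(doc.getValue().split("\\s+")));
            shared.retainAll(queryTerms); // keep only the terms shared with the query
            if (shared.size() >= minMatch) {
                hits.add(doc.getKey());
            }
        }
        return hits;
    }

    // The five documents from the example in this issue.
    static Map<String, String> exampleDocs() {
        Map<String, String> docs = new LinkedHashMap<>();
        docs.put("doc1", "lucene");
        docs.put("doc2", "lucene release");
        docs.put("doc3", "apache");
        docs.put("doc4", "apache lucene");
        docs.put("doc5", "apache lucene release");
        return docs;
    }

    public static void main(String[] args) {
        String query = "apache lucene release";
        System.out.println(matching(exampleDocs(), query, 2)); // [doc2, doc4, doc5]
        System.out.println(matching(exampleDocs(), query, 3)); // [doc5]
    }
}
```

With a minimum of 1 the sketch returns all five documents, which corresponds to
the behaviour without the proposed parameter.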



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

