The core tf-idf scoring is described in this Javadoc:
http://lucene.apache.org/core/4_3_0/core/org/apache/lucene/search/similarities/TFIDFSimilarity.html

That describes the scoring model and cites some papers.
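
As a rough illustration of the factors that Javadoc describes, here is a minimal sketch in plain Java. It is not Lucene code: the class and method names are made up for this example, and it assumes the default choices documented there (tf as the square root of the term frequency, idf as 1 + ln(numDocs / (docFreq + 1)), and a length norm of 1 / sqrt(numTerms)). The coord, queryNorm, and boost factors are left out.

```java
// Hypothetical, library-free sketch of per-term tf-idf scoring.
// Names are illustrative only; they are not Lucene API.
public class TfIdfSketch {

    // Term-frequency factor: grows with the square root of the raw count.
    public static double tf(double freq) {
        return Math.sqrt(freq);
    }

    // Inverse document frequency: rarer terms get a larger weight.
    public static double idf(long docFreq, long numDocs) {
        return 1.0 + Math.log((double) numDocs / (docFreq + 1));
    }

    // Length normalization: shorter fields get a larger weight.
    public static double lengthNorm(int numTerms) {
        return 1.0 / Math.sqrt(numTerms);
    }

    // One term's contribution, ignoring coord, queryNorm, and boosts.
    public static double termScore(double freq, long docFreq, long numDocs, int numTerms) {
        double idf = idf(docFreq, numDocs);
        return tf(freq) * idf * idf * lengthNorm(numTerms);
    }

    public static void main(String[] args) {
        // A term occurring 4 times in a 100-term field, present in 9 of 1000 docs.
        System.out.println(termScore(4, 9, 1000, 100));
    }
}
```

Note that idf enters the product twice, once from the query side and once from the document side, which is why it appears squared.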

From there you can navigate up to the base class, Similarity, and see that BM25Similarity is another derived class. Unfortunately, it has less Javadoc, although it does cite a key paper on that approach.
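
For comparison, the textbook Okapi BM25 per-term formula can be sketched the same way. Again this is plain Java with made-up names, not Lucene's implementation; the parameter values k1 = 1.2 and b = 0.75 are the commonly used defaults and are an assumption here, and Lucene's own class additionally works with encoded norms.

```java
// Hypothetical, library-free sketch of the textbook BM25 per-term score.
// Names and parameter values are illustrative assumptions, not Lucene API.
public class Bm25Sketch {

    public static final double K1 = 1.2;  // term-frequency saturation
    public static final double B = 0.75;  // strength of length normalization

    // BM25 idf, written so that it never goes negative.
    public static double idf(long docFreq, long numDocs) {
        return Math.log(1.0 + (numDocs - docFreq + 0.5) / (docFreq + 0.5));
    }

    // One term's contribution for a document of length docLen,
    // given the average document length avgDocLen.
    public static double termScore(double freq, long docFreq, long numDocs,
                                   double docLen, double avgDocLen) {
        double lengthFactor = 1.0 - B + B * (docLen / avgDocLen);
        return idf(docFreq, numDocs) * (freq * (K1 + 1.0)) / (freq + K1 * lengthFactor);
    }

    public static void main(String[] args) {
        // Same term statistics, average-length vs. double-length document.
        System.out.println(termScore(2, 10, 1000, 100, 100));
        System.out.println(termScore(2, 10, 1000, 200, 100));
    }
}
```

Unlike the square-root tf above, the BM25 term-frequency factor saturates: however often the term occurs, its contribution stays below idf * (k1 + 1).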

-- Jack Krupansky

-----Original Message----- From: Erick Erickson
Sent: Wednesday, July 17, 2013 8:17 AM
To: java-user
Subject: Re: What is text searching algorithm in Lucene 4.3.1

Note: as of Lucene 4.x, you can plug in your
own scoring algorithm. Lucene ships with several
variants (e.g. BM25), so you can look at the
pluggable scoring code, where the implementations
of the various algorithms are concentrated.

Erick

On Wed, Jul 17, 2013 at 12:40 AM, Jack Krupansky
<j...@basetechnology.com> wrote:
The source code is what most people use to understand how Lucene actually
works. In some cases the Javadoc comments will point to published papers or
web sites for algorithms or approaches.

-- Jack Krupansky

-----Original Message----- From: Vinh Đặng
Sent: Tuesday, July 16, 2013 10:54 PM
To: java-user@lucene.apache.org
Subject: What is text searching algorithm in Lucene 4.3.1


Hi all,

I am trying to apply Lucene to a specific domain, so I need to customize
Lucene's text searching / text comparing algorithm.

Is there any guideline, tutorial, or article that explains how Lucene
searches and answers a query?

Thank you very much.

--
Thank you very much

VINH Dang (Mr)

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org

