The core tf-idf scoring is described in this Javadoc:
http://lucene.apache.org/core/4_3_0/core/org/apache/lucene/search/similarities/TFIDFSimilarity.html
That describes the scoring model and cites some papers.
Then you can navigate up to the base class and see that BM25 is another
derived class. Unfortunately, that has less Javadoc, although it does cite a
key paper on that approach.
-- Jack Krupansky
-----Original Message-----
From: Erick Erickson
Sent: Wednesday, July 17, 2013 8:17 AM
To: java-user
Subject: Re: What is text searching algorithm in Lucene 4.3.1
Note: as of Lucene 4.x, you can plug in your
own scoring algorithm, it ships with several
variants (e.g. BM25) so you can look at the
pluggable scoring where all the code for the
various algorithms is concentrated.
Erick
On Wed, Jul 17, 2013 at 12:40 AM, Jack Krupansky
<j...@basetechnology.com> wrote:
The source code is what most people use to understand how Lucene actually
works. In some cases the Javadoc comments will point to published papers
or
web sites for algorithms or approaches.
-- Jack Krupansky
-----Original Message----- From: Vinh Đặng
Sent: Tuesday, July 16, 2013 10:54 PM
To: java-user@lucene.apache.org
Subject: What is text searching algorithm in Lucene 4.3.1
Hi all,
I am trying to apply Lucene for a specific domain, so I need to customize
the text searching / text comparing algorithm of Lucene.
Is there any guideline / tutorial or article which explains about how
Lucene
search and answer the query?
Thank you very much.
--
Thank you very much
VINH Dang (Mr)
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org