Hi Russel, my apologies for the delayed response. I rather have all
correspondence on the mailing list, but to keep this mail thread readable I
put the files at http://cdoronc.awardspace.com/TfTermQuery . I hope it
helps you and would be interested in your comments.
Regards,
Doron
Russell M.
a document (count?)
Hi Russel,
I am also interested in the internals of Lucene's ranking and how one
can/should alter the scoring. For now I was just learning from existing
code of Lucene scorers and Weights. Your question seemed interesting, so
I in fact implemented a quick scorer that would return
Hi Russel,
I am also interested in the internals of Lucene's ranking and how one
can/should alter the scoring. For now I was just learning from existing
code of Lucene scorers and Weights. Your question seemed interesting, so I
in fact implemented a quick scorer that would return the raw tf as a
, July 31, 2006 4:02 AM
To: Lucene Users
Subject: Re: Scoring a document (count?)
it would certainly be possible to get a score that was a simple count of
the number of matching clauses of a boolean query -- probably just with
a modified Similarity (no coord, 1/0 tf, no idf, no norms) but you
*might
lucene
works under the covers enough to write the TermQuery variant.
Thanks for the help,
Russell.
-Original Message-
From: Doron Cohen [mailto:[EMAIL PROTECTED]
Sent: Friday, July 28, 2006 3:19 AM
To: java-user@lucene.apache.org
Subject: Re: Scoring a document (count?)
This task reminds me
This task reminds me more of a count(*) sql query than a text search query.
Assuming that using a text search engine is a pre requisite, I can think of
two approaches - basing on Lucene scoring as suggested in the question, or
a more simple approach (below).
For the scoring approach - I don't
Doron Cohen/Haifa/[EMAIL PROTECTED] wrote on 28/07/2006 00:18:47:
For the scoring approach - I don't see an easy way to get the
counts from the score of the results, although the TF (term
frequency in candidate docs) is known+used during document
scoring, and although it seems that the
I am curious about the potential use of document scoring as a means to
extract additional data from an index. Specifically, I would like the
score to be a count of how many times a particular field matched a set
of terms.
For example, I am indexing movie-stars (Each document is a movie-star).
A