RE: Scoring a document (count?)

2006-08-10 Thread Doron Cohen
Hi Russel, my apologies for the delayed response. I rather have all correspondence on the mailing list, but to keep this mail thread readable I put the files at http://cdoronc.awardspace.com/TfTermQuery . I hope it helps you and would be interested in your comments. Regards, Doron Russell M.

RE: Scoring a document (count?)

2006-08-04 Thread Russell M. Allen
a document (count?) Hi Russel, I am also interested in the internals of Lucene's ranking and how one can/should alter the scoring. For now I was just learning from existing code of Lucene scorers and Weights. Your question seemed interesting, so I in fact implemented a quick scorer that would return

RE: Scoring a document (count?)

2006-08-03 Thread Doron Cohen
Hi Russel, I am also interested in the internals of Lucene's ranking and how one can/should alter the scoring. For now I was just learning from existing code of Lucene scorers and Weights. Your question seemed interesting, so I in fact implemented a quick scorer that would return the raw tf as a

RE: Scoring a document (count?)

2006-07-31 Thread Russell M. Allen
, July 31, 2006 4:02 AM To: Lucene Users Subject: Re: Scoring a document (count?) it would certainly be possible to get a score that was a simple count of the number of matching clauses of a boolean query -- probably just with a modified Similarity (no coord, 1/0 tf, no idf, no norms) but you *might

RE: Scoring a document (count?)

2006-07-31 Thread Russell M. Allen
lucene works under the covers enough to write the TermQuery variant. Thanks for the help, Russell. -Original Message- From: Doron Cohen [mailto:[EMAIL PROTECTED] Sent: Friday, July 28, 2006 3:19 AM To: java-user@lucene.apache.org Subject: Re: Scoring a document (count?) This task reminds me

Re: Scoring a document (count?)

2006-07-28 Thread Doron Cohen
This task reminds me more of a count(*) sql query than a text search query. Assuming that using a text search engine is a pre requisite, I can think of two approaches - basing on Lucene scoring as suggested in the question, or a more simple approach (below). For the scoring approach - I don't

Re: Scoring a document (count?)

2006-07-28 Thread Doron Cohen
Doron Cohen/Haifa/[EMAIL PROTECTED] wrote on 28/07/2006 00:18:47: For the scoring approach - I don't see an easy way to get the counts from the score of the results, although the TF (term frequency in candidate docs) is known+used during document scoring, and although it seems that the

Scoring a document (count?)

2006-07-27 Thread Russell M. Allen
I am curious about the potential use of document scoring as a means to extract additional data from an index. Specifically, I would like the score to be a count of how many times a particular field matched a set of terms. For example, I am indexing movie-stars (Each document is a movie-star). A