Re: Scoring results?!

Grant Ingersoll Wed, 09 May 2007 04:14:15 -0700

Hi Eric,

On May 9, 2007, at 2:39 AM, supereric wrote:

How I can get the tag word score in lucene. suppose that you havesearched a
tag word and 3 hit documents
are now found.
1 -How someone could find number of occurrences in any document soit could
sort the results.

Span Queries tell you where the matches occur in the document byoffset, but I am not sure what your sorting criteria would be. Theexplain method also can give you information about why a particulardocument scored a particular way.

Also I wan to have some other policies for ranking the results.What should
I do to handle that. for example
I want to score boldfaced tag words in an html document twicenormal texts.

Although totally experimental at this stage, the new Payload stuff inthe trunk version of Lucene (or nightly builds) is designed for sucha scenario. Check out the BoostingTermQuery which can boost termscores based on the contents of a payload located at a particularterm. Feedback on the APIs is very much appreciated.

2- How I can omit some tag words from the index?! for examplecommon words
in another language?


See the StopFilter token filter and/or the StopwordAnalyzer


HTH,
Grant

--------------------------
Grant Ingersoll
Center for Natural Language Processing
http://www.cnlp.org/tech/lucene.asp

Read the Lucene Java FAQ at http://wiki.apache.org/jakarta-lucene/LuceneFAQ




---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: Scoring results?!

Reply via email to