On Tue, Jan 16, 2007 at 11:49:12AM +0100, Martin Bernd Schmeil wrote:
> Hi Jens,
> 
[..]
> The main issues we have is the well known locking problem and the 
> scores.

Making sure you only have one process writing to the index, i.e. via
an indexer running in backgroundrb, should solve these issues.

> The scores leave us with the problem that - while the order seems to be 
> correct - we don't know where to cut the line to display results and 
> what a relevant match is. For a dozen attributes I've seen scores from 
> 0.something to 9.something, with  a result close below 9 not even 
> looling similar while just above 9 seems to be a "99 percent" match.

the calculation of scores is quite complex. To get an idea what happens
in there you can use Ferret's explain method (in
Ferret::Search::Searcher).

> If someone would tell me - in case this is possible at all - how to 
> normalize the scores I'd be very happy.

no idea if this is possible - maybe you find some information about
this in the context of Lucene (i.e. in Eric Hatcher's fine Lucene book
or on the lucene mailing list).

> Another thing which I didn't understand yet is what actually happens if 
> I do a multi token fuzzy search; currently I'm splitting the string up 
> in multiple tokens and build one query "attribute:token1~ AND 
> attribute:token2~ AND ...". Maybe not really what I should do to get 
> correct scores.

don't know if there is another way to express this with ferret. 


cheers,
Jens

-- 
webit! Gesellschaft für neue Medien mbH          www.webit.de
Dipl.-Wirtschaftsingenieur Jens Krämer       [EMAIL PROTECTED]
Schnorrstraße 76                         Tel +49 351 46766  0
D-01069 Dresden                          Fax +49 351 46766 66
_______________________________________________
Ferret-talk mailing list
[email protected]
http://rubyforge.org/mailman/listinfo/ferret-talk

Reply via email to