Re: BM25 & field-configurable similarity ?

Grant Ingersoll Mon, 10 Mar 2008 19:03:41 -0700


On Mar 10, 2008, at 8:43 PM, Chris Hostetter wrote:

: Does it make (any) sense to try implementing this within Solr or should I
: just forget about this?
: As a more general note, does it make sense to try to use Solr as a
: "research" playground for similarities instead of Lucene? Or is this the
: "wrong" level (aka Lucene being a better one)?

If I were going to sit down and really research alternate Similarity
systems, I would use Lucene directly. Solr adds a lot of nice features
and abstractions, but for experimentation like this, those features and
abstractions can get in the way. In addition, the benchmarking contrib
in Lucene is designed to make it really easy to run lots of repeatable
tests changing small variables. I believe Grant already did some work to
support evaluating "quality" metrics, so you just have to decide what
"good" is, and then you can run lots of tests where you change lots of
variables in your custom similarity to see which combination of
variables gets you the closest to "good".

FYI, it was Doron that hooked in the quality stuff, but the point is
valid. Lucene contrib/benchmark is a better place for doing low-level
similarity testing.
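
To make the "change variables, measure, repeat" loop concrete, here is a rough standalone sketch. It is not the contrib/benchmark API and does not touch Lucene at all: it scores a toy corpus with the textbook BM25 formula and sweeps the k1 and b parameters, using mean reciprocal rank as a stand-in for "good". The corpus, the relevance judgments, and every function name below are made up for illustration.

```python
import math
from itertools import product

# Hypothetical toy corpus, query set, and relevance judgments (illustration only).
DOCS = {
    "d1": "lucene similarity scoring with bm25".split(),
    "d2": "solr adds features and abstractions on top of lucene".split(),
    "d3": ("bm25 bm25 ranking function with term frequency saturation "
           "and length normalization in lucene search").split(),
}
QUERIES = {"q1": ["bm25", "similarity"]}
RELEVANT = {"q1": {"d1"}}


def bm25_score(query, doc, docs, k1, b):
    """Score one tokenized document against a query with textbook BM25."""
    n_docs = len(docs)
    avg_len = sum(len(d) for d in docs.values()) / n_docs
    score = 0.0
    for term in query:
        tf = doc.count(term)
        if tf == 0:
            continue  # term absent: contributes nothing
        df = sum(1 for d in docs.values() if term in d)
        idf = math.log(1 + (n_docs - df + 0.5) / (df + 0.5))
        length_norm = k1 * (1 - b + b * len(doc) / avg_len)
        score += idf * tf * (k1 + 1) / (tf + length_norm)
    return score


def reciprocal_rank(ranking, relevant):
    """Return 1/rank of the first relevant document, or 0.0 if none is found."""
    for rank, doc_id in enumerate(ranking, start=1):
        if doc_id in relevant:
            return 1.0 / rank
    return 0.0


def evaluate(k1, b):
    """Mean reciprocal rank over all queries for one (k1, b) setting."""
    total = 0.0
    for qid, query in QUERIES.items():
        ranking = sorted(
            DOCS,
            key=lambda d: bm25_score(query, DOCS[d], DOCS, k1, b),
            reverse=True,
        )
        total += reciprocal_rank(ranking, RELEVANT[qid])
    return total / len(QUERIES)


def sweep(k1_values, b_values):
    """Evaluate every (k1, b) combination and return the best one plus all results."""
    results = {(k1, b): evaluate(k1, b) for k1, b in product(k1_values, b_values)}
    best = max(results, key=results.get)
    return best, results


if __name__ == "__main__":
    best, results = sweep([0.5, 1.2, 2.0], [0.0, 0.75, 1.0])
    for params, mrr in sorted(results.items()):
        print(params, round(mrr, 3))
    print("best (k1, b):", best)
```

With real data you would swap the toy corpus for an actual collection with judgments and let the benchmark harness drive the runs; the shape of the loop stays the same.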
