Re: Disabling tf (term frequency) during indexing and/or scoring

2013-05-13 Thread tasmaniski
This is an old post; there is now a solution in Solr: omitTermFreqAndPositions=true (see http://wiki.apache.org/solr/SchemaXml#Data_Types).

Re: Disabling tf (term frequency) during indexing and/or scoring

2009-09-18 Thread Aaron McKee
Hi Alexey, Thank you for your suggestion! My understanding of Similarity, though, is that this would affect the entire index, whereas I need something that is field-configurable. Looking at Similarity.tf(), it seems to be independent of the field (and unaware of it). I don't necessarily want

Re: Disabling tf (term frequency) during indexing and/or scoring

2009-09-18 Thread Yonik Seeley
On Fri, Sep 18, 2009 at 9:38 AM, Aaron McKee ucbmc...@gmail.com wrote: I suppose I'm curious why the omitTfAndPositions option conflates two apparently independent features. This relates to the index format, and is more for performance/size benefits when they are not needed. In the index, it's

Re: Disabling tf (term frequency) during indexing and/or scoring

2009-09-18 Thread Walter Underwood
Though it would be possible to calculate a binary tf, where the score is 1 if there are one or more occurrences of the term. --wunder On Sep 18, 2009, at 7:08 AM, Yonik Seeley wrote: On Fri, Sep 18, 2009 at 9:38 AM, Aaron McKee ucbmc...@gmail.com wrote: I suppose I'm curious why the
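
A minimal sketch of the binary tf idea mentioned above, written in plain Python rather than against Lucene's Java API. All function names here are hypothetical. The square-root tf and the idf formula follow Lucene's DefaultSimilarity; norms, boosts, coord and queryNorm are left out, so the numbers are illustrative only.

```python
import math


def default_tf(freq):
    """Square-root term frequency, as in Lucene's DefaultSimilarity."""
    return math.sqrt(freq)


def binary_tf(freq):
    """Binary term frequency: 1.0 if the term occurs at all, else 0.0."""
    return 1.0 if freq > 0 else 0.0


def idf(doc_freq, num_docs):
    """Inverse document frequency, as in Lucene's DefaultSimilarity."""
    return 1.0 + math.log(num_docs / (doc_freq + 1))


def term_score(freq, doc_freq, num_docs, tf=default_tf):
    """Simplified per-term score: tf * idf^2.

    Assumption: length norms, boosts, coord and queryNorm are ignored.
    """
    return tf(freq) * idf(doc_freq, num_docs) ** 2


if __name__ == "__main__":
    # Hypothetical numbers: a term found in 10 of 1000 documents,
    # appearing once in one title and seven times in another.
    for freq in (1, 7):
        print(
            freq,
            term_score(freq, 10, 1000),
            term_score(freq, 10, 1000, tf=binary_tf),
        )
```

With binary_tf, a field that repeats a term scores the same as one that mentions it once, while the idf contribution for rare terms is kept.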

Re: Disabling tf (term frequency) during indexing and/or scoring

2009-09-18 Thread Aaron McKee
Hi Yonik, Thank you for the explanation. If the primary goal was to save index space for a very specific subclass of fields, the implementation certainly makes more sense. I wonder, though, if it could also make sense to support a query-time only boolean to optionally disable TF

Re: Disabling tf (term frequency) during indexing and/or scoring

2009-09-18 Thread Aaron McKee
Hi Yonik, For my particular needs, IDF considerations are fine and helpful; if a user is requesting a rare term/phrase, increasing the score based on that makes sense as the match has higher confidence. I simply need to compensate for title and category type fields that may contain redundant

RE: Disabling tf (term frequency) during indexing and/or scoring

2009-09-18 Thread Walter Underwood
: Friday, September 18, 2009 8:33 AM To: solr-user@lucene.apache.org Subject: Re: Disabling tf (term frequency) during indexing and/or scoring Hi Yonik, For my particular needs, IDF considerations are fine and helpful; if a user is requesting a rare term/phrase, increasing the score based on that makes

Re: Disabling tf (term frequency) during indexing and/or scoring

2009-09-16 Thread Alexey Serba
Hi Aaron, You can override the default Lucene Similarity and disable the tf and lengthNorm factors in the scoring formula (see http://lucene.apache.org/java/2_4_1/api/org/apache/lucene/search/Similarity.html and http://lucene.apache.org/java/2_4_1/api/index.html). You need to 1) compile the following

Re: Disabling tf (term frequency) during indexing and/or scoring

2009-09-16 Thread Erik Hatcher
Just FYI - you can put Solr plugins in solr-home/lib as JAR files rather than messing with solr.war Erik On Sep 16, 2009, at 10:15 AM, Alexey Serba wrote: Hi Aaron, You can overwrite default Lucene Similarity and disable tf and lengthNorm factors in scoring formula ( see