Re: Removing lengthNorm from the calculation

2007-09-10 Thread Yonik Seeley
If you aren't using index-time document boosting, or field boosting
for that field specifically, then set omitNorms=true for that field
in the schema, shut down solr, completely remove the index, and then
re-index.

The norms for each field consist of the index-time boost multiplied by
the length normalization.

-Yonik


On 9/10/07, Kyle Banerjee [EMAIL PROTECTED] wrote:
 I know I'm missing something really obvious, but I'm spinning my
 wheels figuring out how to eliminate lengthNorm from the calculations.

 The specific problem I'm trying to solve is that naive queries are
 resulting in crummy short records near the top of the list. The
 reality is that the longer records tend to be higher quality, so if
 anything, they need to be emphasized.

 However, I'm missing something simple. Any advice or a pointer to an
 example I could model off would be greatly appreciated. Thanks,

 kyle


Re: Removing lengthNorm from the calculation

2007-09-10 Thread Mike Klaas

On 10-Sep-07, at 3:31 PM, Kyle Banerjee wrote:


I know I'm missing something really obvious, but I'm spinning my
wheels figuring out how to eliminate lengthNorm from the calculations.

The specific problem I'm trying to solve is that naive queries are
resulting in crummy short records near the top of the list. The
reality is that the longer records tend to be higher quality, so if
anything, they need to be emphasized.

However, I'm missing something simple. Any advice or a pointer to an
example I could model off would be greatly appreciated. Thanks,


My lengthNorm() method is filled with clauses like:

} else if (whatever.equals(fieldName)) {
  return super.lengthNorm(fieldName,  /
  Math.max(numTokens, MIN_LENGTH));

where MIN_LENGTH can be quite long for some fields.

-Mike