[jira] Commented: (LUCENE-517) norm compression breaks ranking for small fields

Yonik Seeley (JIRA) Fri, 10 Mar 2006 13:51:19 -0800

    [ http://issues.apache.org/jira/browse/LUCENE-517?page=comments#action_12369928 ]


Yonik Seeley commented on LUCENE-517:
-------------------------------------

Yes, the error bars seem kind of large for the normal usage of norms, which is 
just length normalization if you don't include boosts.  You could still use a 
single byte, but increase the number of bits dedicated to the mantissa to get 
better resolution (but with less range).

http://lucene.apache.org/java/docs/api/org/apache/lucene/util/SmallFloat.html
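
To make the trade-off concrete, here is a minimal sketch of a one-byte float encoding with a configurable number of mantissa bits. It is only loosely modeled on the idea behind the SmallFloat class linked above and is not copied from the Lucene source: the class name NormCodec, the methods encode/decode/lengthNorm, and the two parameter choices (2 mantissa bits with a smallest exponent of -31, and 4 mantissa bits with a smallest exponent of -15) are all assumptions made for illustration.

```java
// Hypothetical helper, not part of Lucene: packs a non-negative float into one byte.
final class NormCodec {

    /** Length norm for a field with numTerms terms, ignoring boosts. */
    static float lengthNorm(int numTerms) {
        return (float) (1.0 / Math.sqrt(numTerms));
    }

    /**
     * Encodes f by truncation. The low mantissaBits bits of the result hold the
     * top mantissa bits of the float; the remaining high bits hold the exponent,
     * counted upward from minExp. More mantissa bits means finer steps but
     * fewer representable powers of two.
     */
    static byte encode(float f, int mantissaBits, int minExp) {
        int bits = Float.floatToRawIntBits(f);
        if (bits <= 0) {
            return 0;                               // zero and negative values map to 0
        }
        int shift = 23 - mantissaBits;              // low float mantissa bits that are dropped
        int base = (127 + minExp) << mantissaBits;  // truncated bit pattern of 2^minExp
        int code = (bits >> shift) - base;
        if (code <= 0) {
            return 1;                               // underflow: smallest non-zero code
        }
        if (code >= 256) {
            return (byte) 255;                      // overflow: largest code
        }
        return (byte) code;
    }

    /** Inverse of encode, up to the truncation error. */
    static float decode(byte b, int mantissaBits, int minExp) {
        if (b == 0) {
            return 0.0f;
        }
        int base = (127 + minExp) << mantissaBits;
        return Float.intBitsToFloat(((b & 0xff) + base) << (23 - mantissaBits));
    }

    public static void main(String[] args) {
        for (int numTerms = 1; numTerms <= 10; numTerms++) {
            float norm = lengthNorm(numTerms);
            float coarse = decode(encode(norm, 2, -31), 2, -31); // wide range, coarse steps
            float fine = decode(encode(norm, 4, -15), 4, -15);   // narrower range, finer steps
            System.out.println(numTerms + " terms: norm=" + norm
                    + " coarse=" + coarse + " fine=" + fine);
        }
    }
}
```

With the coarse split in this sketch, fields of 3 and 4 terms decode to the same value (0.5), while the 4-bit split keeps them apart (0.5625 versus 0.5). The price is range: the finer split here cannot represent values below 2^-15 or above roughly 2.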

You could easily make the change for your index, but it would break existing 
indexes if we changed the default in Lucene.


> norm compression breaks ranking for small fields
> ------------------------------------------------
>
>          Key: LUCENE-517
>          URL: http://issues.apache.org/jira/browse/LUCENE-517
>      Project: Lucene - Java
>         Type: Bug
>   Components: Index, Search
>     Versions: 1.9
>  Environment: N/A
>     Reporter: Randy Puttick

>
> The scheme of compressing document norms to one byte loses a lot of 
> information.  This completely breaks search ranking on small fields because 
> there is no way to see the difference between documents with shorter and 
> longer fields that contain the same number of matching query terms.  
> Unfortunately the export of norms as a byte array seems to be pretty well 
> embedded in the code base so a fix would seem to require a major rev.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
