[ 
https://issues.apache.org/jira/browse/LUCENE-3722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13192792#comment-13192792
 ] 

Robert Muir commented on LUCENE-3722:
-------------------------------------

{quote}
Ok, it makes sense in local (multi reader) situations but in distributed 
scenario it may be still acceptable to lose just a part of the statistics from 
one shard while keeping the stats from other shards.
{quote}

I really think we should not do this: it can result in NaN/Inf/negative scores 
to have 'invalid'
statistics (this causes serious problems!), but we have (and test) that all the 
sims fallback
gracefully for the -1 case.

-1 means 'preflex codec does not support the stat'. Once we no longer have to 
worry about 3.x
indexes, we no longer need to worry about -1.
                
> make similarities/term/collectionstats take long (for > 2B docs)
> ----------------------------------------------------------------
>
>                 Key: LUCENE-3722
>                 URL: https://issues.apache.org/jira/browse/LUCENE-3722
>             Project: Lucene - Java
>          Issue Type: Improvement
>    Affects Versions: 4.0
>            Reporter: Robert Muir
>         Attachments: LUCENE-3722.patch, LUCENE-3722.patch
>
>
> As noted by Yonik and Andrzej on SOLR-1632, this would be useful for 
> distributed scoring.
> we can also add a sugar method add() to both of these to make it easier to 
> sum.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to