[
https://issues.apache.org/jira/browse/LUCENE-3722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13192792#comment-13192792
]
Robert Muir commented on LUCENE-3722:
-------------------------------------
{quote}
Ok, it makes sense in local (multi reader) situations but in distributed
scenario it may be still acceptable to lose just a part of the statistics from
one shard while keeping the stats from other shards.
{quote}
I really think we should not do this: it can result in NaN/Inf/negative scores
to have 'invalid'
statistics (this causes serious problems!), but we have (and test) that all the
sims fallback
gracefully for the -1 case.
-1 means 'preflex codec does not support the stat'. Once we no longer have to
worry about 3.x
indexes, we no longer need to worry about -1.
> make similarities/term/collectionstats take long (for > 2B docs)
> ----------------------------------------------------------------
>
> Key: LUCENE-3722
> URL: https://issues.apache.org/jira/browse/LUCENE-3722
> Project: Lucene - Java
> Issue Type: Improvement
> Affects Versions: 4.0
> Reporter: Robert Muir
> Attachments: LUCENE-3722.patch, LUCENE-3722.patch
>
>
> As noted by Yonik and Andrzej on SOLR-1632, this would be useful for
> distributed scoring.
> we can also add a sugar method add() to both of these to make it easier to
> sum.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]