[ 
https://issues.apache.org/jira/browse/SOLR-1632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789607#action_12789607
 ] 

Andrzej Bialecki  commented on SOLR-1632:
-----------------------------------------

I believe the API that I propose would support such implementation as well. 
Please note that it's usually not feasible to compute and distribute the 
complete IDF table for all terms - you would have to replicate a union of all 
term dictionaries across the cluster. In practice, you limit the amount of 
information by various means, e.g. only distributing data related to the 
current request (this implementation) or reducing the frequency of updates 
(e.g. LRU caching), or approximating global DF with a constant for frequent 
terms (where the contribution of their IDF to the score would be negligible 
anyway).

> Distributed IDF
> ---------------
>
>                 Key: SOLR-1632
>                 URL: https://issues.apache.org/jira/browse/SOLR-1632
>             Project: Solr
>          Issue Type: New Feature
>          Components: search
>    Affects Versions: 1.5
>            Reporter: Andrzej Bialecki 
>         Attachments: distrib.patch
>
>
> Distributed IDF is a valuable enhancement for distributed search across 
> non-uniform shards. This issue tracks the proposed implementation of an API 
> to support this functionality in Solr.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to