[ 
https://issues.apache.org/jira/browse/SOLR-2929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hoss Man resolved SOLR-2929.
----------------------------

    Resolution: Not A Problem

This is the expected behavior.

TermsComponent deals with the raw underlying Term Frequencies, which only 
reflected deleted documents (Note: updating a document is deleting the old 
instance and adding a new instance) when segments are merged and the deletions 
get expunged.

As noted on the wiki...

https://wiki.apache.org/solr/TermsComponent

bq. The doc frequencies returned are the number of documents that match the 
term, including any documents that have been marked for deletion but not yet 
removed from the index. 

                
> TermsComponent Adding entries
> -----------------------------
>
>                 Key: SOLR-2929
>                 URL: https://issues.apache.org/jira/browse/SOLR-2929
>             Project: Solr
>          Issue Type: Bug
>          Components: SearchComponents - other
>    Affects Versions: 3.3, 3.4
>         Environment: solr 3.x
>            Reporter: maillard
>            Priority: Minor
>
> When indexing multiple documents in one go and then updating one of the 
> documents in a later process Termscomponent count gets wrongfully incremented.
> example indexing two documents with a country field as such:
> <add>
> <doc>
> <field name="COUNTRY">US</field>
> <field name="ID">L20110121151204207</field>
> </doc>
> <doc>
> <field name="COUNTRY">Canada</field>
> <field name="ID">L20110121151204208</field>
> </doc>
> </add>
> Termscomponent returns:
>  US(1)
>  Canada(1)
> Update the first document:
> <add>
> <doc>
> <field name="COUNTRY">US</field>
> <field name="ID">L20110121151204207</field>
> </doc>
> </add>
> Termscomponent returns:
>  US(2)
>  Canada(1)
> There still are only two documents in the index.
> This does not happen when only dealing with a single doc, or when you update 
> the same set of documents you initially indexed.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to