[jira] [Commented] (SOLR-4260) Inconsistent numDocs between leader and replica

Markus Jelsma (JIRA) Thu, 18 Jul 2013 08:13:37 -0700

    [ 
https://issues.apache.org/jira/browse/SOLR-4260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13712400#comment-13712400
 ]


Markus Jelsma commented on SOLR-4260:
-------------------------------------

Alright, nothing looks like zookeeper expirations i grepped expirations in the 
error log but there's nothing there. This indexing session did not produce so 
many inconsistencies as the previous one; there is only 1 shard of which one 
replica has 2 more documents. It won't fix itself.

During indexing there were, as usual, error such as autocommit causing a 
searcher too many and time outs talking to other nodes.

Only 2 nodes report a Stopping Recovery For of which one node actually has a 
replica of the inconsistent core. The other shard is seems fine, both replica's 
have the same numDocs.
                
> Inconsistent numDocs between leader and replica
> -----------------------------------------------
>
>                 Key: SOLR-4260
>                 URL: https://issues.apache.org/jira/browse/SOLR-4260
>             Project: Solr
>          Issue Type: Bug
>          Components: SolrCloud
>    Affects Versions: 5.0
>         Environment: 5.0.0.2013.01.04.15.31.51
>            Reporter: Markus Jelsma
>            Priority: Critical
>             Fix For: 5.0
>
>
> After wiping all cores and reindexing some 3.3 million docs from Nutch using 
> CloudSolrServer we see inconsistencies between the leader and replica for 
> some shards.
> Each core hold about 3.3k documents. For some reason 5 out of 10 shards have 
> a small deviation in then number of documents. The leader and slave deviate 
> for roughly 10-20 documents, not more.
> Results hopping ranks in the result set for identical queries got my 
> attention, there were small IDF differences for exactly the same record 
> causing a record to shift positions in the result set. During those tests no 
> records were indexed. Consecutive catch all queries also return different 
> number of numDocs.
> We're running a 10 node test cluster with 10 shards and a replication factor 
> of two and frequently reindex using a fresh build from trunk. I've not seen 
> this issue for quite some time until a few days ago.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Commented] (SOLR-4260) Inconsistent numDocs between leader and replica

Reply via email to