[jira] [Comment Edited] (SOLR-9925) Child documents missing from replicas during parallel delete+add

Brandon Chapman (JIRA) Mon, 26 Jun 2017 13:05:26 -0700

    [ 
https://issues.apache.org/jira/browse/SOLR-9925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16063663#comment-16063663
 ]


Brandon Chapman edited comment on SOLR-9925 at 6/26/17 8:04 PM:
----------------------------------------------------------------

We have seen this same issue on Solr 6.4.2. If you send deleteByQuery 
{noformat}"_root_ :THE_ROOT"{noformat}, with an add for the same document in 
the same request, you can trigger this easily.

In our case we had to use a deleteByQuery due to deleteById not working 
correctly on sharded collections. Our only solution is to go back to a 
non-sharded collection so we can use deleteById. 


was (Author: bchapman):
We have seen this same issue on Solr 6.4.2. If you send deleteByQuery 
"_root_:THE_ROOT", with an add for the same document in the same request, you 
can trigger this easily.

In our case we had to use a deleteByQuery due to deleteById not working 
correctly on sharded collections. Our only solution is to go back to a 
non-sharded collection so we can use deleteById. 

> Child documents missing from replicas during parallel delete+add
> ----------------------------------------------------------------
>
>                 Key: SOLR-9925
>                 URL: https://issues.apache.org/jira/browse/SOLR-9925
>             Project: Solr
>          Issue Type: Bug
>      Security Level: Public(Default Security Level. Issues are Public) 
>    Affects Versions: 5.5.2, 6.3
>         Environment: Java 1.8 (OpenJDK) on both CentOS 6.7 and Ubuntu 16.04.1
>            Reporter: Dan Sirotzke
>         Attachments: generate.py, run.sh
>
>
> When pushing documents to Solr in parallel, doing a delete-by-query and then 
> add for the same set of IDs within each thread results in some of the 
> replicas missing some of the child documents.  All the parent documents are 
> successfully replicated.
> This appears to trigger some sort of race condition, since:
> * Documents are never missing from the leader.
> * Documents _might_ be missing from the replicas.
> * When they are missing, the number and which documents are different for 
> each replica and each run.
> * It happens more easily with large documents; my test script needs a huge 
> number of documents to trigger it a small number of times, whereas it happens 
> ~5% of the time on our dataset.
> * We're currently on Solr 5.5.2, but I've also managed to trigger it on 6.3.0
> * When not running anything in parallel, this doesn't occur.
> Quick aside, since this is surely the first thing that will jump out:  We 
> can't just do an update due to to the uniqueKey/\_root\_ issue behind 
> SOLR-5211.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Comment Edited] (SOLR-9925) Child documents missing from replicas during parallel delete+add

Reply via email to