[ 
https://issues.apache.org/jira/browse/SOLR-8500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15120152#comment-15120152
 ] 

Erick Erickson commented on SOLR-8500:
--------------------------------------

First let me say I have only the most cursory understanding of "the reordering 
problem". My assumption is that since CUSC is batching up sub-lists of the 
update set and sending them in parallel that if doc1 is followed by doc2 in the 
original list, doc2 might get to the indexing node before doc1, be it an 
update, delete, add, whatever.

That said, I don't really understand how reordering matters if (as per the 
original problem statement), it's _guaranteed_ that each document is new and is 
submitted exactly once _ever_. I guess another important restriction is that 
the client doesn't care if docs get  into the index in a different order than 
they were sent. How would correctness be threatened in that situation?

If the concern is that this is a too-specialized use-case that allows people to 
set it and shoot themselves in the foot too easily, that's a point. I just 
don't get why, in this specific use-case, this is a correctness question.

All that said, if Yonik's  fingerprint stuff is going in relatively soon, it's 
probably all moot and we can just wait on this...

> Allow the number of threads ConcurrentUpdateSolrClient StreamingSolrClients 
> configurable by a system property
> -------------------------------------------------------------------------------------------------------------
>
>                 Key: SOLR-8500
>                 URL: https://issues.apache.org/jira/browse/SOLR-8500
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Erick Erickson
>            Assignee: Erick Erickson
>            Priority: Minor
>         Attachments: SOLR-8500.patch
>
>
> Despite the warning in that code, in extremely high throughput situations 
> where there are guaranteed to be no updates to existing documents, it can be 
> useful to have more than one runner.
> I envision this as an "expert" kind of thing, used only in situations where 
> the a-priori knowledge is that there are no updates to existing documents.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to