[
https://issues.apache.org/jira/browse/SOLR-6816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14383567#comment-14383567
]
Per Steffensen commented on SOLR-6816:
--------------------------------------
bq. if a task fails, then Hadoop usually re-tries that task a couple of times,
meaning all docs in the block that failed will be sent again
We do not send all documents again if just a few in a batch (bulk) fails. Lets
say you send a batch of 1000 docs for indexing and only 2 fails due to e.g.
version-control, we only do another round on those 2 documents - SOLR-3382
> Review SolrCloud Indexing Performance.
> --------------------------------------
>
> Key: SOLR-6816
> URL: https://issues.apache.org/jira/browse/SOLR-6816
> Project: Solr
> Issue Type: Task
> Components: SolrCloud
> Reporter: Mark Miller
> Priority: Critical
> Attachments: SolrBench.pdf
>
>
> We have never really focused on indexing performance, just correctness and
> low hanging fruit. We need to vet the performance and try to address any
> holes.
> Note: A common report is that adding any replication is very slow.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]