[
https://issues.apache.org/jira/browse/SOLR-6450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14115174#comment-14115174
]
Mark Miller commented on SOLR-6450:
-----------------------------------
It's almost more interesting to try working out sending the docs to replicas in
parallel with indexing on the leader instead of after.
> Option to send pre-analyzed documents from leader to replica instead of
> replicas re-running analysis.
> -----------------------------------------------------------------------------------------------------
>
> Key: SOLR-6450
> URL: https://issues.apache.org/jira/browse/SOLR-6450
> Project: Solr
> Issue Type: Improvement
> Components: SolrCloud
> Reporter: Timothy Potter
>
> Given the leader has to run the full update processor chain on each document
> (text analysis, etc), it would be good to have it send a pre-analyzed
> document to replicas (to improve near realtime replication), allowing the
> replica to avoid re-doing expensive work.
> Thought should be given about allowing the leader to accept pre-analyzed as
> well, so that you could off-load the document analysis to external processes.
> For instance, have 1000's of Storm workers doing the analysis and then
> sending pre-analyzed documents to Solr.
--
This message was sent by Atlassian JIRA
(v6.2#6252)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]