[ 
https://issues.apache.org/jira/browse/SOLR-6450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14115174#comment-14115174
 ] 

Mark Miller commented on SOLR-6450:
-----------------------------------

It might be even more interesting to work out how to send the docs to replicas 
in parallel with indexing on the leader, rather than after it.
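
As a rough illustration of the ordering, here is a minimal sketch in plain Java. It is not Solr code: the class name, the indexAndForward method, and the Consumer callbacks standing in for "index on the leader" and "send to a replica" are all hypothetical. It ignores versioning, retries and failure handling, which a real implementation would have to address.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.function.Consumer;

// Hypothetical sketch: none of these names come from Solr.
public class ParallelForwardSketch {

    /**
     * Starts the leader's local indexing and the forwarding to every replica
     * at the same time, then blocks until all of them have finished.
     */
    public static void indexAndForward(String doc,
                                       Consumer<String> leaderIndexer,
                                       List<Consumer<String>> replicaSenders,
                                       ExecutorService pool) {
        List<CompletableFuture<Void>> tasks = new ArrayList<>();

        // Local indexing is submitted first, but nothing waits on it yet.
        tasks.add(CompletableFuture.runAsync(() -> leaderIndexer.accept(doc), pool));

        // Forwarding is submitted right away instead of after local indexing.
        for (Consumer<String> sender : replicaSenders) {
            tasks.add(CompletableFuture.runAsync(() -> sender.accept(doc), pool));
        }

        // Return only once the leader and all replicas are done;
        // join() rethrows if any of the tasks failed.
        CompletableFuture.allOf(tasks.toArray(new CompletableFuture[0])).join();
    }

    public static void main(String[] args) {
        ExecutorService pool = Executors.newFixedThreadPool(3);
        List<String> log = Collections.synchronizedList(new ArrayList<>());

        List<Consumer<String>> replicas = new ArrayList<>();
        replicas.add(d -> log.add("replica1 received " + d));
        replicas.add(d -> log.add("replica2 received " + d));

        try {
            indexAndForward("doc-1", d -> log.add("leader indexed " + d), replicas, pool);
        } finally {
            pool.shutdown();
        }
        System.out.println(log.size() + " operations completed");
    }
}
```

With this shape the total latency is roughly that of the slowest participant rather than the leader's indexing time plus the replica round trip. The cost is that a replica may receive a doc the leader then fails to index, so the leader would need some way to undo or reconcile that.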

> Option to send pre-analyzed documents from leader to replica instead of 
> replicas re-running analysis.
> -----------------------------------------------------------------------------------------------------
>
>                 Key: SOLR-6450
>                 URL: https://issues.apache.org/jira/browse/SOLR-6450
>             Project: Solr
>          Issue Type: Improvement
>          Components: SolrCloud
>            Reporter: Timothy Potter
>
> Given that the leader has to run the full update processor chain on each 
> document (text analysis, etc.), it would be good to have it send a 
> pre-analyzed document to replicas (to improve near-real-time replication), 
> allowing the replica to avoid redoing expensive work.
> Thought should also be given to allowing the leader to accept pre-analyzed 
> documents, so that document analysis could be offloaded to external 
> processes. For instance, thousands of Storm workers could do the analysis 
> and then send pre-analyzed documents to Solr.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

