Re: SolrCloud load balancing during heavy indexing

Otis Gospodnetic Fri, 25 Apr 2014 14:07:36 -0700

Hi,

On Fri, Apr 25, 2014 at 12:54 PM, zzT <[email protected]> wrote:


> Erick Erickson wrote
> > Back up, you're misunderstanding the update process. A leader node
> > distributes the update to every replica. So _all_ your nodes in a
> > slice are indexing when _any_ of them index. So the idea of sending
> > queries to just the replicas to avoid performance problems isn't
> > relevant.
>
> Hmm, I thought that it's not actual indexing taking place on the replicas
> but that the changes were somehow transferred to the replicas and thus it
> was less intensive for them.
>

Unfortunately that's not the case.  Each node that gets a doc still has to
analyze and index it.  I think at some point I sent a message to the list
and/or created a JIRA issue to suggest doing analysis on just the receiving
node, in which case the other nodes that need to index could skip that step
and do a little less work, but that hasn't been implemented yet.

Otis
--
Performance Monitoring * Log Analytics * Search Analytics
Solr & Elasticsearch Support * http://sematext.com/



>
>
> Erick Erickson wrote
> > In order to support NRT and HA/DR, it's required that all the nodes be
> > ready to take over, so the notion of the leader being the only node
> > that actually indexed the documents then distributing only the indexed
> > document to the other members of the slice isn't how it's done.
>
> So, this is where SolrCloud is different from legacy master/slave
> configuration? I mean master/slave sends segments to the slaves using e.g.
> rsync while SolrCloud forwards the indexing request to replicas where it's
> processed "locally" on each replica, right?
>
>
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/SolrCloud-load-balancing-during-heavy-indexing-tp4133099p4133160.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>

Re: SolrCloud load balancing during heavy indexing

Reply via email to