Thanks shawn i will try to think in that way too :)

With Regards
Aman Tandon


On Fri, Jun 6, 2014 at 8:19 PM, Shawn Heisey <s...@elyograg.org> wrote:

> On 6/6/2014 8:31 AM, Aman Tandon wrote:
> > In my organisation we also want to implement the solrcloud, but the
> problem
> > is that, we are using the master-slave architecture and on master we do
> all
> > indexing, architecture of master is lower than the slaves.
> >
> > So if we implement the solrcloud in a fashion that master will be the
> > leader, and slaves will be the replicas then in that case, in the case of
> > high load leader can bear it,  I guess every query firstly goes to leader
> > then it distributes the request as i noticed from the logs and blogs :)
> >
> > As well as master is in NY and slaves are in Dallas, which also might
> cause
> > latency issue and it will instead fail our purpose of faster query
> response.
> >
> > So i thought to use this shards parameter so that we query only from the
> > replicas not to the leader so that leader just work fine. But we were not
> > sure about this shards parameter, what do you think? what should we do
> with
> > latency issue and shards parameter.
>
> SolrCloud does not yet have any way to prefer one set of replicas over
> the others, so if you just send it requests, they would be sent to both
> Dallas and New York, affecting search latency.  Local replica preference
> is a desperately needed feature.
>
> Old-style distributed search with the shards parameter, combined with
> master/slave replication, is an effective way to be absolutely sure
> which servers you are querying.
>
> I would actually recommend that you get rid of replication and have your
> index updating software update each copy of the index independently.
> This is how I do my Solr install.  It opens up a whole new set of
> possibilities -- you can change the schema and/or config on one set of
> servers, or upgrade any component -- Solr, Java, etc., without affecting
> the other set of servers at all.
>
> One note: in order for the indexing paradigm I've outlined to be
> actually effective, you must separately track which
> inserts/updates/deletes have been done for each server set.  If you
> don't do that, they can get out of sync when you restart a server.
> Also, if you don't do this, having a server is down for an extended
> period of time might cause all indexing activity to stop on BOTH server
> sets.
>
> Thanks,
> Shawn
>
>

Reply via email to