Hello,

I am new to SolrCloud and Zookeeper, and there's a piece of the puzzle on which 
I'm unclear.  I don't understand whether/how SolrCloud and Zookeeper handles 
the high availability aspect.  That is, once I have the Zookeeper ensemble set 
up, now how do I reference the collection in a "high-availability" fashion so 
that if one Solr instance is down, it will automatically route to another 
instance?

Am I thinking that SolrCloud does more than it actually does?  That is, could I 
instead handle all Load Balancing outside of Zookeeper, and just use our 
company's network load balancer to route/distribute traffic for a single 
consolidated URL:port to the different actual Solr Instances/Ports?

Here's more details in case it helps:
I have an ASP.NET application which uses SolrNet to query and write to a 
standalone Solr instance.  In my program , I specify the hostname:port of my 
Solr instance:
                Startup.Init<SolrDoc>("hostname:port");
This setup was used for development and testing.

Now, for production, I need to implement Solr in a H.A. fashion so that I don't 
have a single point of failure.    So, following Apache's documentation, I 
understand that the solution would be to set up an external Zookeeper Ensemble. 
So, let's say I set up a new Zookeeper Ensemble (three instances). Let's also 
say I have three Solr instances, #shards = 2, #replicas per shard = 2.

Once I have the Zookeeper ensemble set up, now how do I reference the 
collection?
Previously, my code references a single standalone Solr instance, 
"http://solrserver1:9983";. What do I point it to now?  

If I point it to any of the three Solr instances, and if that instance I'm 
referencing goes down, it means the request would fail, right?
        
I read similar questions online, for example 
https://groups.google.com/forum/#!msg/solrnet/-PeaGrLAMtw/pAfxuoYLVnIJ, and the 
answer seems to be that SolrNet doesn't support this type of HA.  So, do I 
understand correctly that the load balancing aspect needs to be handled on the 
client-side (or via a network load balancer), not the zookeeper/solrcloud 
server side?

Thanks,
Mike


 

Reply via email to