There’s a fundamental downside of embedded zk or putting zk on the same node as 
Solr. When there are many queries running, a Solr replica gets overloaded and 
it starts to affect the node itself (sometimes you can’t even ssh into the node 
when Solr is too busy), it also affects the Zk on the same node, which can 
cause a chain of downed solr replicas and zk nodes, reducing the overall 
availability of the cluster. 

When you need a high read high write cluster, it’s better to put zookeeper on 
its own machines I think. 

-ufuk yilmaz 

> On 23 Nov 2023, at 00:25, Jan Høydahl <jan....@cominvent.com> wrote:
> 
> I think there is a product that speaks ZK api but stores everything in etcd :)
> 
> Also, you may want to check out 
> https://cwiki.apache.org/confluence/display/SOLR/SIP-18%3A+A+Solr+Kubernetes+Module+for+native+integration
>  , an initiative that will let you use ConfigMaps for configSet files etc. 
> I.e. somewhat more cloud native. And 
> https://cwiki.apache.org/confluence/display/SOLR/SIP-14+Embedded+Zookeeper, 
> if it gains traction, will embed ZK in Solr processes, making it an 
> implementation detail that the users dont need to manage separately. 
> 
> Jan
> 
>> 22. nov. 2023 kl. 14:30 skrev matthew sporleder <msporle...@gmail.com>:
>> 
>> It's annoying that zookeeper is a hard dependency when etcd is just
>> sitting there and all of the stuff in zk could easily fit into
>> ConfigMaps.
> 

Reply via email to