I dropped the notes that Gus made into a Miro board just to visualize them:
https://miro.com/app/board/o9J_lm8BmXE=/?moveToWidget=3458764558006503531&cot=14

It would be interesting during one of our community calls to talk about “what do
we have?” and “what are we missing?” It feels to me like a number of our SIPs
actually map to this progression.

Eric


> On Jun 20, 2023, at 12:04 PM, Eric Pugh <ep...@opensourceconnections.com> 
> wrote:
> 
> I think this is a very interesting progression. It’s a really nice
> mental model of “what tools should I reach for when?”
> 
>> On Jun 20, 2023, at 11:41 AM, Gus Heck <gus.h...@gmail.com> wrote:
>> 
>> I'm not familiar with these classes, but I'm not particularly fond of
>> anything that leads us in a direction of requiring a *third* installation
>> for initial use. (Zookeeper being #2 already). That said, we really need a
>> good replacement for autoscaling, and large installs might reasonably want
>> to offload any non query work. Ideally, we would have a smooth transition
>> so that users can easily follow this path:
>> 
>>   1. Single Cloud Node, embedded zookeeper, local management (mostly
>>   unused, maybe not loaded)
>>   2. A few Cloud nodes (2-5), embedded zookeeper, local management
>>   3. Moderate cloud (6-12 nodes), embedded zookeeper on subset of nodes,
>>   local management
>>   4. Large cloud (13-25 nodes), external zookeeper, local management
>>   5. > 25 nodes, external zookeeper, management local or external.
>>   6. >100 nodes, recommended external zk and management
>> 
>> Thus folks doing moderate stuff don't need to bother with installing
>> anything other than Solr. Somewhere along that scale they would likely
>> start using tlog and then tlog/pull setups as well. Ideally we would have a
>> clear path to make these transitions with minimal downtime.
>> 
>> So if we can fit what these classes do into that dream, great. If they
>> point elsewhere, meh.
>> 
>> Note: of course none of this has anything to do with "user-managed" Solr
>> (a.k.a. legacy solr) which is managed manually by users and doesn't have zk.
>> 
>> 
>> On Mon, Jun 19, 2023, 4:42 PM David Smiley <dsmi...@apache.org> wrote:
>> 
>>> I noticed that the SolrCloudManager concept, added some time ago, was brought
>>> about to abstract away SolrCloud in the context of doing simulated
>>> experiments on auto-scaling.  Essentially, there was a need to simulate
>>> SolrCloud and not actually use a real SolrCloud.  That need and code went
>>> away in 9.0, yet SolrCloudManager and its friends (like DistribStateManager)
>>> are still around.  I could imagine someone advocating for them
>>> nonetheless.  But the present state is very half-implemented as there is
>>> code all over the place that assumes ZooKeeper (e.g. uses SolrZkClient or
>>> ZkStateReader) instead of some of these abstractions.  I think there is a
>>> need to set a direction here -- do we embrace abstracting SolrCloud within
>>> Solr or do we revert this stuff as needless indirection / concepts?
>>> 
>>> I think there's lots of room to debate / review the particulars of
>>> SolrCloudManager and friends if we do want to keep it.
>>> DistributedQueueFactory isn't even used anymore.  NodeStateProvider is only
>>> for AttributeFactory; not very obvious.  DistribStateManager is essentially
>>> SolrZkClient but nonetheless still references ZK classes.
>>> 
>>> ~ David Smiley
>>> Apache Lucene/Solr Search Developer
>>> http://www.linkedin.com/in/davidwsmiley
>>> 
> 
> _______________________
> Eric Pugh | Founder & CEO | OpenSource Connections, LLC | 434.466.1467 | 
> http://www.opensourceconnections.com <http://www.opensourceconnections.com/> 
> | My Free/Busy <http://tinyurl.com/eric-cal>  
> Co-Author: Apache Solr Enterprise Search Server, 3rd Ed 
> <https://www.packtpub.com/big-data-and-business-intelligence/apache-solr-enterprise-search-server-third-edition-raw>
>   
> This e-mail and all contents, including attachments, is considered to be 
> Company Confidential unless explicitly stated otherwise, regardless of 
> whether attachments are marked as such.
> 

