I dropped the notes that Gus made into a Miro board just to visualize it.. https://miro.com/app/board/o9J_lm8BmXE=/?moveToWidget=3458764558006503531&cot=14
Be interesting during one of our community calls to talk about “what do we have” and “what are we missing?”…. Feels to me like a number of our SIPs actually map to this progression. Eric > On Jun 20, 2023, at 12:04 PM, Eric Pugh <ep...@opensourceconnections.com> > wrote: > > I think this is a very interesting progression….. It’s a really nice > mental model of “what tools should I reach for when?" > >> On Jun 20, 2023, at 11:41 AM, Gus Heck <gus.h...@gmail.com> wrote: >> >> I'm not familiar with these classes, but I'm not particularly fond of >> anything that leads us in a direction of requiring a *third* installation >> for initial use. (Zookeeper being #2 already). That said, we really need a >> good replacement for autoscaling, and large installs might reasonably want >> to offload any non query work. Ideally, we would have a smooth transition >> so that users can easily follow this path: >> >> 1. Single Cloud Node, embedded zookeeper, local management (mostly >> unused, maybe not loaded) >> 2. A few Cloud nodes (2-5), embedded zookeeper, local management >> 3. Moderate cloud (6-12 nodes), embedded zookeeper on subset of nodes, >> local management >> 4. Large cloud 13-25 nodes, external zookeeper, local management >> 5. > 25 nodes, external zookeeper, management local or external. >> 6. >100 nodes, recommended external zk and management >> >> Thus folks doing moderate stuff don't need to bother with installing >> anything other than Solr. Somewhere along that scale they would likely >> start using tlog and then tlog/pull setups as well. Ideally we would have a >> clear path to make these transitions with minimal downtime. >> >> So if we can fit what these classes do into that dream, great. If they >> point elsewhere meh. >> >> Note: of course none of this has anything to do with "user-managed" Solr >> (a.k.a. legacy solr) which is managed manually by users and doesn't have zk. >> >> >> On Mon, Jun 19, 2023, 4:42 PM David Smiley <dsmi...@apache.org> wrote: >> >>> I noticed the SolrCloudManager concept added some time ago brought about to >>> abstract away SolrCloud in the context of doing simulated experiments on >>> auto-scaling. Essentially -- need to simulate SolrCloud and not actually >>> use a real SolrCloud. But that need and code went away in 9.0... >>> nonetheless SolrCloudManager and its friends (like DistributedStateManager) >>> are still around. I could imagine someone advocating for them >>> nonetheless. But the present state is very half-implemented as there is >>> code all over the place that assumes ZooKeeper (e.g. uses SolrZkClient or >>> ZkStateReader) instead of some of these abstractions. I think there is a >>> need to set a direction here -- do we embrace abstracting SolrCloud within >>> Solr or do we revert this stuff as needless indirection / concepts. >>> >>> I think there's lots of room to debate / review the particulars of >>> SolrCloudManager and friends if we do want to keep it. >>> DistributedQueueFactory isn't even used anymore. NodeStateProvider is only >>> for AttributeFactory; not very obvious. DistribStateManager is essentially >>> SolrZkClient but nonetheless still references ZK classes. >>> >>> ~ David Smiley >>> Apache Lucene/Solr Search Developer >>> http://www.linkedin.com/in/davidwsmiley >>> > > _______________________ > Eric Pugh | Founder & CEO | OpenSource Connections, LLC | 434.466.1467 | > http://www.opensourceconnections.com <http://www.opensourceconnections.com/> > | My Free/Busy <http://tinyurl.com/eric-cal> > Co-Author: Apache Solr Enterprise Search Server, 3rd Ed > <https://www.packtpub.com/big-data-and-business-intelligence/apache-solr-enterprise-search-server-third-edition-raw> > > This e-mail and all contents, including attachments, is considered to be > Company Confidential unless explicitly stated otherwise, regardless of > whether attachments are marked as such. > _______________________ Eric Pugh | Founder & CEO | OpenSource Connections, LLC | 434.466.1467 | http://www.opensourceconnections.com <http://www.opensourceconnections.com/> | My Free/Busy <http://tinyurl.com/eric-cal> Co-Author: Apache Solr Enterprise Search Server, 3rd Ed <https://www.packtpub.com/big-data-and-business-intelligence/apache-solr-enterprise-search-server-third-edition-raw> This e-mail and all contents, including attachments, is considered to be Company Confidential unless explicitly stated otherwise, regardless of whether attachments are marked as such.