I think this is a very interesting progression….. It’s a really nice mental model of “what tools should I reach for when?"
> On Jun 20, 2023, at 11:41 AM, Gus Heck <gus.h...@gmail.com> wrote: > > I'm not familiar with these classes, but I'm not particularly fond of > anything that leads us in a direction of requiring a *third* installation > for initial use. (Zookeeper being #2 already). That said, we really need a > good replacement for autoscaling, and large installs might reasonably want > to offload any non query work. Ideally, we would have a smooth transition > so that users can easily follow this path: > > 1. Single Cloud Node, embedded zookeeper, local management (mostly > unused, maybe not loaded) > 2. A few Cloud nodes (2-5), embedded zookeeper, local management > 3. Moderate cloud (6-12 nodes), embedded zookeeper on subset of nodes, > local management > 4. Large cloud 13-25 nodes, external zookeeper, local management > 5. > 25 nodes, external zookeeper, management local or external. > 6. >100 nodes, recommended external zk and management > > Thus folks doing moderate stuff don't need to bother with installing > anything other than Solr. Somewhere along that scale they would likely > start using tlog and then tlog/pull setups as well. Ideally we would have a > clear path to make these transitions with minimal downtime. > > So if we can fit what these classes do into that dream, great. If they > point elsewhere meh. > > Note: of course none of this has anything to do with "user-managed" Solr > (a.k.a. legacy solr) which is managed manually by users and doesn't have zk. > > > On Mon, Jun 19, 2023, 4:42 PM David Smiley <dsmi...@apache.org> wrote: > >> I noticed the SolrCloudManager concept added some time ago brought about to >> abstract away SolrCloud in the context of doing simulated experiments on >> auto-scaling. Essentially -- need to simulate SolrCloud and not actually >> use a real SolrCloud. But that need and code went away in 9.0... >> nonetheless SolrCloudManager and its friends (like DistributedStateManager) >> are still around. I could imagine someone advocating for them >> nonetheless. But the present state is very half-implemented as there is >> code all over the place that assumes ZooKeeper (e.g. uses SolrZkClient or >> ZkStateReader) instead of some of these abstractions. I think there is a >> need to set a direction here -- do we embrace abstracting SolrCloud within >> Solr or do we revert this stuff as needless indirection / concepts. >> >> I think there's lots of room to debate / review the particulars of >> SolrCloudManager and friends if we do want to keep it. >> DistributedQueueFactory isn't even used anymore. NodeStateProvider is only >> for AttributeFactory; not very obvious. DistribStateManager is essentially >> SolrZkClient but nonetheless still references ZK classes. >> >> ~ David Smiley >> Apache Lucene/Solr Search Developer >> http://www.linkedin.com/in/davidwsmiley >> _______________________ Eric Pugh | Founder & CEO | OpenSource Connections, LLC | 434.466.1467 | http://www.opensourceconnections.com <http://www.opensourceconnections.com/> | My Free/Busy <http://tinyurl.com/eric-cal> Co-Author: Apache Solr Enterprise Search Server, 3rd Ed <https://www.packtpub.com/big-data-and-business-intelligence/apache-solr-enterprise-search-server-third-edition-raw> This e-mail and all contents, including attachments, is considered to be Company Confidential unless explicitly stated otherwise, regardless of whether attachments are marked as such.