Hey Ilan,

Thanks for putting together this writeup.  I think I understand the goal
conceptually, and it sounds like a good one for Solr!  But I'm still having
trouble understanding how this all would actually work.  So a few
questions, inline:

> A fourth replica type called ZERO is introduced

Why the name "Zero"? Is it conveying something about the design that I'm
not picking up on?

> At Collection creation time, it is possible to specify that the
collection exclusively uses replicas of type ZERO rather than being a
“normal” collection that uses NRT/TLOG/PULL.

Am I correct in understanding this to mean that if "zero" is used, it must
be used for every replica in the collection?  If so, it almost sounds like
this isn't a new type of replica but a new "collection type" altogether?

> This allows scaling compute (more queries, more indexing) independently
of storage

I think the biggest question I have is: how does the "compute" side of this
actually work?

On the indexing side: what all happens in Solr before giving a response
back to users?  What happens on a commit?  Are updates indexed only on the
leader (like TLOG/PULL) or on all replicas (like NRT), or some other
arrangement altogether?

On the querying side: what situations cause index data to be pulled from
the remote store?

(These last questions might be a bit lengthy to get into via email, but
they should probably be in the writeup?  Not sure what's best there...)

Best,

Jason

On Sat, Jan 13, 2024 at 9:15 PM Ishan Chattopadhyaya <
ichattopadhy...@gmail.com> wrote:

> +1, thanks for the contribution Ilan! Looking forward to seeing this coming
> to fruition.
>
> On Sun, 14 Jan 2024 at 03:40, Ilan Ginzburg <ilans...@gmail.com> wrote:
>
> > I have created SIP-20
> >
> >
> https://cwiki.apache.org/confluence/display/SOLR/SIP-20%3A+Separation+of+Compute+and+Storage+in+SolrCloud
> >
> > In the next few days I will create a Jira + a branch that implements
> > the SIP proposal and that includes documentation on how to approach
> > that branch and what's in it.
> >
> > This proposed contribution is based on work done at Salesforce these
> > last few years and currently running at scale in multiple regions.
> >
> > Thanks,
> > Ilan
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: dev-unsubscr...@solr.apache.org
> > For additional commands, e-mail: dev-h...@solr.apache.org
> >
> >
>

Reply via email to