Thanks for the response. You do raise good points.

Say I reverse your example and I have a 10 node cluster with a 10-shard
collection and a replication factor of 10. Now I kill 9 of my nodes, do all
100 replicas move to the one remaining node? I believe the answer is, well
that depends on the configuration.

I'm thinking about it from the initial cluster planning side of things. The
decisions about auto-scaling, how many replicas, and even how many shards
are at least partially dependent on the available hardware. So at
deployment time I would think there would be a way of defining what the
collection *should* look like based on the hardware I am deploying to.
Obviously this could change during runtime and I may need to add nodes,
split shards, etc...

As it is now it seems like I need to deploy my cluster then write a custom
script to ensure each node I expect to be there is running and only then
create my collection with desired shards and replication.

- Frank

On Wed, Jan 9, 2019 at 2:14 PM Erick Erickson <erickerick...@gmail.com>
wrote:

> How would you envision that working? When would the
> replicas actually be created and under what heuristics?
>
> Imagine this is possible, and there are a bunch of
> placeholders in ZK for a 10-shard, collection with
> a replication factor of 10 (100 replicas all told). Now
> I bring up a single Solr instance. Should all 100 replicas
> be created immediately? Wait for N Solr nodes to be
> brought online? On some command?
>
> My gut feel is that this would be fraught with problems
> and not very valuable to many people. If you could create
> the "template" in ZK without any replicas actually being created,
> then at some other point say "make it so", I don't see the advantage
> over just the current setup. And I do think that it would be
> considerable effort.
>
> Net-net is I'd like to see a much stronger justification
> before anyone embarks on something like this. First as
> I mentioned above I think it'd be a lot of effort, second I
> virtually guarantee it'd introduce significant bugs. How
> would it interact with autoscaling for instance?
>
> Best,
> Erick
>
> On Wed, Jan 9, 2019 at 9:59 AM Frank Greguska <fg...@apache.org> wrote:
> >
> > Hello,
> >
> > I am trying to bootstrap a SolrCloud installation and I ran into an issue
> > that seems rather odd. I see it is possible to bootstrap a configuration
> > set from an existing SOLR_HOME using
> >
> > ./server/scripts/cloud-scripts/zkcli.sh -zkhost ${ZK_HOST} -cmd bootstrap
> > -solrhome ${SOLR_HOME}
> >
> > but this does not create a collection, it just uploads a configuration
> set.
> >
> > Furthermore, I can not use
> >
> > bin/solr create
> >
> > to create a collection and link it to my bootstrapped configuration set
> > because it requires Solr to already be running.
> >
> > I'm hoping someone can shed some light on why this is the case? It seems
> > like a collection is just some znodes stored in zookeeper that contain
> > configuration settings and such. Why should I not be able to create those
> > nodes before Solr is running?
> >
> > I'd like to open a feature request for this if one does not already exist
> > and if I am not missing something obvious.
> >
> > Thank you,
> >
> > Frank Greguska
>

Reply via email to