Thanks Erick/Deepak. The cloud is running on bare metal (128 GB / 24 CPU).
Is there an option to run a compaction on the data files to make the size equal on both clouds? I am trying to find all the options before I add the new fields to the production cloud.

Thanks
AA

On Mon, Apr 30, 2018 at 10:45 AM, Erick Erickson <erickerick...@gmail.com> wrote:

> Anthony:
>
> You are probably seeing the results of removing deleted documents from
> the shards as they're merged. Even on replicas in the same _shard_,
> the size of the index on disk won't necessarily be identical. This has
> to do with which segments are selected for merging, which are not
> necessarily coordinated across replicas.
>
> The test is if the number of docs on each collection is the same. If
> it is, then don't worry about index sizes.
>
> Best,
> Erick
>
> On Mon, Apr 30, 2018 at 9:38 AM, Deepak Goel <deic...@gmail.com> wrote:
> > Could you please also give the machine details of the two clouds you are
> > running?
> >
> > Deepak
> >
> > On Mon, Apr 30, 2018 at 9:51 PM, Antony A <antonyaugus...@gmail.com> wrote:
> >
> >> Hi Shawn,
> >>
> >> The cloud is running version 6.2.1 with ClassicIndexSchemaFactory.
> >>
> >> The sum of the sizes shown in the admin UI across all the shards is
> >> around 265 G vs 224 G between the two clouds.
> >>
> >> I created the collection using "numShards", so it uses the compositeId
> >> router.
> >>
> >> If you need more information, please let me know.
> >>
> >> Thanks
> >> AA
> >>
> >> On Mon, Apr 30, 2018 at 10:04 AM, Shawn Heisey <apa...@elyograg.org> wrote:
> >>
> >> > On 4/30/2018 9:51 AM, Antony A wrote:
> >> >
> >> >> I am running two separate solr clouds. I have 8 shards in each with a
> >> >> total of 300 million documents. Both the clouds are indexing the
> >> >> documents from the same source/configuration.
> >> >>
> >> >> I am noticing there is a difference in the size of the collection
> >> >> between them. I am planning to add more shards to see if that helps
> >> >> solve the issue. Has anyone come across a similar issue?
> >> >
> >> > There's no information here about exactly what you are seeing, what you
> >> > are expecting to see, and why you believe that what you are seeing is
> >> > wrong.
> >> >
> >> > You did say that there is "a difference in size". That is a very vague
> >> > problem description.
> >> >
> >> > FYI, unless a SolrCloud collection is using the implicit router, you
> >> > cannot add shards. And if it *IS* using the implicit router, then you
> >> > are 100% in control of document routing -- Solr cannot influence that
> >> > at all.
> >> >
> >> > Thanks,
> >> > Shawn
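
Erick's test in the thread above (compare the number of docs in each collection rather than the index size on disk) could be sketched roughly as follows. This is only a minimal sketch: the host names and the collection name are hypothetical placeholders, not values from the thread, and it assumes the standard `/select` handler returns JSON with a `response.numFound` field for a `q=*:*&rows=0` query.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen


def count_url(base_url, collection):
    """Build a select URL that matches every document but returns no rows."""
    params = urlencode({"q": "*:*", "rows": 0, "wt": "json"})
    return f"{base_url.rstrip('/')}/{collection}/select?{params}"


def parse_num_found(body):
    """Extract response.numFound from a Solr JSON select response."""
    return json.loads(body)["response"]["numFound"]


def fetch_num_found(base_url, collection):
    """Query one cloud and return its live (non-deleted) document count."""
    with urlopen(count_url(base_url, collection), timeout=30) as resp:
        return parse_num_found(resp.read().decode("utf-8"))


def same_doc_count(count_a, count_b):
    """True when both collections hold the same number of documents."""
    return count_a == count_b


if __name__ == "__main__":
    # Hypothetical endpoints and collection name; replace with real ones.
    cloud_a = "http://solr-cloud-a.example.com:8983/solr"
    cloud_b = "http://solr-cloud-b.example.com:8983/solr"
    collection = "mycollection"

    a = fetch_num_found(cloud_a, collection)
    b = fetch_num_found(cloud_b, collection)
    print(f"cloud A: {a} docs, cloud B: {b} docs, match: {same_doc_count(a, b)}")
```

`numFound` counts only live documents, so it is unaffected by deleted documents that have not yet been merged away, which is what makes it a better comparison than the on-disk sizes reported in the admin UI.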