Thanks Erick/Deepak.

The cloud is running on baremetal (128 GB/24 cpu).

Is there an option to compact the data files so that the index size is
equal on both clouds? I am trying to find all the options before I add the
new fields to the production cloud.

Thanks
AA
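
For reference, the check Erick suggests in the quoted reply below (compare
the number of docs in each collection rather than the index size on disk)
could be sketched roughly as follows. This is only a minimal sketch: it
assumes each cloud exposes the standard /select handler with JSON output,
and the base URLs and collection name in the usage comment are hypothetical
placeholders, not values from this thread.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen


def count_url(base_url, collection):
    """Build a select URL that matches every document but returns no rows."""
    query = urlencode({"q": "*:*", "rows": 0, "wt": "json"})
    return f"{base_url.rstrip('/')}/{collection}/select?{query}"


def num_found(response_body):
    """Extract the total hit count from a Solr JSON select response."""
    return json.loads(response_body)["response"]["numFound"]


def fetch_count(base_url, collection):
    """Ask one cloud for its document count (requires network access)."""
    with urlopen(count_url(base_url, collection), timeout=30) as resp:
        return num_found(resp.read().decode("utf-8"))


def counts_match(count_a, count_b):
    """True when both clouds report the same number of documents."""
    return count_a == count_b


# Usage (hypothetical hosts and collection name):
#   a = fetch_count("http://cloud-a:8983/solr", "mycollection")
#   b = fetch_count("http://cloud-b:8983/solr", "mycollection")
#   print(a, b, counts_match(a, b))
```

If the two counts agree, a difference in on-disk size is most likely down to
deleted documents that have not yet been merged away on one side.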

On Mon, Apr 30, 2018 at 10:45 AM, Erick Erickson <erickerick...@gmail.com>
wrote:

> Anthony:
>
> You are probably seeing the results of removing deleted documents from
> the shards as they're merged. Even on replicas in the same _shard_,
> the size of the index on disk won't necessarily be identical. This has
> to do with which segments are selected for merging, which are not
> necessarily coordinated across replicas.
>
> The test is if the number of docs on each collection is the same. If
> it is, then don't worry about index sizes.
>
> Best,
> Erick
>
> On Mon, Apr 30, 2018 at 9:38 AM, Deepak Goel <deic...@gmail.com> wrote:
> > Could you please also give the machine details of the two clouds you are
> > running?
> >
> >
> >
> > Deepak
> > "The greatness of a nation can be judged by the way its animals are
> > treated. Please stop cruelty to Animals, become a Vegan"
> >
> > +91 73500 12833
> > deic...@gmail.com
> >
> > Facebook: https://www.facebook.com/deicool
> > LinkedIn: www.linkedin.com/in/deicool
> >
> > "Plant a Tree, Go Green"
> >
> > Make In India : http://www.makeinindia.com/home
> >
> > On Mon, Apr 30, 2018 at 9:51 PM, Antony A <antonyaugus...@gmail.com>
> > wrote:
> >
> >> Hi Shawn,
> >>
> >> The cloud is running version 6.2.1. with ClassicIndexSchemaFactory
> >>
> >> The sum of size from admin UI on all the shards is around 265 G vs 224 G
> >> between the two clouds.
> >>
> >> I created the collection using "numShards" so compositeId router.
> >>
> >> If you need more information, please let me know.
> >>
> >> Thanks
> >> AA
> >>
> >> On Mon, Apr 30, 2018 at 10:04 AM, Shawn Heisey <apa...@elyograg.org>
> >> wrote:
> >>
> >> > On 4/30/2018 9:51 AM, Antony A wrote:
> >> >
> >> >> I am running two separate solr clouds. I have 8 shards in each with a
> >> >> total of 300 million documents. Both the clouds are indexing the document
> >> >> from the same source/configuration.
> >> >>
> >> >> I am noticing there is a difference in the size of the collection
> >> >> between them. I am planning to add more shards to see if that helps solve
> >> >> the issue. Has anyone come across similar issue?
> >> >>
> >> >
> >> > There's no information here about exactly what you are seeing, what you
> >> > are expecting to see, and why you believe that what you are seeing is
> >> > wrong.
> >> >
> >> > You did say that there is "a difference in size".  That is a very vague
> >> > problem description.
> >> >
> >> > FYI, unless a SolrCloud collection is using the implicit router, you
> >> > cannot add shards.  And if it *IS* using the implicit router, then you are
> >> > 100% in control of document routing -- Solr cannot influence that at all.
> >> >
> >> > Thanks,
> >> > Shawn
> >> >
> >> >
> >>
>
