Hello, Shawn, Toke Eskildsen and Solr Community:

> Since version 7.5, optimize with TieredMergePolicy (the default policy)
> respects the maximum segment size, which defaults to 5GB.
Thank you for your reply.
Indeed,  there was approximately a total of 9GB index size.
So, as you tell me, TieredMergePolicy respected to the maximum size.

> Your PDF attachment did not make it to the list.  We cannot see it.  The
> mailing list rarely lets attachments through.
Thank you for your response.
I am sorry for sending an attachment, which mailing list does not send to.
I did not know about that.
For the next opportunity to share table-formatted data, what is the best
way to share data with all of you?
Should I share a Google spreadsheet URL?  How people usually do?

> The one place where there is a real difference is with String faceting
> as >= 2 segments means that String ordinals must be coordinated between
> segments

Let me confirm what you say (since I am not a native speaker of English and
I am new to Solr)
When we request a facet query (like below) to 2 Solrs: one with multiple
segments and the other is only one segment,
we will have different performance result?

"http://localhost:8983/solr/collection1/
select?q=*:*&
fl=id,cat,manu,price&
indent=true&
rows=2&
facet=true&
facet.field=price&
facet.sort=index&
facet.limit=10"

Can we configure up-front faceting or one-the-fly facet method?
I would like to see the difference to understand Solr's behavior if it is
possible.


Sincerely,
Kaya Ota






2019年5月20日(月) 18:38 Toke Eskildsen <t...@kb.dk>:

> On Sun, 2019-05-19 at 13:36 -0600, Shawn Heisey wrote:
> > I would not expect to see a really noticeable performance increase
> > by going from two segments to one.
>
> The one place where there is a real difference is with String faceting
> as >= 2 segments means that String ordinals must be coordinated between
> segments. Depending on faceting method this is done up-front (with an
> obvious fist-call time penalty, some memory overhead to hold the
> mapping and a slight running overhead for consulting the map) or on-
> the-fly (with non-surprising running overhead).
>
> It becomes relevant with large indexes and/or setups where performance
> is very important.
>
> - Toke Eskildsen, Royal Danish library
>
>
>

Reply via email to