Re: how splitting more shards impact performance

Shawn Heisey Mon, 03 Feb 2020 21:46:00 -0800

On 2/3/2020 5:17 PM, ChienHua wrote:

What should we expect the query performance impacted by splitting one
collection into more shards?


We expect the query performance would degrade by splitting more shards since
the overhead of merging results from several shards.

However, the test result seems not as we expect. Any idea or experience for
the performance impact?


This is a often misunderstood aspect of Solr performance.

In situations with a very high query rate, splitting into shards isgenerally going to reduce performance. This happens because as youmentioned, there is overhead from merging the results. A high queryrate will keep all the CPUs very busy.

But in situations with a low query rate, more shards can actually makethings faster. This is a possibility when there is a significantsurplus of available CPU capacity ... the subqueries for one query cancomplete concurrently, so even with the overhead of merging, the overallresult is faster.

The size of the index can also affect this dynamic. If you take anindex that is way too big for a single machine and split it so it hasshards on multiple machines, that can improve query performancedramatically.


Thanks,
Shawn

Re: how splitting more shards impact performance

Reply via email to