Mazen Raafat created SOLR-15353:
-----------------------------------
Summary: Querying multiple collection performance issue
Key: SOLR-15353
URL: https://issues.apache.org/jira/browse/SOLR-15353
Project: Solr
Issue Type: Bug
Security Level: Public (Default Security Level. Issues are Public)
Components: search
Affects Versions: 8.8.1
Environment: SolrCloud setup: 3 ZooKeeper servers and 2 SolrCloud nodes.
Each Solr node is hosted on an AWS m5.xlarge EC2 instance with 8 GB of RAM
dedicated to the Solr JVM heap.
In this environment, each collection has 1 shard and 2 replicas.
For benchmarking I used JMeter, with the thread group set to 50 threads and a
loop count of 500.
Reporter: Mazen Raafat
Attachments: querying alias points to collection 1 and collection
2.png, querying collection 1.png, querying collection 2.png, thread group.png
Performance degrades when querying multiple collections, either through an
alias that points to multiple collections or by calling the search handler
directly with the collection query param, as follows:
{{http://localhost:8983/solr/collection1/select?collection=collection1,collection2,collection3}}
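For reference, a minimal Python sketch of how such a multi-collection request URL can be assembled. The helper name build_select_url and the q=*:* query are assumptions made for illustration; the actual query and filter query used in the tests are not shown in this report.

```python
from urllib.parse import urlencode


def build_select_url(base_url, host_collection, collections, query="*:*"):
    """Build a /select URL that searches several collections in one request.

    base_url, host_collection and query are illustrative; only the
    comma-separated `collection` parameter mirrors the URL in this report.
    """
    params = {
        "collection": ",".join(collections),  # comma-separated collection list
        "q": query,                           # assumed query, not the reporter's
    }
    # safe="," keeps the commas in the collection list unescaped
    return f"{base_url}/{host_collection}/select?{urlencode(params, safe=',')}"


if __name__ == "__main__":
    print(build_select_url(
        "http://localhost:8983/solr",
        "collection1",
        ["collection1", "collection2", "collection3"],
    ))
```

The same helper can produce the single-collection URLs for the baseline tests by passing a one-element list.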
In the first test I queried a collection with about 40k docs; the
throughput was ~3k req/sec.
In the second test I queried another collection with about 4k docs; the
throughput was ~3.5k req/sec.
In the third test I queried an alias that points to both collections and
voilà! The throughput dropped to ~200 req/sec!
I also tried skipping the alias and using
solrurl/solr/collection1/select?collections=collection2, and got the same result.
Notes:
# Both collections have the same schema.
# The query and filter query are the same in all tests.
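The three tests above could be approximated outside JMeter with a small script like the one below. It is only a sketch under stated assumptions: the q=*:* query and the URLs are placeholders, the function names are hypothetical, and a Python client will not reach the same absolute throughput as JMeter, so only the relative difference between single-collection and multi-collection runs is meaningful.

```python
import time
import urllib.request
from concurrent.futures import ThreadPoolExecutor


def http_get(url):
    """Issue one GET request and drain the response body."""
    with urllib.request.urlopen(url, timeout=30) as resp:
        resp.read()


def measure_throughput(request_fn, threads=50, loops=500):
    """Call request_fn `loops` times on each of `threads` workers.

    Returns (total_requests, requests_per_second). The defaults mirror the
    JMeter settings in this report (50 threads, loop count 500).
    """
    def worker(_worker_id):
        for _ in range(loops):
            request_fn()

    start = time.perf_counter()
    with ThreadPoolExecutor(max_workers=threads) as pool:
        # list() forces completion and re-raises any worker exception
        list(pool.map(worker, range(threads)))
    elapsed = time.perf_counter() - start
    total = threads * loops
    return total, total / elapsed


if __name__ == "__main__":
    base = "http://localhost:8983/solr"  # assumed local node
    targets = {
        # q=*:* is a placeholder; the reporter's real query is not given
        "collection1 only": f"{base}/collection1/select?q=*:*",
        "collection2 only": f"{base}/collection2/select?q=*:*",
        "both collections": f"{base}/collection1/select"
                            "?collection=collection1,collection2&q=*:*",
    }
    for name, url in targets.items():
        total, rps = measure_throughput(lambda u=url: http_get(u))
        print(f"{name}: {total} requests, {rps:.0f} req/sec")
```

Passing the request as a callable keeps the timing loop independent of the HTTP client, so it can be swapped for another client or a stub.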
--
This message was sent by Atlassian Jira
(v8.3.4#803005)