That's interesting. I think it's vital to get back some performance tests from the community. Since my contribution to support Vector-search in Apache Solr was merged, we got little or null feedback to understand its performance, in real-world use cases. Blogs, open benchmarks or even just this sort of mail message are welcome. Let me reply in line: -------------------------- *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member*
e-mail: a.benede...@sease.io *Sease* - Information Retrieval Applied Consulting | Training | Open Source Website: Sease.io <http://sease.io/> LinkedIn <https://linkedin.com/company/sease-ltd> | Twitter <https://twitter.com/seaseltd> | Youtube <https://www.youtube.com/channel/UCDx86ZKLYNpI3gzMercM7BQ> | Github <https://github.com/seaseltd> On Wed, 27 Mar 2024 at 21:06, Kent Fitch <kent.fi...@gmail.com> wrote: > Hi Iram, > > Is the machine doing lots of IO? If the hnsw graphs are not entirely in > memory, performance will be poor. What JVM? You may get some benefit from > simd support in java 21. Can you use the latest quantisation changes in > Lucene to reduce memory footprint of the hnsw graphs? That's a large topk, > but I guess you need it? > > Best regards > > Kent Fitch > > On Thu, 28 Mar 2024, 5:12 am Iram Tariq, > <iram.ta...@northbaysolutions.net.invalid> wrote: > > > Hi All, > > > > I am using Dense vectors in SOLR and facing slowness in it. Each search > is > > taking 10-25 seconds. I want to reduce the time to 5 seconds (or less > > ideally). > > > > Following configurations are being used. > > > > > > 1. *SOLR Version:* 9.3.0 > > 2. *Lucene Version:* 9.7.0 > *Ale*: https://www.elastic.co/blog/accelerating-vector-search-simd-instructions, I suggest to experiment with simD vector improvements (unless you are already doing it) > > 3. *Vector Dimensions*: 384 > > 4. *Total Shards:* 5 > > 5. *Number of Vectors (Per shard*): 43209158 > > 6. *JVM for each Instance:* 35GB > *Ale*: What about the machine memory? > > 7. *TopK: *1000 (Getting 1000 from each shard) > > 8. *Rows: *1000 > > 9. *Vector Field Schema: *<fieldType name="knn_vector_384" > > class="solr.DenseVectorField" hnswMaxConnections="20" > > knnAlgorithm="hnsw" > > vectorDimension="384" similarityFunction="cosine" hnswBeamWidth="40"/> > *Ale*: you can fine-tune the hyper-parameter to compromise a bit on recall in favour of performance (hnswBeamWidth, hnswMaxConnections) > > 10. *Stored*: False > > 11. *WebServer:* Apache Tomcat > > 12. *System Specs*: Linux ( CPU:64, RAM:488 GB, OS:Ubuntu 20.04.6 ) > > > > Any sort of help/clue will be appreciated. > > > > > > > > Regards, > > > > > > Iram Tariq | Software Architect > > > > NorthBay > > > > Direct: +1 (902) 329-7329 > > > > iram.ta...@northbaysolutions.net > > > > www.northbaysolutions.com > > >