Re: Performance Suggestion for Dense Vectors

Iram Tariq Thu, 28 Mar 2024 09:50:06 -0700

Hi  Alessandro,

Thank you for the feedback. Kindly see my comments below,


*Ale*:
https://www.elastic.co/blog/accelerating-vector-search-simd-instructions, I
suggest to experiment with simD vector improvements  (unless you are
already doing it)

* We will try this soon. *

*Ale*: What about the machine memory?

Following is the system specification:  Linux ( CPU:64, RAM:488 GB,
OS:Ubuntu 20.04.6 )

*Ale*: you can fine-tune the hyper-parameter to compromise a bit on recall
in favour of performance  (hnswBeamWidth, hnswMaxConnections)

I am trying this as a first step. But I am sure it will impact recall.

Regards,


Iram Tariq | Software Architect

NorthBay

Direct:  +1 (902) 329-7329

iram.ta...@northbaysolutions.net

www.northbaysolutions.com




On Thu, Mar 28, 2024 at 5:42 AM Alessandro Benedetti <a.benede...@sease.io>
wrote:

> That's interesting.
> I think it's vital to get back some performance tests from the community.
> Since my contribution to support Vector-search in Apache Solr was merged,
> we got little or null feedback to understand its performance, in real-world
> use cases.
> Blogs, open benchmarks or even just this sort of mail message are welcome.
> Let me reply in line:
> --------------------------
> *Alessandro Benedetti*
> Director @ Sease Ltd.
> *Apache Lucene/Solr Committer*
> *Apache Solr PMC Member*
>
> e-mail: a.benede...@sease.io
>
>
> *Sease* - Information Retrieval Applied
> Consulting | Training | Open Source
>
> Website: Sease.io <http://sease.io/>
> LinkedIn <https://linkedin.com/company/sease-ltd> | Twitter
> <https://twitter.com/seaseltd> | Youtube
> <https://www.youtube.com/channel/UCDx86ZKLYNpI3gzMercM7BQ> | Github
> <https://github.com/seaseltd>
>
>
> On Wed, 27 Mar 2024 at 21:06, Kent Fitch <kent.fi...@gmail.com> wrote:
>
> > Hi Iram,
> >
> > Is the machine doing lots of IO? If the hnsw graphs are not entirely in
> > memory, performance will be poor. What JVM? You may get some benefit from
> > simd support in java 21. Can you use the latest quantisation changes in
> > Lucene to reduce memory footprint of the hnsw graphs? That's a large
> topk,
> > but I guess you need it?
> >
> > Best regards
> >
> > Kent Fitch
> >
> > On Thu, 28 Mar 2024, 5:12 am Iram Tariq,
> > <iram.ta...@northbaysolutions.net.invalid> wrote:
> >
> > > Hi All,
> > >
> > > I am using Dense vectors in SOLR and facing slowness in it. Each search
> > is
> > > taking 10-25 seconds. I want to reduce the time to 5 seconds (or less
> > > ideally).
> > >
> > > Following configurations are being used.
> > >
> > >
> > >    1. *SOLR Version:* 9.3.0
> > >    2. *Lucene Version:* 9.7.0
> >
> *Ale*:
> https://www.elastic.co/blog/accelerating-vector-search-simd-instructions,
> I
> suggest to experiment with simD vector improvements  (unless you are
> already doing it)
>
> > >    3. *Vector Dimensions*: 384
> > >    4. *Total Shards:* 5
> > >    5. *Number of Vectors (Per shard*): 43209158
> > >    6. *JVM for each Instance:* 35GB
> >
> *Ale*: What about the machine memory?
>
> > >    7. *TopK: *1000  (Getting 1000 from each shard)
> > >    8. *Rows: *1000
> > >    9. *Vector Field Schema:  *<fieldType name="knn_vector_384"
> > >    class="solr.DenseVectorField" hnswMaxConnections="20"
> > > knnAlgorithm="hnsw"
> > >    vectorDimension="384" similarityFunction="cosine"
> hnswBeamWidth="40"/>
> >
> *Ale*: you can fine-tune the hyper-parameter to compromise a bit on recall
> in favour of performance  (hnswBeamWidth, hnswMaxConnections)
>
> > >    10. *Stored*: False
> > >    11. *WebServer:* Apache Tomcat
> > >    12. *System Specs*:  Linux ( CPU:64, RAM:488 GB, OS:Ubuntu 20.04.6 )
> > >
> > > Any sort of help/clue will be appreciated.
> > >
> > >
> > >
> > > Regards,
> > >
> > >
> > > Iram Tariq | Software Architect
> > >
> > > NorthBay
> > >
> > > Direct:  +1 (902) 329-7329
> > >
> > > iram.ta...@northbaysolutions.net
> > >
> > > www.northbaysolutions.com
> > >
> >
>

Re: Performance Suggestion for Dense Vectors

Reply via email to