Re: Performance Suggestion for Dense Vectors

2024-05-31 Thread Alessandro Benedetti
Cool! I missed that! I'll make sure to align with my digital marketing manager to make her add all our Solr-related external posts! Good to see this live! -- *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail:

Re: Performance Suggestion for Dense Vectors

2024-05-29 Thread David Smiley
There *is* a Solr blog site that just launched: https://solr.apache.org/blog.html On Thu, Mar 28, 2024 at 3:49 PM rajani m wrote: > > @Alessandro, > Is there a solr blog site where we can submit work/articles or are you > suggesting to post on my own site and share a link here? I prefer the >

Re: Performance Suggestion for Dense Vectors

2024-03-29 Thread Alessandro Benedetti
Hi Rajani, the discussion for a centralised Apache Solr blog is in progress (that allows both linking to private blogs to gather more views or write directly there), I'll give you an update as soon as the community finalises the solution. In the meantime, as Ishan said, posting a quest blog is a

Re: Performance Suggestion for Dense Vectors

2024-03-29 Thread Ishan Chattopadhyaya
Hi Rajani, Please feel free to submit guest posts to our SearchScale blog. We welcome posts on vector search. https://SearchScale.com/blog Thanks, Ishan On Fri, 29 Mar, 2024, 1:18 am rajani m, wrote: > @Alessandro, > Is there a solr blog site where we can submit work/articles or are you >

Re: Performance Suggestion for Dense Vectors

2024-03-28 Thread rajani m
btw this article[1] by Tom Burgmans is relevant. It is posted on linkedin though and never came into my feed. I found it on the relevance-search slack channel. [1] https://www.linkedin.com/pulse/testing-dense-vector-search-scale-part-1-ann-tom-burgmans-0tale/ On Thu, Mar 28, 2024 at 3:47 PM

Re: Performance Suggestion for Dense Vectors

2024-03-28 Thread rajani m
@Alessandro, Is there a solr blog site where we can submit work/articles or are you suggesting to post on my own site and share a link here? I prefer the former if there is one because there were times when I had my own, it hardly had any views and on top of that google blogging made me migrate

Re: Performance Suggestion for Dense Vectors

2024-03-28 Thread rajani m
Run the same knn queries at a slow throughput for 30-60 minutes, this should warm up disk caches with hnsw index files, and then you should see a significant drop in the query time. Also make use of "fq" and reduce the document space as much as you can. On Thu, Mar 28, 2024 at 12:50 PM Iram

Re: Performance Suggestion for Dense Vectors

2024-03-28 Thread Iram Tariq
Hi Alessandro, Thank you for the feedback. Kindly see my comments below, *Ale*: https://www.elastic.co/blog/accelerating-vector-search-simd-instructions, I suggest to experiment with simD vector improvements (unless you are already doing it) * We will try this soon. * *Ale*: What about the

Re: Performance Suggestion for Dense Vectors

2024-03-28 Thread Alessandro Benedetti
That's interesting. I think it's vital to get back some performance tests from the community. Since my contribution to support Vector-search in Apache Solr was merged, we got little or null feedback to understand its performance, in real-world use cases. Blogs, open benchmarks or even just this

Re: Performance Suggestion for Dense Vectors

2024-03-27 Thread Kent Fitch
Hi Iram, Is the machine doing lots of IO? If the hnsw graphs are not entirely in memory, performance will be poor. What JVM? You may get some benefit from simd support in java 21. Can you use the latest quantisation changes in Lucene to reduce memory footprint of the hnsw graphs? That's a large