Re: Experience re OpenAI embeddings in combination with Lucene vector search

2022-02-15 Thread Michael Wechner
fair enough, but wouldn't it make sense that one can increase it programmatically, e.g. .setVectorMaxDimension(2028) ? Thanks Michael Am 14.02.22 um 23:34 schrieb Michael Sokolov: I think we picked the 1024 number as something that seemed so large nobody would ever want to exceed it!

Re: Experience re OpenAI embeddings in combination with Lucene vector search

2022-02-15 Thread Alessandro Benedetti
I believe it could make sense, but as Michael pointed out in the Jira ticket related to the Solr integration, then we'll get complaints like "I set it to 1.000.000 and my Solr instance doesn't work anymore" (I kept everything super simple just to simulate a realistic scenario). So I tend to agree

Re: Experience re OpenAI embeddings in combination with Lucene vector search

2022-02-15 Thread Michael Wechner
I understand, but if Lucene itself would allow to overwrite the default max size programmatically, then I think it should be clear that you do this at your own risk :-) Thanks for the links to your blog posts, which sound very interesting. Thanks Michael Am 15.02.22 um 17:25 schrieb

Re: Experience re OpenAI embeddings in combination with Lucene vector search

2022-02-15 Thread Robert Muir
On Tue, Feb 15, 2022 at 2:33 PM Michael Wechner wrote: > > There seems to be no light at the end of the tunnel for the JDK vector > api, I think OpenJDK will incubate this API until the sun supernovas and > java is dead :) > It is frustrating, as that could give current implementation a needed >

Re: How to Increase max vector size?

2022-02-15 Thread Michael Sokolov
I don't think it makes sense to have a static variable maximum that you can change by calling a method. What purpose would it serve? On Tue, Feb 15, 2022, 2:39 PM Michael Wechner wrote: > Hi Alessandro > > No, I have not created a Jira ticket, but I would be happy to create one, > just let me

Re: How to Increase max vector size?

2022-02-15 Thread Michael Wechner
true :-) when you are the one controlling the input of vectors, then a method to disable the maximum limit would be sufficient. But I could imagine when you offer Lucene as a service where people can for example configure their own "sentence embedding models" and you would like to offer a

Re: How to Increase max vector size?

2022-02-15 Thread Alessandro Benedetti
Hi Michael, let's create a Jira ticket to use a higher value(if you haven't already). I would be happy to consider the patch/or do it myself but after 10/03. Once the pull request is ready (including the Javadoc documentation that clearly states that if you go above X it's at your own risk), we'll

Re: Experience re OpenAI embeddings in combination with Lucene vector search

2022-02-15 Thread Robert Muir
Sure, but lucene should be able to have limits. We have this discussion with every single limit we attempt to implement :) There will always be extreme use cases using too many dimensions or whatever. It is open source! I think if what you are doing is strange enough, you can modify the sources.

Re: Experience re OpenAI embeddings in combination with Lucene vector search

2022-02-15 Thread Michael Wechner
Am 15.02.22 um 19:48 schrieb Robert Muir: Sure, but lucene should be able to have limits. We have this discussion with every single limit we attempt to implement :) There will always be extreme use cases using too many dimensions or whatever. It is open source! I think if what you are doing

Re: How to Increase max vector size?

2022-02-15 Thread Michael Wechner
Hi Alessandro No, I have not created a Jira ticket, but I would be happy to create one, just let me know or please feel free to create one. I understand the concerns about the limits in general and I think it makes sense to have a default max dimensions limit, but I could imagine it needs