Fair enough - are you planning to offer such a service;) sounds exciting!

-Mike

On Tue, Feb 15, 2022 at 6:00 PM Michael Wechner <michael.wech...@wyona.com>
wrote:

> true :-) when you are the one controlling the input of vectors, then a
> method to disable the maximum limit would be sufficient.
>
> But I could imagine when you offer Lucene as a service where people can
> for example configure their own "sentence embedding models" and you would
> like to offer a different maximum limit than the default of 1024, then I
> think a method to reset the maximum limit would make sense. Examples could
> be a service of OpenAI or vector search databases like for example Weaviate
> or Pinecone.
>
> Thanks
>
> Michael
>
>
>
>
> Am 15.02.22 um 23:34 schrieb Michael Sokolov:
>
> I don't think it makes sense to have a static variable maximum that you
> can change by calling a method. What purpose would it serve?
>
> On Tue, Feb 15, 2022, 2:39 PM Michael Wechner <michael.wech...@wyona.com>
> wrote:
>
>> Hi Alessandro
>>
>> No, I have not created a Jira ticket, but I would be happy to create one,
>> just let me know or please feel free to create one.
>>
>> I understand the concerns about the limits in general and I think it
>> makes sense to have a default max dimensions limit, but I could imagine it
>> needs to be increased eventually and being able to increase it
>> programmatically and at your own risk will help people using Lucene.
>>
>> Thanks
>>
>> Michael
>>
>> Am 15.02.22 um 19:22 schrieb Alessandro Benedetti:
>>
>> Hi Michael,
>> let's create a Jira ticket to use a higher value(if you haven't already).
>> I would be happy to consider the patch/or do it myself but after 10/03.
>> Once the pull request is ready (including the Javadoc documentation that
>> clearly states that if you go above X it's at your own risk), we'll involve
>> also Michael Sokolov and the other committers familiar with this area of
>> the code.
>>
>> Cheers
>>
>> --------------------------
>> Alessandro Benedetti
>> Apache Lucene/Solr PMC member and Committer
>> Director, R&D Software Engineer, Search Consultant
>>
>> www.sease.io
>>
>>
>> On Sat, 12 Feb 2022 at 22:53, Michael Wechner <michael.wech...@wyona.com>
>> wrote:
>>
>>> Hi
>>>
>>> I just tried to test the OpenAI model "text-similarity-davinci-001" with
>>> 12288 dimensions and receive the following error
>>>
>>> java.lang.IllegalArgumentException: vector numDimensions must be <=
>>> VectorValues.MAX_DIMENSIONS (=1024); got 12288
>>>         at
>>> org.apache.lucene.document.FieldType.setVectorDimensionsAndSimilarityFunction(FieldType.java:381)
>>> ~[lucene-core-9.0.0.jar:9.0.0 0b18b3b965cedaf5eb129aa41243a44c83ca826d -
>>> jpountz - 2021-12-01 14:23:49]
>>>         at
>>> org.apache.lucene.document.KnnVectorField.createFieldType(KnnVectorField.java:69)
>>> ~[lucene-core-9.0.0.jar:9.0.0 0b18b3b965cedaf5eb129aa41243a44c83ca826d -
>>> jpountz - 2021-12-01 14:23:49]
>>>
>>> IIUC I can not increase programmatically the max vector size which is
>>> set inside lucene/core/src/java/org/apache/lucene/index/VectorValues.java
>>>
>>>   public static int MAX_DIMENSIONS = 1024;
>>>
>>> right?
>>>
>>> I guess I could rebuild Lucene with a greater size or what are the
>>> possbilities to increase the max vector size?
>>>
>>> Thanks
>>>
>>> Michael
>>>
>>>
>>>
>>
>

Reply via email to