> > btw, what about native binary embedding quantization support by Lucene?
This sounds like a good idea to have in Lucene. Would this require another VetctorField /VectorsFormat? Based on current implementation, one way would be to use another KNN format or alternatively maybe a better approach would be to make Lucene99ScalarQuantizedVectorsFormat <https://lucene.apache.org/core/9_9_1/core/org/apache/lucene/codecs/lucene99/Lucene99ScalarQuantizedVectorsFormat.html> configurable to accept the type of quantization like this new work in progress PR for int4 quantization <https://github.com/apache/lucene/pull/13197> support which takes the number of bits to use for quantizing as input. Since this change allows passing 1 for bits to be used for quantization, it looks to me like an enabler for binary quantization. - Shubham On Sun, Mar 24, 2024 at 4:34 AM Michael Wechner <michael.wech...@wyona.com> wrote: > btw, what about native binary embedding quantization support by Lucene? > > > https://www.linkedin.com/posts/tomaarsen_binary-and-scalar-embedding-quantization-activity-7176966403332132864-lJzH?utm_source=share&utm_medium=member_desktop > > Would this require another VetctorField /VectorsFormat? > > Thanks > > Michael > > Am 19.03.24 um 21:57 schrieb Shubham Chaudhary: > > Hi Michael, > > > > Lucene already had int8 vector support since 9.5 (#1054 > > <https://github.com/apache/lucene/pull/1054>) but it was left to the > user > > to get those quantized vectors and index using KnnByteVectorField > > < > https://lucene.apache.org/core/9_5_0/core/org/apache/lucene/document/KnnByteVectorField.html > >, > > but with Lucene 9.9 out now there is a native support for int8 scalar > > quantization (#12582 <https://github.com/apache/lucene/pull/12582>) > using > > Lucene99ScalarQuantizedVectorsFormat > > < > https://lucene.apache.org/core/9_9_1/core/org/apache/lucene/codecs/lucene99/Lucene99ScalarQuantizedVectorsFormat.html > > > > that > > expects a confidence interval from 90-100. Here is a nice blog(s) that > > talks about how it works in Lucene. > > > > - > > > https://www.elastic.co/search-labs/blog/articles/scalar-quantization-in-lucene > > - > https://www.elastic.co/search-labs/blog/articles/scalar-quantization-101 > > > > Some other references : > > - > > > https://lucene.apache.org/core/9_9_1/core/org/apache/lucene/codecs/lucene99/Lucene99ScalarQuantizedVectorsFormat.html > > - > > > https://lucene.apache.org/core/9_9_1/core/org/apache/lucene/codecs/lucene99/Lucene99ScalarQuantizedVectorsReader.html > > - > > > https://lucene.apache.org/core/9_9_1/core/org/apache/lucene/codecs/lucene99/Lucene99ScalarQuantizedVectorsWriter.html > > > > > > > > On Wed, Mar 20, 2024 at 1:54 AM Michael Wechner < > michael.wech...@wyona.com> > > wrote: > > > >> Hi > >> > >> Cohere recently announced there "compressed" embeddings > >> > >> https://twitter.com/Nils_Reimers/status/1769809006762037368 > >> > >> > https://www.linkedin.com/posts/bhavsarpratik_rag-genai-search-activity-7175850704928989187-Ki1N/?utm_source=share&utm_medium=member_desktop > >> > >> Does Lucene Vector Search support this already, or is somebody working > >> on this? > >> > >> Thanks > >> > >> Michael > >> > >> --------------------------------------------------------------------- > >> To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org > >> For additional commands, e-mail: java-user-h...@lucene.apache.org > >> > >> > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org > For additional commands, e-mail: java-user-h...@lucene.apache.org > >