Thanks for clarification Michael! On Tue, Dec 3, 2024 at 1:56 PM Michael Sokolov <msoko...@gmail.com> wrote:
> Sparse is meaning two different things here. In the case you found Mikhail, > it means not every document has a value for some vector field. I think the > question here is about very high dimensional vectors where most documents > have zeroes in most dimensions of the vector. > > On Tue, Dec 3, 2024, 2:01 AM Mikhail Khludnev <m...@apache.org> wrote: > > > Morning. > > I noticed a condition choosing sparse and dense format underneath > > > > > https://github.com/apache/lucene/blob/6053e1e31378378f6d310a05ea6d7dcdfc45f48b/lucene/core/src/java/org/apache/lucene/codecs/lucene95/OffHeapByteVectorValues.java#L108 > > perhaps it may achieve your performance requirements. > > > > > > On Mon, Dec 2, 2024 at 8:03 PM Viacheslav Dobrynin <w.v.d...@gmail.com> > > wrote: > > > > > Hi! > > > > > > I need to index sparse vectors, whereas as I understand it, > > > KnnFloatVectorField is designed for dense vectors. > > > Therefore, it seems that this approach will not work. > > > > > > вс, 1 дек. 2024 г. в 18:36, Mikhail Khludnev <m...@apache.org>: > > > > > > > Hi, > > > > May it look like KnnFloatVectorField(... DOT_PRODUCT) > > > > and KnnFloatVectorQuery? > > > > > > > > > > > > > -- > > Sincerely yours > > Mikhail Khludnev > > > -- Sincerely yours Mikhail Khludnev