[
https://issues.apache.org/jira/browse/LUCENE-10057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17401661#comment-17401661
]
Michael Sokolov commented on LUCENE-10057:
------------------------------------------
Oh, did my stab at this not work? I was unable to reproduce so I wasn't sure
... Thank you for hacking at it, @Dawid. Your patches LGTM. I don't think I
understand where the issue was coming from and why this fixed it though.
Re: source data for the vectors ... I'm not sure what you mean there; these are
a small sample of the (from our perspective precomputed) embeddings downloaded
from https://nlp.stanford.edu/projects/glove/ (there is something about it in
the package-info.java). Originally they were arrived at by training a large
corpus of text (I think these are from a collection of 6B twitter and other
texts).
> Replace direct mmaped buffer with Lucene abstractions in KnnVectorDict
> ----------------------------------------------------------------------
>
> Key: LUCENE-10057
> URL: https://issues.apache.org/jira/browse/LUCENE-10057
> Project: Lucene - Core
> Issue Type: Bug
> Reporter: Dawid Weiss
> Priority: Major
> Attachments: LUCENE-10057.patch
>
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]