mikemccand commented on issue #15540: URL: https://github.com/apache/lucene/issues/15540#issuecomment-3718461103
And maybe also any `NaN` values in the array? I don't think we can insist on unit-normalized vectors ... some corpora / ML models/religions seem to intentionally preserve the vector's un-normalized original length as meaningful / information-bearing. But it sure would be nice to see something go wrong if you use the wrong distance metric with un-normalized vectors. It would help save time for [incredibly confused users like me when benchmarking](https://github.com/mikemccand/luceneutil/issues/498). -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
