Raghav Jindal has uploaded this change for review. ( http://gerrit.cloudera.org:8080/23730
Change subject: IMPALA-14566: Add euclidean_distance and cosine_similarity functions for ARRAY<FLOAT> ...................................................................... IMPALA-14566: Add euclidean_distance and cosine_similarity functions for ARRAY<FLOAT> This commit adds two new built-in functions for vector similarity calculations: - euclidean_distance: Computes L2 distance between two vectors - cosine_similarity: Computes cosine similarity between two vectors Both functions operate on ARRAY<FLOAT> data types and return DOUBLE values for better precision. These functions enable semantic search capabilities within Impala. Change-Id: Id305acc87530d5d0e53613fe8df9a631ea4e1080 --- A be/src/exprs/vector-functions-ir.cc A be/src/exprs/vector-functions.h M common/function-registry/impala_functions.py 3 files changed, 227 insertions(+), 0 deletions(-) git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/30/23730/1 -- To view, visit http://gerrit.cloudera.org:8080/23730 To unsubscribe, visit http://gerrit.cloudera.org:8080/settings Gerrit-Project: Impala-ASF Gerrit-Branch: master Gerrit-MessageType: newchange Gerrit-Change-Id: Id305acc87530d5d0e53613fe8df9a631ea4e1080 Gerrit-Change-Number: 23730 Gerrit-PatchSet: 1 Gerrit-Owner: Raghav Jindal <[email protected]>
