Abhishek Rawat created IMPALA-14572:
---------------------------------------

             Summary: Add support for euclidean distance function
                 Key: IMPALA-14572
                 URL: https://issues.apache.org/jira/browse/IMPALA-14572
             Project: IMPALA
          Issue Type: Task
            Reporter: Abhishek Rawat


Euclidean (L2) distance is useful when the magnitude of a vector (e.g., the
frequency or intensity of a feature) is an important indicator of similarity.
For example, when finding similar houses based on square footage, number of
bedrooms, price, etc., Euclidean distance can provide better similarity results.
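
To illustrate the point about magnitude, here is a minimal Python sketch (not
Impala code). The house vectors and both helper functions are hypothetical and
exist only for this comparison. It contrasts Euclidean distance with cosine
similarity, which ignores magnitude:

```python
import math

def euclidean_distance(a, b):
    """L2 distance: square root of the sum of squared element differences."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same number of elements")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def cosine_similarity(a, b):
    """Cosine of the angle between a and b; insensitive to vector length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Hypothetical houses: [sqFt, bedrooms, price in thousands]
small = [1000.0, 2.0, 200.0]
large = [3000.0, 6.0, 600.0]  # same proportions as `small`, three times the size
near = [1100.0, 2.0, 210.0]   # slightly bigger than `small`

# Cosine similarity treats `small` and `large` as (almost exactly) identical,
# while Euclidean distance ranks `near` as far closer to `small`.
print(cosine_similarity(small, large))
print(euclidean_distance(small, near), euclidean_distance(small, large))
```

In practice the features would likely need scaling first, since sqFt otherwise
dominates the distance.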

 
Impala doesn't support a native vector data type yet, so vectors could be
represented with the ARRAY<FLOAT> data type for now.
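
As a rough sketch of what such a function might compute over two ARRAY<FLOAT>
values, the Python below models the arrays as sequences of 32-bit floats. The
function name, the NULL-in/NULL-out behavior, and the error on mismatched
lengths are all assumptions made for illustration, not decided Impala
semantics:

```python
import math
from array import array

def euclidean_distance(a, b):
    """Hypothetical semantics: NULL (None) in gives NULL out; lengths must match."""
    if a is None or b is None:
        return None
    if len(a) != len(b):
        raise ValueError("vectors must have the same number of elements")
    # Python floats are doubles, so the sum is accumulated in double precision
    # even though the stored elements are 32-bit.
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

# array("f") stores 32-bit floats, loosely mirroring ARRAY<FLOAT> elements.
v1 = array("f", [1.0, 2.0, 3.0])
v2 = array("f", [4.0, 6.0, 3.0])

print(euclidean_distance(v1, v2))    # 5.0
print(euclidean_distance(None, v2))  # None
```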



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
