Greg Rahn created IMPALA-7759:
---------------------------------
Summary: Add Levenshtein edit distance function
Key: IMPALA-7759
URL: https://issues.apache.org/jira/browse/IMPALA-7759
Project: IMPALA
Issue Type: New Feature
Reporter: Greg Rahn
Assignee: Greg Rahn
References:
* [Netezza -
(le_dst())|https://www.ibm.com/support/knowledgecenter/en/SSULQD_7.2.1/com.ibm.nz.dbu.doc/r_dbuser_functions_expressions_fuzzy_funcs.html]
* [Postgres -
levenshtein()|https://www.postgresql.org/docs/current/static/fuzzystrmatch.html#id-1.11.7.24.6]
One notable difference:
* Netezza: if either value is NULL, returns the length of non-NULL value
* Postgres: if either value is NULL, returns NULL
Preference is to implement Postgres version due to ease of cross-system testing.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]