[
https://issues.apache.org/jira/browse/IMPALA-7759?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16951539#comment-16951539
]
Greg Rahn commented on IMPALA-7759:
-----------------------------------
Stand-alone version here: https://github.com/gregrahn/impala-levenshtein
> Add Levenshtein edit distance built-in function
> -----------------------------------------------
>
> Key: IMPALA-7759
> URL: https://issues.apache.org/jira/browse/IMPALA-7759
> Project: IMPALA
> Issue Type: New Feature
> Reporter: Greg Rahn
> Assignee: Greg Rahn
> Priority: Major
> Labels: built-in-function
> Fix For: Impala 3.2.0
>
>
> References:
> * [Netezza -
> le_dst()|https://www.ibm.com/support/knowledgecenter/en/SSULQD_7.2.1/com.ibm.nz.dbu.doc/r_dbuser_functions_expressions_fuzzy_funcs.html]
> * [Postgres -
> levenshtein()|https://www.postgresql.org/docs/current/static/fuzzystrmatch.html#id-1.11.7.24.6]
> One notable difference:
> * Netezza: if either value is NULL, returns the length of non-NULL value
> * Postgres: if either value is NULL, returns NULL
> Preference is to implement Postgres version due to ease of cross-system
> testing.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]