Upgrade Apache Commons Codec to version 1.6 in order to add new Beider-Morse
Phonetic Matching (BMPM) option
------------------------------------------------------------------------------------------------------------
Key: SOLR-2982
URL: https://issues.apache.org/jira/browse/SOLR-2982
Project: Solr
Issue Type: Improvement
Components: Rules, Schema and Analysis, search
Reporter: Brooke Schreier Ganz
Fix For: 3.6, 4.0
Apache Commons Codec released version 1.6 in November 2011. Along with a few
bug fixes, 1.6 adds a new phonetic matching system, Beider-Morse Phonetic
Matching (BMPM), that is far superior to the existing phonetic codecs such as
Soundex, Metaphone, and Caverphone. BMPM itself has been available for some
time, but this is its first port to Java and its first commit in the Apache
ecosystem.
For more information, see http://stevemorse.org/phoneticinfo.htm
and http://stevemorse.org/phonetics/bmpm.htm
BMPM would be a fantastic "soundalike" tool to help search for personal names
(or just surnames) in a Solr/Lucene index, much better than Levenshtein
distance for this use case.
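
To illustrate why edit distance is a poor fit for names, here is a minimal
sketch in plain Java (no Solr or Commons Codec dependency). The class name
NameDistance and the sample surnames are illustrative assumptions, not part
of this issue. "Meyer" and "Maier" are spelling variants of the same surname
yet are two edits apart, while "Meyer" and "Meter" are only one edit apart
despite being unrelated.

```java
// Hypothetical helper for illustration only; not part of Solr or Commons Codec.
final class NameDistance {

    // Classic dynamic-programming Levenshtein distance using two rolling rows.
    static int levenshtein(String a, String b) {
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];

        // Distance from the empty prefix of a to each prefix of b.
        for (int j = 0; j <= b.length(); j++) {
            prev[j] = j;
        }

        for (int i = 1; i <= a.length(); i++) {
            curr[0] = i; // deleting i characters from a
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                int insertion = curr[j - 1] + 1;
                int deletion = prev[j] + 1;
                int substitution = prev[j - 1] + cost;
                curr[j] = Math.min(Math.min(insertion, deletion), substitution);
            }
            // Swap rows so the row just computed becomes the previous row.
            int[] tmp = prev;
            prev = curr;
            curr = tmp;
        }
        return prev[b.length()];
    }

    public static void main(String[] args) {
        // Same surname, different spelling: two substitutions.
        System.out.println(levenshtein("Meyer", "Maier")); // prints 2
        // Unrelated word, closer by edit distance: one substitution.
        System.out.println(levenshtein("Meyer", "Meter")); // prints 1
    }
}
```

A phonetic encoder addresses this by comparing how names sound rather than
how they are spelled. In Commons Codec 1.6 the BMPM implementation lives in
the org.apache.commons.codec.language.bm package.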