Juan Garcia created CODEC-235:
---------------------------------
Summary: Revised / Alternate NYSIIS
Key: CODEC-235
URL: https://issues.apache.org/jira/browse/CODEC-235
Project: Commons Codec
Issue Type: New Feature
Reporter: Juan Garcia
Priority: Minor
I have been dabbling in phonetic algorithms lately and it is pleasing to see
that I can find something under the commons umbrella for this area as well so
thanks a ton for that.
In regards to feature requests NYSIIS as is implemented here I believe falls
under the original release in the 1970s. Not being savvy in this area it took
me too long to realize that the results that I was seeing from Oracles
implementation as referenced here
https://docs.oracle.com/cd/E18150_01/javadocs/SunMasterIndex/com/sun/mdm/index/phonetic/impl/Nysiis.html
differs from what exists in commons-codec 1.10.
A series of searches brings me to this cool page
http://www.dropby.com/NYSIIS.html which illustrates the differences as each
replacement occurs between the original NYSIIS and refined / alternate NYSIIS.
I would gladly put more research in regards to specifications, coming up with
samples for tests, tests themselves, and even development if this was something
the team wished to see become a part of commons-codec. Some other Google
searches does yield some implementations that I am using for the time being but
something that is already packaged into something I use daily would be gladly
welcome in my books.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)