Juan Garcia created CODEC-235:
---------------------------------

             Summary: Revised / Alternate NYSIIS
                 Key: CODEC-235
                 URL: https://issues.apache.org/jira/browse/CODEC-235
             Project: Commons Codec
          Issue Type: New Feature
            Reporter: Juan Garcia
            Priority: Minor


I have been dabbling in phonetic algorithms lately and it is pleasing to see 
that I can find something under the commons umbrella for this area as well so 
thanks a ton for that.

In regards to feature requests NYSIIS as is implemented here I believe falls 
under the original release in the 1970s. Not being savvy in this area it took 
me too long to realize that the results that I was seeing from Oracles 
implementation as referenced here 
https://docs.oracle.com/cd/E18150_01/javadocs/SunMasterIndex/com/sun/mdm/index/phonetic/impl/Nysiis.html
 differs from what exists in commons-codec 1.10.

A series of searches brings me to this cool page 
http://www.dropby.com/NYSIIS.html which illustrates the differences as each 
replacement occurs between the original NYSIIS and refined / alternate NYSIIS.

I would gladly put more research in regards to specifications, coming up with 
samples for tests, tests themselves, and even development if this was something 
the team wished to see become a part of commons-codec. Some other Google 
searches does yield some implementations that I am using for the time being but 
something that is already packaged into something I use daily would be gladly 
welcome in my books.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to