On Jul 6, 2011, at 1:23 PM, Eric E. Dolecki wrote:

> I found a Soundex NSString category here:
> 
> http://www.cocoadev.com/index.pl?NSStringSoundex
> 
> However, when I tried it out I get strange results...
> 
> //someString is set to different strings each time tested
> BOOL test = [someString soundsLikeString:@"Face"];
> NSLog(@"sounds like Face: %d",test);
> 
> Place = 0
> Ace = 0
> Mace = 0
> Fake = 1
> Testing = 0
> Brake = 0
> 
> It would seem something is off to get negatives on Place, Ace & Mace.
> 
> Any ideas or perhaps a different Soundex implementation I could try? I
> am already using Levenstein distance, but on it's own it's not good
> enough.

Soundex is based on having the first character providing a "unique" sound, with 
the rest of the character mapped together.

If you wanted to ignore that first character distinction, prepend your strings 
with a character like "X" that is unlikely to be a first character (since 
otherwise you'll get into trouble with its "drop double letters" step). The 
code appears to not care what the first character is (i.e., doesn't test for 
alphabetic characters), so you could probably use something like "#".


Glenn Andreas                      [email protected] 
The most merciful thing in the world ... is the inability of the human mind to 
correlate all its contents - HPL

_______________________________________________

Cocoa-dev mailing list ([email protected])

Please do not post admin requests or moderator comments to the list.
Contact the moderators at cocoa-dev-admins(at)lists.apple.com

Help/Unsubscribe/Update your Subscription:
http://lists.apple.com/mailman/options/cocoa-dev/archive%40mail-archive.com

This email sent to [email protected]

Reply via email to