I'm looking to index mailing addresses. I'd like to take into account these common abbreviations:
http://www.usps.com/ncsc/lookups/usps_abbreviations.html Would those be considered synonyms? I'm not exactly sure if I should use the WordNet modules or extend a built in analyzer and append my own filter. Has someone (in Java or .NET) already written a mailing address analyzer that handles normalizing things like "163 N 4th St" into "163 North Fourth Street"...if that's even a good thing to do?
