I'm about to embark on trying to write code to apply NACO normalization to
strings (not for field-to-field comparisons, but for correctly sorting
things). I was driven to this by a complaint about how some Arabic
manuscript titles are sorting.
My end goal is a Solr filter, so I'm most interested
Wow! Thanks, Ralph! This is great!
On Wed, Apr 11, 2012 at 12:04 PM, LeVan,Ralph le...@oclc.org wrote:
I'm pretty sure attachments don't work on the list, so I'm just pasting
my NACO normalizer below. Note that there are 2007 versions of the
normalize() method in there. This is used for all
Very interesting, Ralph. Are you / OCLC offering that code under any particular
license(s)?
(The Evergreen code, for what it's worth, has a project-level license stating
that Evergreen code is offered under the GPL v2 with the "or later" clause.)
LeVan,Ralph le...@oclc.org 4/11/2012 12:04 PM
Apache 2.
To cover my butt, this code was originally released as part of our
SiteSearch product which we made Open Source. This is just the latest
incarnation, but just as open.
Stick this at the top of that code, if you expect to reuse it.
Thanks for asking!
Ralph
/**
* Copyright 2012 OCLC
Thanks to everyone for the suggestions.
Ethan
On Tue, Apr 10, 2012 at 7:43 PM, Simon Spero sesunc...@gmail.com wrote:
On Mon, Apr 9, 2012 at 7:13 PM, Ethan Gruber ewg4x...@gmail.com wrote:
Ancient geographic entities. Athens is in Attica. Sardis is in Lydia (in
Anatolia, for example).