[CODE4LIB] Modern NACO Normalization (esp. in java?)

2012-04-11 Thread Bill Dueber
I'm about to embark on trying to write code to apply NACO normalization to strings (not for field-to-field comparisons, but for correctly sorting things). I was driven to this by a complaint about how some Arabic manuscript titles are sorting. My end goal is a Solr filter, so I'm most interested
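
For readers skimming the thread, here is a rough Java sketch of the kind of sort-key normalization being asked about. It is not the full NACO rule set and is not the code posted later in the thread: the class name NacoSortKey, the choice to lowercase, and the blanket replacement of punctuation and symbols with spaces are all assumptions made for illustration. The real rules treat specific characters individually.

```java
import java.text.Normalizer;
import java.util.Locale;

// Hypothetical helper: a simplified, NACO-inspired sort key.
// Assumption: this only approximates the real rules (no per-character
// special cases, no subfield delimiter handling).
final class NacoSortKey {

    private NacoSortKey() {
    }

    static String normalize(String input) {
        if (input == null) {
            return "";
        }
        // Decompose so base letters and diacritics become separate code points.
        String s = Normalizer.normalize(input, Normalizer.Form.NFD);
        // Drop combining marks (macrons, dots below, and so on).
        s = s.replaceAll("\\p{M}+", "");
        // Fold case with a fixed locale so results do not vary by machine.
        s = s.toLowerCase(Locale.ROOT);
        // Assumption: treat every punctuation mark or symbol as a blank.
        s = s.replaceAll("[\\p{P}\\p{S}]+", " ");
        // Collapse runs of whitespace and trim the ends.
        return s.replaceAll("\\s+", " ").trim();
    }
}
```

A romanized title with a macron and a dot below, for example, would come out as plain lowercase ASCII words separated by single spaces. In a Solr setting, logic like this would normally sit inside a custom token filter applied to a dedicated sort field, and that wiring is left out here.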

Re: [CODE4LIB] Modern NACO Normalization (esp. in java?)

2012-04-11 Thread Bill Dueber
Wow! Thanks, Ralph! This is great! On Wed, Apr 11, 2012 at 12:04 PM, LeVan,Ralph le...@oclc.org wrote: I'm pretty sure attachments don't work on the list, so I'm just pasting my NACO normalizer below. Note that there are 2007 versions of the normalize() method in there. This is used for all

Re: [CODE4LIB] Modern NACO Normalization (esp. in java?)

2012-04-11 Thread Dan Scott
Very interesting, Ralph. Are you / OCLC offering that code under any particular license(s)? (The Evergreen code, for what it's worth, has a project-level license stating that Evergreen code is offered under the GPL v2 with the "or later" clause). LeVan,Ralph le...@oclc.org 4/11/2012 12:04 PM

Re: [CODE4LIB] Modern NACO Normalization (esp. in java?)

2012-04-11 Thread LeVan,Ralph
Apache 2. To cover my butt, this code was originally released as part of our SiteSearch product which we made Open Source. This is just the latest incarnation, but just as open. Stick this at the top of that code, if you expect to reuse it. Thanks for asking! Ralph /** * Copyright 2012 OCLC

Re: [CODE4LIB] Representing geographic hiearchy in linked data

2012-04-11 Thread Ethan Gruber
Thanks to everyone for the suggestions. Ethan On Tue, Apr 10, 2012 at 7:43 PM, Simon Spero sesunc...@gmail.com wrote: On Mon, Apr 9, 2012 at 7:13 PM, Ethan Gruber ewg4x...@gmail.com wrote: Ancient geographic entities. Athens is in Attica. Sardis is in Lydia (in Anatolia, for example).