Hello Sarah, Kathy Lussier pointed me to the existing bug and fix for the issue I described. Looks like it was included in 2.12. https://bugs.launchpad.net/evergreen/+bug/1308090
It looks like it does follow the logic that you described, space then single letter then period, the period is kept. Josh Stompro - LARL IT Director From: Open-ils-general [mailto:[email protected]] On Behalf Of Sarah Childs Sent: Wednesday, March 07, 2018 1:41 PM To: Evergreen Discussion Group <[email protected]> Subject: Re: [OPEN-ILS-GENERAL] Normalizing full stops in personal author names With the second approach, can it be a space and a single letter, regardless of case? In at least one example (k.d. lang), the authorized version is in lowercase, and there are probably others. On Wed, Mar 7, 2018 at 2:29 PM, Josh Stompro <[email protected]<mailto:[email protected]>> wrote: Hello, I’m curious what the correct approach is for dealing with multiple entries in the browse and facet lists, when the difference is just the ending period/full stop. For example, Some of the records have =100 1\$aAdler-Olsen, Jussi,$eauthor. And some have =100 1\$aAdler-Olsen, Jussi. This results in two facet lines and two browse lines for the same author. Is the correct approach to normalize the data? Or would an index normalizer be appropriate here, something to remove periods when they are not preceded by a space and a single capital letter? Something like “r/([^ ][^A-Z])(\.)$/$1/” Thanks Lake Agassiz Regional Library - Moorhead MN larl.org<http://larl.org> Josh Stompro | Office 218.233.3757 EXT-139<tel:(218)%20233-3757> LARL IT Director | Cell 218.790.2110<tel:(218)%20790-2110> -- Sarah Childs Technical Services Department Head Hussey-Mayfield Memorial Public Library 250 North Fifth Street Zionsville, IN 46077 317-873-3149 x13330 [email protected]<mailto:[email protected]>
