With the second approach, can it be a space and a single letter, regardless of case? In at least one example (k.d. lang), the authorized version is in lowercase, and there are probably others.
On Wed, Mar 7, 2018 at 2:29 PM, Josh Stompro <[email protected]> wrote: > Hello, I’m curious what the correct approach is for dealing with multiple > entries in the browse and facet lists, when the difference is just the > ending period/full stop. > > > > For example, Some of the records have > > =100 1\$aAdler-Olsen, Jussi,$eauthor. > > And some have > > =100 1\$aAdler-Olsen, Jussi. > > > > This results in two facet lines and two browse lines for the same author. > > > > Is the correct approach to normalize the data? > > > > Or would an index normalizer be appropriate here, something to remove > periods when they are not preceded by a space and a single capital letter? > > Something like “r/([^ ][^A-Z])(\.)$/$1/” > > > > Thanks > > > > Lake Agassiz Regional Library - Moorhead MN larl.org > > Josh Stompro | Office 218.233.3757 EXT-139 <(218)%20233-3757> > > LARL IT Director | Cell 218.790.2110 <(218)%20790-2110> > > > -- Sarah Childs Technical Services Department Head Hussey-Mayfield Memorial Public Library 250 North Fifth Street Zionsville, IN 46077 317-873-3149 x13330 [email protected]
