http://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=14824
Bug ID: 14824
Summary: Fix sorting of Norwegian vowels
Change sponsored?: ---
Product: Koha
Version: master
Hardware: All
OS: All
Status: NEW
Severity: normal
Priority: P5 - low
Component: Searching
Assignee: [email protected]
Reporter: [email protected]
QA Contact: [email protected]
Currently, if you sort bibliographic search results by title, the Norwegian
vowels ÆØÅ are not sorted correctly. Specifically, with a result list
containing 4 results, I was getting this for an A-Z (or A-Å) sort:
Å...
Æ...
H...
Ø...
Which is just about as incorrect as it gets.
As documented here: http://www.indexdata.com/zebra/doc/character-map-files.html
sorting in Zebra is governed by the "lowercase" and "uppercase" settings in
sort-string-utf.chr. That page suggests this for Scandinavian languages:
lowercase {0-9}{a-y}üzæäøöå
uppercase {0-9}{A-Y}ÜZÆÄØÖÅ
But with that config I got warnings when rebuilding the Zebra index:
10:05:00-15/09 zebraidx(26924) [warn] duplicate entry for charmap from 'ä'
10:05:00-15/09 zebraidx(26924) [warn] duplicate entry for charmap from 'Ä'
10:05:00-15/09 zebraidx(26924) [warn] duplicate entry for charmap from 'ö'
10:05:00-15/09 zebraidx(26924) [warn] duplicate entry for charmap from 'Ö'
10:05:00-15/09 zebraidx(26924) [warn] duplicate entry for charmap from 'ü'
10:05:00-15/09 zebraidx(26924) [warn] duplicate entry for charmap from 'Ü'
I assume this is because these characters are already mapped to other chars.
So I will propose a patch that uses this simplified setting:
lowercase {0-9}{a-z}æøå
uppercase {0-9}{A-Z}ÆØÅ
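For reference, the ordering this setting is meant to produce can be sketched outside Zebra. The Python snippet below is only an illustration of the intended collation (digits, then a-z, then æ, ø, å); it does not reproduce how Zebra reads .chr files, and the names ALPHABET, RANK and norwegian_key are made up for this example.

```python
# Illustration only: rank characters in the order given by the proposed
# "lowercase" line, i.e. digits, then a-z, then æ, ø, å.
ALPHABET = "0123456789abcdefghijklmnopqrstuvwxyzæøå"
RANK = {ch: i for i, ch in enumerate(ALPHABET)}


def norwegian_key(title):
    """Build a sort key from per-character ranks (hypothetical helper)."""
    # Characters outside ALPHABET sort after all known ones, by code point.
    return [RANK.get(ch, len(ALPHABET) + ord(ch)) for ch in title.lower()]


# The four initial letters from the result list in this report.
titles = ["Å", "Æ", "H", "Ø"]
print(sorted(titles, key=norwegian_key))  # ['H', 'Æ', 'Ø', 'Å']
```

With this key the four results come out as H, Æ, Ø, Å, which is the order a Norwegian A-Å sort should give.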