Hi Vito and Tomaž,

On Thu, 31 Dec 2009, Tomaž Zupan wrote:
> On Thursday 31 December 2009 06:58:38, Vitomir Cvitanovic wrote:
> > I think, and as far as I know (perhaps someone from Slovenia could verify
> > that) that both Croatia and Slovenia have the same sort order.
> Yes, it is.

Thank you for your confirmation. I'll update the Slovenian and Croatian CPs in the Harbour repository, but I have some questions for you.

Neither the Slovenian nor the Croatian collation defines the position of the X and Y characters. OK, but I do not believe that you never have to sort words containing them, so what is the common practice in such situations? I.e. how are words starting with X or Y sorted in printed dictionaries? Or maybe it depends on the situation, and different rules are used in a dictionary, an encyclopedia and a phone book? Or maybe this area is still undefined, so everyone has to make their own arbitrary decision about it? It's possible that there are some differences between Slovenia and Croatia, so I would like to hear the answer from both of you.

The second part is addressed to Vito and Croatian users. You have three digraph letters: Dž, Lj and Nj. As far as I can see, they create a few problems. These digraphs should be sorted as single letters in a precisely defined order, which is not compatible with simple single-character sorting. Dž is sorted as expected, between Dz and Đ, but Lj and Nj are not. Lj should be sorted between Lz and M, and Nj between Nz and O.

The current Harbour code allows such a collation to be defined, but I do not know if it's important to introduce it. It is certainly not compatible with Clipper, so if we add it then we also have to keep a CP which is strictly ntxcor.obj compatible. You may even want to duplicate all existing Croatian CPs, because the current ones, without native support for digraphs, are in fact the same as the Slovenian CPs with only different names. Do you think it's important to add support for a Croatian (and maybe Latin Serbian) collation respecting the special order of digraphs?

I also found information that these digraphs were not well chosen, because sometimes such a character combination should be treated as two separate letters, and the real digraphs have their own Unicode values. Is that true? If yes, can you tell me precisely what these Unicode values are for the upper and lower case letters?
Do you use them in real life? Are they supported by some CPs, and/or do you have hardware support (keyboards?) for using them? Is it something that has to be resolved in the future, or will you rather try to adopt existing solutions? For example, in Poland our own national keyboard layout is dying, and now most of us prefer the standard QWERTY layout with ALT-GR used to insert Polish national characters (we call it the 'Polish programmer keyboard').

The answers should also help me in the future when I work on Unicode support in Harbour.

best regards,
Przemek
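
P.S. To make the digraph ordering described above concrete, here is a minimal sketch in Python (not Harbour code, and not how Harbour CPs are implemented). It assumes the usual 30-letter Croatian alphabet order and treats every "dž", "lj" and "nj" pair as one letter. The names ALPHABET, DIGRAPHS and sort_key are made up for this illustration, and placing undefined characters such as X and Y after all defined letters is only an arbitrary placeholder, since that is exactly the open question.

```python
# Assumed Croatian alphabet order with digraphs as single letters.
ALPHABET = [
    "a", "b", "c", "č", "ć", "d", "dž", "đ", "e", "f",
    "g", "h", "i", "j", "k", "l", "lj", "m", "n", "nj",
    "o", "p", "r", "s", "š", "t", "u", "v", "z", "ž",
]
WEIGHT = {letter: index for index, letter in enumerate(ALPHABET)}
DIGRAPHS = ("dž", "lj", "nj")


def sort_key(word):
    """Return a list of letter weights, consuming digraphs as one letter."""
    text = word.lower()
    key = []
    i = 0
    while i < len(text):
        pair = text[i:i + 2]
        if pair in DIGRAPHS:
            # Two characters, one collation weight.
            key.append(WEIGHT[pair])
            i += 2
        else:
            # Arbitrary placeholder: characters outside the alphabet
            # (x, y, q, w, digits...) go after all defined letters.
            key.append(WEIGHT.get(text[i], len(ALPHABET) + ord(text[i])))
            i += 1
    return key


words = ["ljeto", "luk", "more", "lopta", "njiva", "noga", "nula", "oko"]
print(sorted(words))                # plain code-point order
print(sorted(words, key=sort_key))  # digraph-aware order
```

With plain character sorting "ljeto" comes before "lopta", while with the key above it comes after "luk" and before "more", which is the "Lj between Lz and M" rule. Note that this sketch cannot tell a real digraph from two separate letters that happen to be adjacent, so it does not address that case.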
