Re: [Harbour] CodePage different behaviour between 2.0.0beta3 and 2.0.0 (Win)

Przemysław Czerpak Thu, 31 Dec 2009 00:55:08 -0800

Hi Vito and Tomaž,

> > Thursday 31 December 2009 06:58:38 je Vitomir Cvitanovic napisal:
> > I think, and as far as I know (perhaps someone from Slovenia could verify
> > that) that both Croatia and Slovenia have the same sort order. 
> On Thu, 31 Dec 2009, Tomaž Zupan wrote:
> Yes, it is.


Thank you for your confirmation.

I'll update Slovenian and Croatian CPs in Harbour repository but I have
some questions to you.

Neither Slovenian nor Croatian collation define the relations to X and Y
characters. OK but I do not believe that you never have to sort words
having them so what is the common practice in such situations? I.e. how
words starting with X or Y are sorted in printed vocabularies? Or maybe
it depends on situation and different rules are used in vocabulary,
encyclopedia and phone book? Or maybe these area is still undefined so
everyone has to take his own arbitrary decision about it?
It's possible that there are some differences between Slovenia and
Croatia so I would like to hear the answer from both of you.

The second part is addressed to Vito and Croatian users.
You have three digraph letters: Dž, Lj and Nj.
As I can see it created few problems. These digraphs should be sorted
as single letter in precisely defined order which is not compatible
with simple single character sorting. Dž is sorted as expected between
Dz and Đ but Lj and Nj no. Lj should be sorted between Lz and M and
Nj between Nz and O. Current Harbour code allows to define such collation
but I do not know if it's important to introduce it. For sure it's
not compatible with Clipper so if we add it then we have to keep also
CP which is strictly ntxcor.obj compatible. Maybe even you will want to
duplicate all existing Croatian CPs because though in fact current ones
without native support for digraphs are the same as Slovenian CPs with
only different names.
Do you think it's important to add support for Croatian (and maybe Latin
Serbian) collation respecting special order of digraphs?

I also found the information that these digraphs were not well chosen
because sometimes such character combination should be used as separate
letters and real digraphs have own Unicode values.
Is it true?
If yes can you precisely tell me what are these Unicode values for upper
and lower letters? Do you use them in real life? Are they supported by some
CPs and/or do you have hardware support (keyboards?) for using them?
Is it something what have to be resolved in the future or rather you will
try to adopt existing solutions - i.e. in Poland our own national keyboard
layout is dieing and now most of us prefer standard QWERTY layout with
ALT-GR used to insert Polish national characters (we call it 'Polish
programmer keyboard').

The answers should help me also in the future when I work on Unicode
support in Harbour.

best regards,
Przemek
_______________________________________________
Harbour mailing list (attachment size limit: 40KB)
[email protected]
http://lists.harbour-project.org/mailman/listinfo/harbour

Re: [Harbour] CodePage different behaviour between 2.0.0beta3 and 2.0.0 (Win)

Reply via email to