https://bugs.freedesktop.org/show_bug.cgi?id=63460

Eike Rathke <[email protected]> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |ASSIGNED
           Assignee|[email protected] |[email protected]
                   |desktop.org                 |

--- Comment #4 from Eike Rathke <[email protected]> ---
Let's check the current status/assignments and how to transition to the
assignments above. Because previously script tags weren't supported we
defined a workaround that distinguishes them by different country
assignments and we have

LANGUAGE_USER_KURDISH_TURKEY  0x0626  /* sublang 0x01, Latin script */  
[ku-TR]
LANGUAGE_USER_KURDISH_SYRIA   0x0A26  /* sublang 0x02, Latin script */  
[ku-SY]
LANGUAGE_USER_KURDISH_IRAQ    0x0E26  /* sublang 0x03, Arabic script */ 
[ku-IQ]
LANGUAGE_USER_KURDISH_IRAN    0x1226  /* sublang 0x04, Arabic script */ 
[ku-IR]

First said, 'ku' is an ISO 639-1 macrolanguage code and should be
avoided in distinct language assignments where more specific ISO 639-1
or 639-3 codes exist. That should be 'ckb', 'kmr' and 'sdh' instead,
where 'ckb' and 'sdh' already imply Arabic script.

So according to that PDF Microsoft assigned

Central Kurdish       0x0092  /* sublang 0x00, Latin script? */  [ku]
Central Kurdish       0x7C92  /* sublang 0x1f, Arabic script */  [ku-Arab]
Central Kurdish Iraq  0x0492  /* sublang 0x01, Arabic script */  [ku-Arab-IQ]

and no other Kurdish tags. Great :-/
http://msdn.microsoft.com/library/dd318693.aspx mentions _only_ 0x0492
and nothing else, you gotta love'em :-(

At least it can be detected that since Windows 7 they seem to use
sublanguage IDs in the range 0x1d to 0x1f for primary language with
script type and without country. Which doesn't help much because 0x0492
can not be deduced from that and is rather 0x0092 with the default 0x01
sublanguage ID.

When defining new language entries for LibreOffice we should always
include a country to form a default locale of that language to make it
available for locale fallback mechanisms.

So here the results would be something like

LANGUAGE_KURDISH_ARABIC_IRAQ         0x0492  [ckb-IQ]
LANGUAGE_KURDISH_ARABIC_IRAQ         0x0492  [ku-Arab-IQ]
LANGUAGE_KURDISH_ARABIC_IRAQ         0x0492  [ku-IQ]
LANGUAGE_OBSOLETE_USER_KURDISH_IRAQ  0x0E26  [ckb-IQ]

with mappings to the preferred ISO 639-3 code and the ISO 639-1
macrolanguage code and the old values and codes, with

#define LANGUAGE_OBSOLETE_USER_KURDISH_IRAQ  0x0E26
#define LANGUAGE_USER_KURDISH_IRAQ  LANGUAGE_KURDISH_ARABIC_IRAQ


How to transition the other three, ku-TR, ku-SY and ku-IR?
ku-TR would be kmr-Latn-TR, Northern Kurdish Latin script.
ku-SY would be kmr-Latn-SY, Northern Kurdish Latin script.
But ku-IR? Which Kurdish would it be? ckb-IR?

The defaults for the new languages would be

Kurdish, Northern, Latin script   [kmr-Latn-TR]
Kurdish, Central, Arabic script   [ckb-IQ]
Kurdish, Southern, Arabic script  [sdh-IQ]

References I consulted
http://www.ethnologue.com/language/kmr
http://www.ethnologue.com/language/ckb
http://www.ethnologue.com/language/sdh

-- 
You are receiving this mail because:
You are the assignee for the bug.
_______________________________________________
Libreoffice-bugs mailing list
[email protected]
http://lists.freedesktop.org/mailman/listinfo/libreoffice-bugs

Reply via email to