ID:               29955
 User updated by:  jan at horde dot org
-Summary:          mb_strtoupper() / lower() broken with some locale
 Reported By:      jan at horde dot org
-Status:           Closed
+Status:           Open
 Bug Type:         mbstring related
 Operating System: Linux
 PHP Version:      5CVS, 4CVS (2004-09-02)
 Assigned To:      hirokawa
 New Comment:

Any chance this is going to be backported to PHP 5.2? I guess mbstring
is going to be obsolete with the Unicode and ICU support in PHP 6.


Previous Comments:
------------------------------------------------------------------------

[2005-12-23 14:56:27] [EMAIL PROTECTED]

Please try using this CVS snapshot:

  http://snaps.php.net/php6.0-latest.tar.gz
 
For Windows:
 
  http://snaps.php.net/win32/php6.0-win32-latest.zip

Turkish language support is added in CVS HEAD.
When mbstring.language = Turkish,
Turkish case filding will be performed in ISO-8859-9.
(upper:0x69 -> 0xdd, lower:0x49->0xfd)
Otherwise, normal case folding is performed.
(upper:0x69 -> 0x49, lower:0x49->0x69)


------------------------------------------------------------------------

[2005-12-23 14:28:29] [EMAIL PROTECTED]

"man iso-8859-9" will tell you.

"i" maps to "0xdd"
and
"0xfd" maps to "I"

See also:
http://www.eki.ee/letter/chardata.cgi?lang=tr+Turkish&script=latin

------------------------------------------------------------------------

[2005-12-23 14:24:06] jan at horde dot org

See http://www.gymel.com/charsets/ISO8859-9.html#U0069 and
http://www.gymel.com/charsets/ISO8859-9.html#U0049 under "Bemerkungen:"
(remarks).

------------------------------------------------------------------------

[2005-12-23 14:10:05] [EMAIL PROTECTED]

I don't know which is the standard way (0x49 or 0xdd).
In ISO-8859-9 (Turkish),
upper case of 'i' (0x69) always should be translated to 'I' 
with dot (0xdd) ?
If yes, please let me know some URLs which describe 
the mapping.



------------------------------------------------------------------------

[2005-05-13 08:00:26] [EMAIL PROTECTED]

Turkish locale would need complete overhaul on the 
entire extension because the locale's character 
properties and required case folding behaviour are very 
special.

PHP-ICU extension could support anything, but that's 
just an ongoing work by l0t3k.


------------------------------------------------------------------------

The remainder of the comments for this report are too long. To view
the rest of the comments, please view the bug report online at
    http://bugs.php.net/29955

-- 
Edit this bug report at http://bugs.php.net/?id=29955&edit=1

Reply via email to