> -----Original Message-----
> From: [EMAIL PROTECTED] 
> [mailto:[EMAIL PROTECTED] On Behalf Of 
> Brion Vibber
> Sent: 24 November 2008 19:17
> To: Wikimedia developers
> Subject: Re: [Wikitech-l] UTF8 Normalization
> 
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
> 
> Jared Williams wrote:
> >     I patched UtfNormal.php to use the new intl extension's 
> normalization 
> > function, http://php.net/manual/en/normalizer.normalize.php
> > 
> >     But running UtfNormalTest.php causes 200+ errors, 
> whilst the PHP 
> > normalization routines are error free.
> > 
> >     So kind at a standstill wondering what the problem is. 
> I guess WM use 
> > utf8_normalize() which is based on ICU like intl, does than 
> have the 
> > same errors?
> 
> It may be that intl is using an ICU library which supports an 
> older version of Unicode. UtfNormal is currently built using 
> the Unicode 5.1 files, as I recall, so will have the 5.1 test cases.
> 
> For the most part, this means that the newer version includes 
> mapping rules for newly-added characters -- normalization is 
> deliberately designed to be compatible and existing rules 
> should never change.
> (Though there might be a few exceptions changed as errata, I forget.)
> 

Ah, php.net provided intl is built with ICU 3.6..which is unicode 5.0

Jared


_______________________________________________
Wikitech-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikitech-l

Reply via email to