Re: [PHP-I18N] intl 1.0.0RC1

David Zülke Sun, 01 Jun 2008 01:50:33 -0700

Am 01.06.2008 um 07:43 schrieb Stanislav Malyshev:

Hi!
- I still think IntlDateFormatter vs the rest w/o Intl prefix isinconsistent. Can't we just prefix it with "Intl" across the board?Saves trouble down the road
Because frankly this prefix sucks and so do names likeIntlMessageFormatter. In order to avoid conflict with Date extensionI had to rename the date formatter, but I really don't want to makeit uglier than it must be.

What if there is a Number extension down the road. Or a Collatorextension. Or what if people already have classes calledNumberFormatter etc. That's not too unlikely. I agree that thoseprefixes are ugly, but I firmly believe that it is more important tohave consistency across the board. "Intl" prefix for all classes justmakes more sense.

- DateFormatter::parse() and DateFormater::localtime() should havethe second argument ($parse_pos) as a reference - it is supposed to"return" the position where an error occured, in case it could notparse the given value. ICU does have it this way, and I think PHPshould, too.
Please add it as feature request bug report.


Okay.

- IntlDateFormatter::format() does not accept DateTime (I thinkthat is a known issue)
- There is no way to use or retrieve milliseconds, so far
And these too. First of these is planned as for the second one itdepends on if ICU allows that.


I believe it does, yeah.

- What does Collator::sortWithKeys() do, exactly? Why not alwayshave this one? Why does the API have a "normal" sorting functionand an optimized one? Why not just always use the sorting withucol_getSortKey() keys? And why is there noCollator::asortWithKeys(), to keep the API consistent?
sortWithKeys generates collation keys for each entry prior tosorting, which is supposed to speed it up. But it depends on howmany entries there are - it may prove not efficient to store allthose keys.

But why are those internal differences exposed through the API. Ithink that is bad design. I as a caller should not have to botherthinking about the internal implementation of the two methods and thendecide which to use. That's why I'm using the extension, after all, tohave things convenient.

At least, there should be Collator::asortWithKeys(). But I reallybelieve that both sort() and asort() should use the variant withgenerated collation keys, if that one becomes faster as the array sizegrows.

You're saying "it may not prove efficient"... did you or anyonebenchmark it? ;) Having two methods with the same behavior, justdifferent implementations due to a gut feeling might not be the bestidea. I'm failing to get the extension compiled here on OS X, but willgo ahead and do benchmarks ASAP

- INTL_MAX_LOCALE_LEN is 64 - what if I have a longer localestring, with options?
Well, do you? We can increase it, but we have to have some limitsince ICU can't stomach overlong locales.

Absolutely...[EMAIL PROTECTED];collation=traditional;calendar=thai-buddhist is what I could come up with right now... 77 characters.

The other question is what happens if the string is longer than that?Does it get cut off or something?


Assume this:

sr_Latn_RS_REVISED@currency=USD;collation=traditional;mykeyword=myvalue;calendar=thai-buddhist

That is, in theory, legal. "mykeyword" would be parsed by my custommessage formatter implementation that can localize... ancient mayawall paintings. Or whatever. So locale identifier strings can be ofany length.


Maybe ext/intl should do this:

- Accept locale strings of arbitrarys length

- Parse them and throw out any keywords ICU cannot handle (i.e.everything except "collation", "currency" and "calendar", AFAIK)

- Hand the resulting string over to ICU

What confuses me, in general, is why locales are not implemented asobjects. Why do I have to pass a locale string to every locale-awarefunction?

Also... having uloc_acceptLanguageFromHTTP exposed in the API would bepretty neat ;) Since apparently, that does a mapping of e.g. "en-GB"to "en_UK" etc

And.. is there going to be Resources support in the future? AFAIK, theCLDR data is compiled to ICU resource bundles, right? That would allowreading and using the CLDR data of a locale. Like... reading localizedcountry and language names. Yummy. We currently have all this stuffimplemented in userland PHP code, which is, er, a bit slow :)



Cheers,

David

--
PHP Unicode & I18N Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php

Re: [PHP-I18N] intl 1.0.0RC1

Reply via email to