https://bugzilla.wikimedia.org/show_bug.cgi?id=8732

--- Comment #8 from Philippe Verdy <[email protected]> 2010-04-14 12:00:18 UTC 
---
You're true, I still wait for a builtin parser function that can convert any
string (purged from the HTML/XML formatting) into a binary sortable collation
key (even if it's not readable, but still prses OK as a valid Unicode
plain-text string).

Performing true collation on the client side would require much more extensive
work and would imply much more compatibility and performance problems when
implementing it in a Javascript function (this is possible but would be
horribly slow for the site navigation, when such collation keys can be informed
and cached on the server side).

This would also help fixing the collation order of entries added in categories.
One problem is to have a builtin parser function that can create collection
keys according to specific languages (not necessarily the main language of the
wiki): the collation order in categories should, by default, reflect the
collation order expected for categories specific to the language they index
(notably in Wiktionary: yuo won't sort the English/French categories like
German, Swedish, Hindi, Chinese, Korean categories). Finally, there shoud also
exist a way to:
* either index a category by multiple collations (for Chinese notably: by
radical/stroke, or by pinyin : two separate categories ?)
* or offer to users a way to change the presentation order of the same
categories (for languages that need multiple ones like Chinese), by setting up
the category page with a list of additional collation orders that can be
queried directly from the server, using the suer's preference or a user
selection via a link containing an addtional HTTP query parameter, such as:
** "http://.../wiki/Category:...?collate=zh-Latn"; for Pinyin,
** "http://.../wiki/Category:...?collate=zh-Hani"; for traditional
radical/stroke order
(this should however be cachable, and should be honored by the server, for
performance reasons, only if the category page was prepared with a list of
possible collation orders, the default collation being based on the Unicode's
DUCET, or possibly on the collation rules for the main language of the wiki if
it's a localized wiki and not an international multilingual wiki like Commons,
which should still use the DUCET by default, i.e. the collation order for the
"root" locale in Unicode's CLDR database)

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
You are on the CC list for the bug.

_______________________________________________
Wikibugs-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to