Re: [Dev] Index issues

Reid Ellis Wed, 25 Jan 2006 09:25:41 -0800

Most apps that deal with this have a "default encoding" preferencewhich the user can set to whatever they like, since they might knowwhat encoding most of their email is in. I assume that Chandler'slocale is derived from the OS's locale?


Reid


On Tue Jan 24 2006, at 21:21, Andi Vajda wrote:

On Tue, 24 Jan 2006, Brian Kirsch wrote:
Andi,
What do recommend doing in the case where a locale is not know forthe text?
Email is a great example, in most cases no language (locale)headers are supplied.
When no locale is supplied, the encoding supplied could be used forclues forusing a set of heuristics helping to 'guess' a locale. In the caseof email, for example, the domain of the sender may also provide aclue.
That guess may be better than nothing but not by much...

A good guess at this is important for full text indexing.
When sorting email addresses, however, I'd think that the Chandleruser's locale would prevail over the potential locale of the databeing sorted.
Andi..
-Brian

Brian Kirsch - Email Framework Engineer
Open Source Applications Foundation
543 Howard St. 5th Floor
San Francisco, CA 94105
(415) 946-3056
http://www.osafoundation.org



Andi Vajda wrote:
On Tue, 24 Jan 2006, Brian Kirsch wrote:
One issue to remember, if we are sorting on the name of the useri.e. Brian Kirsch <[EMAIL PROTECTED]> then the sortorder will need to be localized with PyICU.
Last year, I added a new index class called StringIndex. Itunderstands locale and uses PyICU's collator support forcomparing strings.Similarly, I realized recently that for full text indexing'ssake, LOBs (at least, if not all attributes) should also have alocale aspect so that when full text indexing (and queries) arerun, an analyzer that is appropriate for the language of thelocale is used to break up the text (or queries) in tokens.
Andi..


_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

Open Source Applications Foundation "Dev" mailing list
http://lists.osafoundation.org/mailman/listinfo/dev

Re: [Dev] Index issues

Reply via email to