From: Robert Muir [rcm...@gmail.com]:

[Toke: Indexing collation keys only helps with the speed problem]

> I don't really understand this measurement, collation keys are
> byte[]... (although its true we don't yet encode them this way in
> flex, I think we should)

I sounds like I'm missing something here... A quick check of running 20000 
random Strings of 30 characters from a-zA-Z0-1 + 20 different national 
characters through Java's Collator returned an average collatorKey-length of 
175 bytes. On http://wiki.apache.org/solr/UnicodeCollation it is stated that a 
standard sort is used, which - to my knowledge - loads the Strings into memory. 
For my quick test, this means a tripling of memory usage for the sort field 
when indexing collatorKeys?

Regards,
Toke Eskildsen
---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-dev-h...@lucene.apache.org

Reply via email to