Re: Unicode sorting

Devin Asay Fri, 02 Jun 2006 08:56:17 -0700

AAAAARRRRGH! Disregard the previous post. I neglected to change thefunction call from the old sort function to the renamed new sortfunction. No wonder it was working!


I'll fix it and let you know how it REALLY works.


Devin
On Jun 1, 2006, at 5:21 PM, Dar Scott wrote:

Wow!  Great news for sorting Unicode!

On May 30, 2006, at 5:08 PM, Devin Asay wrote:
I got your code to work by making some simple changes in thesortCodeFromRussian function:
Deven, I've been processing some bits of UTF-8, and somethingdawned on me that is probably known by the Unicode experts.
**** A lexical byte sort of well-formed UTF-8 will result in aUnicode code point sort! *****
That avoids the NUL problem in sort. That means that russianLex()can return the UTF-8 of the string with your character conversions.
I think the replace command will work with UTF-8, so you can evenavoid a character loop. All you need is 34 replaces and then areturn. OK, that might actually be slower than a character loop.
Dar
Unicode Sophomore


_______________________________________________
use-revolution mailing list
[email protected]
Please visit this url to subscribe, unsubscribe and manage yoursubscription preferences:
http://lists.runrev.com/mailman/listinfo/use-revolution


Devin Asay
Humanities Technology and Research Support Center
Brigham Young University

_______________________________________________
use-revolution mailing list
[email protected]
Please visit this url to subscribe, unsubscribe and manage your subscription 
preferences:
http://lists.runrev.com/mailman/listinfo/use-revolution

Re: Unicode sorting

Reply via email to