Re: Unicode sorting

2006-06-05 Thread Devin Asay
On Jun 2, 2006, at 5:09 PM, Dar Scott wrote: On Jun 2, 2006, at 2:57 PM, Devin Asay wrote: You're starting to convince me that UTF is the way to go. 8 8... Right. UTF-8. It was in my head but got stuck before it came out my fingers. UTF-8 Devin

Re: Unicode sorting

2006-06-02 Thread Dar Scott
On Jun 2, 2006, at 2:57 PM, Devin Asay wrote: You're starting to convince me that UTF is the way to go. 8 UTF-8 I'm starting to convince myself, too. Dar

Re: Unicode sorting

2006-06-02 Thread Devin Asay
On Jun 2, 2006, at 2:12 PM, Dar Scott wrote: On Jun 2, 2006, at 9:45 AM, Devin Asay wrote: replace "Ж" with "ж" in lList I didn't know you could do that with the current editor. I had been suggesting a way to do that kind of thing using UTF-8 and was hoping a script editor publishe

Re: Unicode sorting

2006-06-02 Thread Dar Scott
On Jun 2, 2006, at 2:12 PM, Dar Scott wrote: On Jun 2, 2006, at 9:45 AM, Devin Asay wrote: replace "Ж" with "ж" in lList I didn't know you could do that with the current editor. I had been suggesting a way to do that kind of thing using UTF-8 and was hoping a script editor publisher

Re: Unicode sorting

2006-06-02 Thread Dar Scott
On Jun 2, 2006, at 9:45 AM, Devin Asay wrote: replace "Ж" with "ж" in lList I didn't know you could do that with the current editor. I had been suggesting a way to do that kind of thing using UTF-8 and was hoping a script editor publisher would pick up on it. However, the 2.7.1 edit

Re: Unicode sorting

2006-06-02 Thread Devin Asay
AGH! Disregard the previous post. I neglected to change the function call from the old sort function to the renamed new sort function. No wonder it was working! I'll fix it and let you know how it REALLY works. Devin On Jun 1, 2006, at 5:21 PM, Dar Scott wrote: Wow! Great news for

Re: Unicode sorting

2006-06-02 Thread Devin Asay
Okay, Dar, I tried your idea. It works like a dream, at least for my problem (Cyrillic-range Unicode). I didn't even have to convert upper to lower case! In fact, I'm not even sure exactly why this works. On Jun 1, 2006, at 5:21 PM, Dar Scott wrote: Wow! Great news for sorting Unicode! On

Re: Unicode sorting

2006-06-01 Thread Dar Scott
Wow! Great news for sorting Unicode! On May 30, 2006, at 5:08 PM, Devin Asay wrote: I got your code to work by making some simple changes in the sortCodeFromRussian function: Devin, I've been processing some bits of UTF-8, and something dawned on me that is probably known by the Unicode e
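
The message above is cut off, so the following is only an assumption about the kind of property under discussion, not a record of what Dar wrote. One well-known property of UTF-8 is that comparing encoded strings byte by byte gives the same order as comparing them code point by code point, and that no character other than U+0000 produces a zero byte. A minimal Python sketch, with a made-up word list:

```python
# Hypothetical sample data; any list of Unicode strings would do.
words = ["яблоко", "арбуз", " пробел", "zebra", "банан"]

# Order by Unicode code point, one character at a time.
by_code_point = sorted(words, key=lambda s: [ord(ch) for ch in s])

# Order by the raw UTF-8 bytes of each string.
by_utf8_bytes = sorted(words, key=lambda s: s.encode("utf-8"))

# The two orders agree, and none of the encoded strings contains a NUL byte.
same_order = by_code_point == by_utf8_bytes
has_nul = any(0 in w.encode("utf-8") for w in words)

print(by_utf8_bytes)
print(same_order, has_nul)
```

Code point order is not Russian dictionary order in every case (for example, ё sits outside the contiguous а..я block), so this only shows why a byte-oriented sort behaves sensibly on UTF-8 data.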

Re: Unicode sorting

2006-06-01 Thread Kenji Kojima
Hi, This is Japanese sorting. go url "http://www.kenjikojima.com/runrev/handbook/download/JpnSortStudy.rev" At least it works in Japanese. The last card is sorted by item 2 that is sorted by phonetic characters. -- Kenji Kojima http://www.kenjikojima.com/

Re: Unicode sorting

2006-05-30 Thread Dar Scott
On May 30, 2006, at 5:08 PM, Devin Asay wrote:

    ## Devin's changes - it turns out leaving the code points in decimal works perfectly,
    ## and I only had to make a couple of adjustments.
    if unicodePoint > 1039 and unicodePoint < 1072 then -- ignore case
      add 32 to unicodePoint
    else if un
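
For readers who want to try the case-folding rule quoted above outside Revolution, here is a rough Python sketch. It mirrors only the visible part of the snippet (code points 1040 to 1071, that is U+0410 to U+042F, shifted up by 32); the truncated `else if` branch is not reproduced, and the names `sort_code_from_russian_char`, `sort_key`, and the word list are hypothetical.

```python
def sort_code_from_russian_char(ch):
    """Return a case-insensitive sort code for a single character."""
    code_point = ord(ch)
    # 1040..1071 are the basic uppercase Cyrillic letters А..Я.
    # Adding 32 maps each one onto its lowercase partner а..я (1072..1103).
    if 1039 < code_point < 1072:
        code_point += 32
    return code_point


def sort_key(line):
    """Build a list of sort codes so lines compare letter by letter."""
    return [sort_code_from_russian_char(ch) for ch in line]


# Hypothetical sample data with mixed case.
words = ["Яблоко", "арбуз", "Банан", "вишня"]
sorted_words = sorted(words, key=sort_key)
print(sorted_words)
```

Ё (U+0401) and ё (U+0451) fall outside the 1040 to 1103 block, so this sketch leaves them where their code points put them.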

Re: Unicode sorting

2006-05-30 Thread Devin Asay
Dar, I got your code to work by making some simple changes in the sortCodeFromRussian function:

    function sortCodeFromRussianChar utf16Char
      set the useUnicode to true
      put charToNum(utf16Char) into unicodePoint
      ## Devin's changes - it turns out leaving the code points in decimal works pe

Re: Unicode sorting

2006-05-30 Thread Devin Asay
Dar, Thanks for the code! I'll test it at the earliest opportunity. Devin On May 27, 2006, at 2:05 PM, Dar Scott wrote: On May 27, 2006, at 9:12 AM, Devin Asay wrote: For the Russian (don't know if this will come thru in your email reader): Я вижу вас. The Unicode is (omitting the "U+" con

Re: Unicode sorting

2006-05-27 Thread Dar Scott
On May 27, 2006, at 9:12 AM, Devin Asay wrote: For the Russian (don't know if this will come thru in your email reader): Я вижу вас. The Unicode is (omitting the "U+" convention): 042F 0020 0432 0438 0436 0443 0020 0432 0430 0441 002E But what Rev is seeing during sort is a series of s
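
The code point list quoted above can be reproduced with a few lines of Python, shown here only as a quick cross-check of the values in the message (the variable names are arbitrary):

```python
text = "Я вижу вас."

# Format each character's code point as four uppercase hex digits,
# leaving off the "U+" prefix as the message does.
code_points = [f"{ord(ch):04X}" for ch in text]
listing = " ".join(code_points)
print(listing)
```

This prints `042F 0020 0432 0438 0436 0443 0020 0432 0430 0441 002E`, matching the list in the message.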

Re: Unicode sorting

2006-05-27 Thread Devin Asay
On May 26, 2006, at 5:26 PM, Dar Scott wrote: On May 26, 2006, at 3:57 PM, Devin Asay wrote: A 'sort lines' command, after converting upper case to lower, works fairly well, except that, curiously, a space sorts *after* all Cyrillic chars. I think I figured out what it is. 'sort' seem

Re: Unicode sorting

2006-05-26 Thread Dar Scott
On May 26, 2006, at 3:57 PM, Devin Asay wrote: A 'sort lines' command, after converting upper case to lower, works fairly well, except that, curiously, a space sorts *after* all Cyrillic chars. I think I figured out what it is. 'sort' seems to see NUL as the end of the string and U+002
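
To make the NUL hypothesis above concrete, this Python sketch shows how U+0020 looks in UTF-16 and what a routine that stops at the first zero byte would see. It is an illustration only: big-endian byte order is assumed, `c_string_view` is a hypothetical helper, and nothing here is a confirmed description of how Revolution's sort works internally.

```python
def c_string_view(data):
    """Return the bytes a NUL-terminated reader would see (up to the first 0x00)."""
    return data.split(b"\x00", 1)[0]


# In big-endian UTF-16 a space is the two bytes 00 20, so its first byte is NUL.
space_bytes = " ".encode("utf-16-be")

# Basic Cyrillic letters start with 04, so they contain no NUL byte.
letter_bytes = "а".encode("utf-16-be")

# A NUL-terminated reader would cut this string off where the space begins.
full = "Я вижу".encode("utf-16-be")
seen = c_string_view(full)

print(space_bytes.hex(), letter_bytes.hex())
print(full.hex(), seen.hex())
```

Under these assumptions the reader sees only the bytes for "Я" and never reaches the rest of the line.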

Re: Unicode sorting

2006-05-26 Thread Dar Scott
On May 26, 2006, at 3:57 PM, Devin Asay wrote: A 'sort lines' command, after converting upper case to lower, works fairly well, except that, curiously, a space sorts *after* all Cyrillic chars. That's weird. Space is U+0020. The basic Cyrillic lowercase letters seem to be U+0430 to U+044F, wher

Re: Unicode sorting

2006-05-26 Thread Devin Asay
Thanks, Dar. These tips will come in handy, and help confirm some of the things I was already thinking. A 'sort lines' command, after converting upper case to lower, works fairly well, except that, curiously, a space sorts *after* all Cyrillic chars. I'm sure that's because Rev is really do

Re: Unicode sorting

2006-05-25 Thread Dar Scott
On May 25, 2006, at 4:19 PM, Devin Asay wrote: I have a need to sort long lists of Cyrillic Unicode text according to Russian alphabet order. Before I start writing my own routine, has anyone figured out how to sort Unicode text lists? Here are some hints: 1. Trick: If you are sorting st

Re: Unicode sorting

2006-05-25 Thread Devin Asay
On May 25, 2006, at 4:23 PM, Mark Schonewille wrote: Set the useUnicode to true and use charToNum(char 1 of word 1 of unicodeString) to sort your list. This didn't work. Did you mean charToNum(char 1 to 2 of word 1 of unicodeString)? I see where you're going, but the problem is that none of

Re: Unicode sorting

2006-05-25 Thread Mark Schonewille
Set the useUnicode to true and use charToNum(char 1 of word 1 of unicodeString) to sort your list. Mark -- Economy-x-Talk Consultancy and Software Engineering http://economy-x-talk.com http://www.salery.biz

Unicode sorting

2006-05-25 Thread Devin Asay
I have a need to sort long lists of Cyrillic Unicode text according to Russian alphabet order. Before I start writing my own routine, has anyone figured out how to sort Unicode text lists? TIA, Devin Devin Asay Humanities Technology and Research Support Center Brigham Young University