I meant to mention that byte ordering does some strange things to Vietnamese dictionaries. The Vietnamese script is a Latin script, but because it uses some odd characters code point ordering results in illogical and non-alphabetical entry ordering. For example, the "d" with a line through it (đ) gets relegated to near the end of the dictionary instead of after the regular "d" or anything with an apostrophe at the beginning of a word or phrase gets moved to the top of the list regardless of the first letter (such as 'tis). I am supportive of the IIRC general opinion. Let the module creator worry about the ordering. Otherwise you get some very strange dictionary behavior.

Daniel

Eeli Kaikkonen wrote:
Quoting DM Smith <[EMAIL PROTECTED]>:

  
Found an interesting thread on ordering dictionary entries. Currently
we use byte ordering for the key which results in UTF-8 or Latin-1
(cp1252) code point ordering.
    

There has been some discussion about this. IIRC the general opinion  
was that the original work has the right order and it should be  
respected or at least it should be possible to follow it. Take the  
Liddell-Scott for example. I don't know if there even exists a locale  
for that which could give correct results. With these kind of works  
it's not only about practicality but also faithfulness to the  
original. Many who will use the LS module have already used the  
original and want the same order. Currently it's very difficult to  
find some words with BibleCS.

The thread was about BibleCS bugs, again IIRC.

--Eeli Kaikkonen


_______________________________________________
sword-devel mailing list: [email protected]
http://www.crosswire.org/mailman/listinfo/sword-devel
Instructions to unsubscribe/change your settings at above page

  

-- 
PMBX license 1502 
_______________________________________________
sword-devel mailing list: [email protected]
http://www.crosswire.org/mailman/listinfo/sword-devel
Instructions to unsubscribe/change your settings at above page

Reply via email to