On Sat, 2 Jan 2010 15:26:04 -0500, Tony Harminc wrote:
>
>Sorting is a cultural thing (where "culture" can include C programming
>as much as French-in-France, French-in-Canada, English, German, etc.)
>And each culture may have multiple sort orders appropriate for
>different circumstances. For example French dictionaries have a
>different order from French phonebooks; a French phonebook user may
>expect to find the name duPont under P, not under D. Even in English,
>where do you expect to find castor-oil in the list above? Surely the
>hyphen should be given lower weighting than even the letters that
>follow it, so that it comes out after castor bean. How about Caesar vs
>Cæsar or Noel vs Noël? Google search knows that they are the same
>thing, but Gmail flunks the latter in its spelling checker. What does
>the "ls" command think?
>
OK.  As Shane suggested, it depends on Locale setting (same for
DFSORT).  With OpenSolaris's default (whatever):

506 $ ls -1
Документы 
Caesar
Cæsar
castor
Castor
castor bean
castor-oil
Noel
Noël
507 $ 

(Pasted into browser window with UTF-8 selected.)
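
To make the locale dependence concrete, here is a rough Python sketch that sorts the same names two ways: by raw code point, and by a made-up "dictionary-like" key. The fold_key function is purely my own illustration (it is not what ls, DFSORT, or any real locale does): it expands the ae ligature, drops combining marks, ignores case and hyphens, and uses the original spelling only as a tiebreaker.

```python
import unicodedata

names = ["Документы", "Caesar", "Cæsar", "castor", "Castor",
         "castor bean", "castor-oil", "Noel", "Noël"]

def fold_key(s):
    """Hypothetical dictionary-style key; an assumption, not a real locale."""
    # Expand the ligature so that Cæsar compares equal to Caesar.
    expanded = s.replace("æ", "ae").replace("Æ", "AE")
    # Decompose accented letters and drop the combining marks.
    decomposed = unicodedata.normalize("NFD", expanded)
    base = "".join(c for c in decomposed if not unicodedata.combining(c))
    # Ignore case and hyphens at the primary level.
    primary = base.casefold().replace("-", "")
    # The original string only breaks ties between equal primary keys.
    return (primary, s)

# Plain code point order: capitals first, Cyrillic last.
codepoint_order = sorted(names)

# Folded order: variants of the same word end up adjacent.
folded_order = sorted(names, key=fold_key)

print(codepoint_order)
print(folded_order)
```

With this key, castor-oil lands after castor bean, and Caesar/Cæsar and Noel/Noël sit next to each other. A real locale would make different choices, which is rather the point.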

Are you suggesting that diacritical marks should be
considered embellishments, lacking semantic significance?
Ask a Spanish speaker whether "año" is the same as "ano".
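
For what it's worth, the Spanish alphabet treats ñ as a letter in its own right, filed between n and o. A small Python sketch of that rule follows; SPANISH_ALPHABET, RANK, and spanish_key are names I made up for illustration, and the handling of other accents (á, é, etc. folded to their base letter) is a simplifying assumption rather than a full collation.

```python
import unicodedata

# The 27-letter Spanish alphabet, with ñ between n and o.
SPANISH_ALPHABET = "abcdefghijklmnñopqrstuvwxyz"
RANK = {ch: i for i, ch in enumerate(SPANISH_ALPHABET)}

def spanish_key(word):
    """Hypothetical sort key: ñ is distinct, other accents are folded."""
    # NFC turns "n" + combining tilde into the single letter ñ.
    composed = unicodedata.normalize("NFC", word).casefold()
    key = []
    for ch in composed:
        if ch not in RANK:
            # Fold letters such as á or ü to their base letter.
            ch = unicodedata.normalize("NFD", ch)[0]
        # Anything still unknown sorts after the alphabet.
        key.append(RANK.get(ch, len(RANK)))
    return key

words = ["apio", "año", "antes", "ano"]

by_codepoint = sorted(words)                  # ñ sorts after every ASCII letter
by_spanish = sorted(words, key=spanish_key)   # ñ sorts between n and o

print(by_codepoint)
print(by_spanish)
```

Under this key "año" and "ano" get different keys, so the tilde is anything but an embellishment, and "año" files before "apio" instead of after it.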

-- gil
