On Sat, 2 Jan 2010 15:26:04 -0500, Tony Harminc wrote: > >Sorting is a cultural thing (where "culture" can include C programming >as much as French-in-France, French-in-Canada, English, German, etc.) >And each culture may have multiple sort orders appropriate for >different circumstances. For example French dictionaries have a >different order from French phonebooks; a French phonebook user may >expect to find the name duPont under P, not under D. Even in English, >where do you expect to find castor-oil in the list above? Surely the >hyphen should be given lower weighting than even the letters that >follow it, so that it comes out after castor bean. How about Caesar vs >C�sar or Noel vs No�l? Google search knows that they are the same >thing, but Gmail flunks the latter in its spelling checker. What does >the "ls" command think? > OK. As Shane suggested, it depends on Locale setting (same for DFSORT). With OpenSolaris's default (whatever):
506 $ ls -1 Документы Caesar C�sar castor Castor castor bean castor-oil Noel No�l 507 $ (Pasted into browser window with UTF-8 selected.) Are you suggesting that diacritical marks should be considered embellishments, lacking semantic significance? Ask a Spanish speaker whether "a�o" is the same as "ano". -- gil ---------------------------------------------------------------------- For IBM-MAIN subscribe / signoff / archive access instructions, send email to [email protected] with the message: GET IBM-MAIN INFO Search the archives at http://bama.ua.edu/archives/ibm-main.html

