Pádraig Brady wrote:
> Bruno Haible wrote:
>> Hi Jim,
> 
> Thanks for looking at this Bruno.
> 
>> In coreutils/src/join.c, there is a FIXME mentioning that the -i option for
>> case insensitive comparison of the input lines does not work in multibyte
>> locales.
> 
> Utils that have this issue are:
> join -i, uniq -i, sort -f, ptx -f

Note as well as folding case I think it might
be useful to fold other forms like:
  Diacritics:  À -> A
  Enclosed:  \u24b6 -> A
  Stylistic: \uff21-> A

I.E. have more general function like:
ulc_coll(fold={Case|Diactritics|Stylistic}, ...);

I'm not suggesting it's required or anything,
just something to keep in mind while designing
these interfaces.

cheers,
Pádraig.


Reply via email to