Some of the special casing rules are not clear.

#   FINAL:  The letter is not followed by a letter of category L* (e.g. Ll,
Lt, Lu, Lm, or Lo).

What happens if the word with the final sigma is followed by a period or
comma etc.  It should be final.  But what about a hyphenated word?
Technically it is still followed by a letter.  The text needs clarification.

It seems that final should be when it is followed by a space before a letter
or followed by no more letters.

Another issue - case folding.  Case folding appears to be the same as a
shift to upper followed by a shift to lower.  The sigma adjustment is not
necessary because the two forms are adjacent and will not affect sort
sequences.  The consolidation if dotted and dotless i should not impact
compares in Turkish locations but the Lithuanian removal of u0307 (combining
dot above) after i will affect the Lithuanian locale.  However, this should
not affect other locales.  It is probably a good idea to do this for all
locales.

Carl


Reply via email to