> From: Gavin Smith <[email protected]>
> Date: Mon, 20 Apr 2026 21:07:18 +0100
>
> At the end of this mail is an updated version of my patch.
>
> This now outputs "é" as "e" in the index sort key, as shown:
>
> $ cat test.texi
> \input texinfo
>
> @cindex à gré, césure
> @cindex écrire des lettres
> @cindex bbbb
>
>
> Index:
> @printindex cp
>
> @bye
> $ pdftex test.texi
> [...]
> $ cat test.cp
> @entry{{à} gre, cesure}{1}{à gré, césure}
> @entry{ecrire des lettres}{1}{écrire des lettres}
> @entry{bbbb}{1}{bbbb}
> $ LC_ALL=C texindex test.cp
> $ cat test.cps
> @initial {{à}}
> @entry{à gré, césure}{1}
> @initial {B}
> @entry{bbbb}{1}
> @initial {E}
> @entry{écrire des lettres}{1}
>
> Note how here é becomes e in the sort key, while à is output surrounded
> by braces, which is the default for multibyte UTF-8 characters with this
> patch.

Could you please explain this idea in more detail?  I'm not sure I
understand the purpose of stripping the accents from accented letters,
and I don't understand how this will sort correctly.

> What would be much harder would be to make a letter sort as its
> own independent letter between A and Z, with its own heading in the
> index: for example, Ñ between N and O.  We could make Ñ sort between
> N and O by outputting its sort string as NZZZ, but texindex would take
> an entry with a sort key beginning with NZZZ as part of the "N"
> section.  (I'm not sure what languages this would be an issue for.)
> Also multi-level collation (as in the Unicode Collation Algorithm)
> is right out.

I have the same question about this part.  Don't people expect the
non-ASCII Latin characters to sort in the order of their Unicode
codepoints, which would put them _after_ all the ASCII characters?  Or
are you trying to mimic the collation order that the French locale
mandates for these letters?  In the latter case, didn't you just
explain that this is not what Texinfo should (or reasonably could) do,
and that the practical solution is to sort in codepoint order?

Do you plan to have each txi-YY.tex play similar locale-dependent
tricks with accented letters to "implement" the locale's collating
order?
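
To make the question concrete, here is a small Python sketch of the two
orderings I am contrasting.  It is only an illustration: it is not
texindex code and it does not reproduce what the patch does (in
particular, it strips the accent from "à" as well, whereas the patch
leaves "à" in braces).  The helper name strip_accents is hypothetical,
and I assume the index entries are in precomposed (NFC) form.

```python
import unicodedata


def strip_accents(text: str) -> str:
    """Decompose to NFD and drop combining marks, so that "é" becomes "e"."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


# The three index entries from the quoted example, forced to NFC so that
# each accented letter is a single codepoint above the ASCII range.
entries = [
    unicodedata.normalize("NFC", s)
    for s in ["à gré, césure", "écrire des lettres", "bbbb"]
]

# Plain codepoint order: every ASCII letter sorts before "à" (U+00E0)
# and "é" (U+00E9), so "bbbb" comes first.
by_codepoint = sorted(entries)

# Accent-stripped sort keys: "à" is compared as "a" and "é" as "e",
# so the accented entries interleave with the ASCII ones.
by_stripped_key = sorted(entries, key=strip_accents)

print(by_codepoint)     # ['bbbb', 'à gré, césure', 'écrire des lettres']
print(by_stripped_key)  # ['à gré, césure', 'bbbb', 'écrire des lettres']
```

The first list is what I would expect from sorting in codepoint order;
the second is what stripping accents from the sort key produces.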
