A related issue came up for LaTeX as we set up the formats for the 2017/01/01 release, which defaults to Unicode encoding for the first time: should we default to NFC normalisation (using the XeTeX primitive and some Lua callback)? That would go some way to avoiding the need to deal with combining accents in the patterns.
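
A minimal sketch of what NFC would and would not buy here, written in Python purely for illustration. It uses the standard `unicodedata` module, not the XeTeX primitive or a Lua callback, and the helper names `nfc` and `codepoints` are made up for this example:

```python
import unicodedata


def nfc(text: str) -> str:
    """Return text in Unicode Normalization Form C (canonical composition)."""
    return unicodedata.normalize("NFC", text)


def codepoints(text: str) -> list[str]:
    """List the code points of text in U+XXXX notation."""
    return [f"U+{ord(ch):04X}" for ch in text]


# Latin "e" followed by COMBINING ACUTE ACCENT: a precomposed form exists,
# so NFC folds the pair into the single code point U+00E9.
latin = nfc("e\u0301")

# Cyrillic "а" (U+0430) followed by COMBINING ACUTE ACCENT: Unicode has no
# precomposed form, so NFC leaves the two code points as they are.
cyrillic = nfc("\u0430\u0301")

print(codepoints(latin))     # ['U+00E9']
print(codepoints(cyrillic))  # ['U+0430', 'U+0301']
```

So normalising the input would cover the cases that have a precomposed character, but sequences without one would still reach the patterns as a base letter plus a combining accent.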

We didn't enable normalisation by default this time for fear of clashing with existing code, but if this issue is going to keep coming up it might be good to look at this again...

David

On 26 January 2017 at 12:51, Arthur Reutenauer <[email protected]> wrote:
> (Moving discussion from the TeX Live list to TeX-hyphen, please reply
> there.)
>
>> Did you add patterns for all combining accents as I mentioned in one
>> of the comments?
>
> Not yet, but it’s on our list:
> https://github.com/hyphenation/tex-hyphen/issues/5
> I need to figure out the best way to do it: should we input the full
> list of all combining characters for every language, or only those
> diacritic signs that are relevant for each language? The former option
> may seem like less work but we need to make sure that the accents don’t
> interact with the existing patterns (for all the languages), and ensure
> that it stays so in the future. If for example someone comes up with a
> pattern set for Russian that does take the combining acute accent into
> account, having a default list of patterns with accents may be
> self-defeating.
>
> Best,
>
> Arthur
