On Sun, Jun 23, 2019 at 07:42:26PM +0200, Michael von der Heide wrote:
> Would it be possible to include UTF-8 code points to check words containing
> umlauts?
>
> WORD ([a-zA-Z]|\xc3[\x80-\xbf])+

lex/flex doesn't do that :-(

They use small (256-entry) tables for the character types. I've seen a
patch (long ago) to use big tables (which I've read doesn't work well).

On my (too-long) to-do list, I have an idea which could be developed to
provide the feature using character classes. That is, flex could be
modified (perhaps a month's work...)

-- 
Thomas E. Dickey <[email protected]>
https://invisible-island.net
ftp://ftp.invisible-island.net
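
To make the byte-level view concrete, here is a minimal Python sketch (not flex, and not part of the original exchange). It applies the quoted WORD pattern to UTF-8 bytes and models a 256-entry character-type table. The names `is_word`, `CHAR_TYPE` and `byte_types` are hypothetical, and the table is only a toy stand-in for the tables mentioned above, not flex's actual data structure.

```python
import re

# The quoted WORD pattern, applied to raw UTF-8 bytes rather than decoded text.
# \xc3 followed by \x80-\xbf is the two-byte UTF-8 form of U+00C0..U+00FF.
WORD = re.compile(rb"(?:[a-zA-Z]|\xc3[\x80-\xbf])+")

def is_word(text):
    """Return True if the UTF-8 encoding of text matches WORD entirely."""
    return WORD.fullmatch(text.encode("utf-8")) is not None

# Toy 256-entry character-type table indexed by byte value (an assumption
# for illustration; a byte-oriented scanner can only classify single bytes).
CHAR_TYPE = ["other"] * 256
for b in range(256):
    if b < 128 and chr(b).isalpha():
        CHAR_TYPE[b] = "letter"

def byte_types(text):
    """Classify each byte of the UTF-8 encoding of text via the table."""
    return [CHAR_TYPE[b] for b in text.encode("utf-8")]

if __name__ == "__main__":
    for sample in ("Müller", "Straße", "Łódź", "×"):
        print(sample, is_word(sample), byte_types(sample))
```

Under these assumptions, the pattern accepts words whose non-ASCII letters fall in U+00C0..U+00FF, but it also accepts the non-letters U+00D7 (multiplication sign) and U+00F7 (division sign), and it rejects letters outside that range such as U+0141. The table view shows why: each byte of a multi-byte character is classified separately, so there is no single entry for "ü".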

_______________________________________________
vile mailing list
[email protected]
https://lists.nongnu.org/mailman/listinfo/vile
