Re: [vile] spellflt.l: Include UTF-8 code points

Michael von der Heide Sun, 23 Jun 2019 19:57:55 -0700

It works (hunspell) for me with words like "prüfen" or "Straße". Flex
generates an 8-bit scanner. UTF-8 should work. Would you mind testing it?


--
Michael von der Heide


Thomas Dickey <[email protected]> schrieb am So., 23. Juni 2019, 21:24:

> On Sun, Jun 23, 2019 at 07:42:26PM +0200, Michael von der Heide wrote:
> > Would it be possible to include UTF-8 code points to check words
> containing
> > umlauts?
> >
> > WORD          ([a-zA-Z]|\xc3[\x80-\xbf])+
>
> lex/flex doesn't do that :-(
>
> They use small (256-entry) tables for the character types.
>
> I've seen a (long ago) patch to use big tables (which I've read
> doesn't work well).
>
> on my (too-long) to-do list, I have an idea which could be developed,
> to provide the feature using character-classes.  That is, flex could
> be modified (perhaps a month's work...)
>
> --
> Thomas E. Dickey <[email protected]>
> https://invisible-island.net
> ftp://ftp.invisible-island.net
>

_______________________________________________
vile mailing list
[email protected]
https://lists.nongnu.org/mailman/listinfo/vile

Re: [vile] spellflt.l: Include UTF-8 code points

Reply via email to