Thanks Marc and Lassi. I will take a look at Gambit's implementation. In that same direction, I also adapted Alex Shin's iset library which he used to support unicode in his utf8 egg for Chicken Scheme.
Thanks Again, Joseph Donaldson On Saturday, January 6, 2024 at 01:46:53 PM PST, Marc Feeley <fee...@iro.umontreal.ca> wrote: > On Jan 6, 2024, at 3:08 PM, Lassi Kortela <la...@lassi.io> wrote: > >> I have made Bigloo adaptions of SRFI 13 - String Libraries >> <https://github.com/donaldsonjw/srf13>and SRFI 14-Character Sets >> <https://github.com/donaldsonjw/srfi14>. As with the sample implementations, >> they are limited to iso-latin 1 but hopefully will still be useful to some. > > Thank you. These are among the most useful and widely used SRFIs. > > Marc Feeley and John Cowan can advise how to implement SRFI 14 for the full > Unicode range. Marc designed a simple and efficient data structure for > encoding subsets of the Unicode character set. I don't know whether it has > been incorporated into Gambit yet. > Yes they are part of Gambit since last September, with a few tweaks since then. It implements Unicode 15.0. It has been checked against Racket (there are a few differences that seem to be due to differences in how “numeric characters” are defined in R6RS and R7RS specs). The implementation creates all the data structures by analyzing the Unicode database files. These files are the main ones implementing SRFI 14: https://github.com/gambit/gambit/blob/master/lib/gambit/char/char%23.scm https://github.com/gambit/gambit/blob/master/lib/gambit/char/char.scm Marc Feeley