Steffen Nurpmeso wrote in <20210614162626.vfjxt%stef...@sdaoden.eu>: ... | <20210614121214.84c1621...@orac.inputplus.co.uk>: ... ||Why not iconv(3) the input from the user's locale, the MIME part's ||charset, etc., to UTF-8, work internally, and then iconv() again on the ... |functions do not support graphemes, and __STDC_ISO_10646__ is an |option, so that you cannot simply code some tables on your own to |fill the gaps, because looking at the wchar_t codepoints may not |give you a Unicode "codepoint" (though maybe all do it like that |so in practice you could make this a precondition). I had to
To add that if i recall correctly citrus for example does this, using the upper bits of wchar_t for state info, but i have forgotten whether that was done in an UTF-8 locale, or rather in CJK or SHIFT-JS or whatever (my gut says the latter). --steffen | |Der Kragenbaer, The moon bear, |der holt sich munter he cheerfully and one by one |einen nach dem anderen runter wa.ks himself off |(By Robert Gernhardt)