Re: German Umlauts / UTF8 with comparse

2020-02-17 Thread Christoph Lange
Yes, this helps. Kind of ;-) ... using the character set char-set:alphabetic, my umlauts are now parsed. But I don't get them back in my result, at least not as printable characters. Instead, the following happens, and utterly confuses me: #;2> (define s3 (parse letters (string->list s))) #;3> s3

Re: German Umlauts / UTF8 with comparse

2020-02-17 Thread kooda
Christoph Lange wrote: > meaning, that the ä isn't recognized as being a letter within the > 'char-set:letter'. The utf8 egg’s srfi-14 character sets are designed to be compatible with the original srfi-14 and only contain ASCII characters, as stated in the documentation:

Re: German Umlauts / UTF8 with comparse

2020-02-17 Thread Moritz Heidkamp
Hi Christoph, On 17 February 2020 14:31 +01, Christoph Lange wrote: > meaning, that the ä isn't recognized as being a letter within the > 'char-set:letter'. (The UTF8 aspect of correct character width works on the > other hand: in the remaining string, the ä is represented by only one #\. > If I

German Umlauts / UTF8 with comparse

2020-02-17 Thread Christoph Lange
I read older threads about parsing Japanese with comparse and took some ideas from there, but am still stuck: (import comparse utf8 utf8-srfi-14) (define s "Gänsesäger 2,1") (define s1 "Rotkehlchen 1,0") (define (utf8-in cs) (satisfies (lambda (c) (char-set-contains? cs c (define letter