Thanks Ben, your hint made the whole patch work with umlauts and accents. Thanks for the explanation, IOhannes. I think I am having a hard time understanding the following:
- Bytes, which have a range between 0 and 255.
- "Unicode points", which are numbers representing characters, including umlauts?
- "ASCII characters", which are somehow bytes but also use Unicode points 0-127.
- The "string" message and how it differs from the "text" message in [text2d].

All the best,
Peter

* IOhannes m zmölnig via Pd-list <[email protected]> [2026-01-28 19:51]:
> On 28 January 2026 19:08:15 CET, "Peter P." <[email protected]> wrote:
> >
> >> basically, my advice is to use the [string( message with unicode points:
> >> - use [file] to read the raw UTF-8 bytes
> >> - use [unicode/utf82codenumber] to convert the UTF-8 bytes into
> >>   unicode-points
> >> - prefix [string(
> >
> >I tried to implement this (see attached) but [text2d] renders the actual
> >bytes as numbers...
>
> I haven't checked your patch yet, but apart from the obvious (as Ben pointed
> out), I'd like to reiterate:
>
> the "string" message does **not** take bytes but Unicode points (there's
> about 2^20 of them) which are numbers (between 0 and ~1000000).
> bytes can only have values between 0 and 255.
>
> for ASCII characters (which only use Unicode points between 0 and 127), the
> UTF-8 representation is identical to the Unicode point representation.
>
>
> mfg.sfg.jfd
> IOhannes
> ---
> [email protected] - the Pure Data mailinglist
> https://lists.iem.at/hyperkitty/list/[email protected]/message/3HTQ2OT4USBHFLMCZR7HFV7EZA3TEETC/
>
> To unsubscribe send an email to [email protected] mailing list
> UNSUBSCRIBE and account-management -> https://lists.iem.at/
---
[email protected] - the Pure Data mailinglist
https://lists.iem.at/hyperkitty/list/[email protected]/message/7HZC2LYUHLSYIVSG5L2ZM3ZLVE6A375E/
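
PS: To check my understanding of the bytes versus Unicode points distinction, here is a small sketch in Python rather than Pd. The helper names `code_points` and `utf8_bytes` are made up for illustration and are not part of [text2d] or any Pd external. The assumption, going by IOhannes' description, is that the "string" message wants the first kind of list, while reading the raw file yields the second.

```python
# Illustration only: plain Python, not Pd. Helper names are hypothetical.

def code_points(text):
    """One number per character: the Unicode point of each character."""
    return [ord(ch) for ch in text]


def utf8_bytes(text):
    """The UTF-8 encoding of the text as a list of byte values (0..255)."""
    return list(text.encode("utf-8"))


if __name__ == "__main__":
    for sample in ["Pd", "ö", "€"]:
        print(repr(sample), "points:", code_points(sample),
              "bytes:", utf8_bytes(sample))
```

For ASCII-only text such as "Pd" both lists come out identical, which seems to be why a patch can appear to work until the first umlaut shows up. For "ö" there is a single Unicode point but two UTF-8 bytes, and for "€" the Unicode point is already larger than 255, so it cannot be a byte at all.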
