On Jul 24 09:28, Brian Inglis via Cygwin wrote:
> On 2025-07-24 04:30, Corinna Vinschen via Cygwin wrote:
> > Or shall simply go along with CESU-8 when converting back to multibyte
> > to keep the string the same as with wcstombs?
>
> There are 15 * SMP as BMP characters, so many non-Western and emoji
> characters will be expanded from 4 UTF-8 bytes to 6 CESU-8 bytes, and this
> is not supported anywhere as a string representation, designed for internal
> use only per the TR.

We're only talking about invalid sequences, not using CESU-8 throughout.

Corinna
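
For reference, the 4-byte versus 6-byte difference mentioned in the quoted
text can be sketched as below. This is only an illustration of the two
encodings for a single supplementary-plane code point, not code from Cygwin
or newlib. The function names are hypothetical, and the sketch assumes the
input is already known to be in the range U+10000..U+10FFFF.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper: standard UTF-8 encoding of one code point in
   U+10000..U+10FFFF. Always produces 4 bytes. */
static size_t
utf8_encode_supplementary (uint32_t cp, unsigned char out[4])
{
  out[0] = (unsigned char) (0xF0 | (cp >> 18));
  out[1] = (unsigned char) (0x80 | ((cp >> 12) & 0x3F));
  out[2] = (unsigned char) (0x80 | ((cp >> 6) & 0x3F));
  out[3] = (unsigned char) (0x80 | (cp & 0x3F));
  return 4;
}

/* Hypothetical helper: encode one 16-bit value in U+0800..U+FFFF
   (here: a surrogate) using the 3-byte UTF-8 bit layout. */
static size_t
encode_3byte (uint16_t u, unsigned char *out)
{
  out[0] = (unsigned char) (0xE0 | (u >> 12));
  out[1] = (unsigned char) (0x80 | ((u >> 6) & 0x3F));
  out[2] = (unsigned char) (0x80 | (u & 0x3F));
  return 3;
}

/* Hypothetical helper: CESU-8 encoding of the same code point. The code
   point is first split into a UTF-16 surrogate pair, then each surrogate
   is encoded separately as 3 bytes, giving 6 bytes in total. */
static size_t
cesu8_encode_supplementary (uint32_t cp, unsigned char out[6])
{
  uint32_t v = cp - 0x10000;
  uint16_t hi = (uint16_t) (0xD800 | (v >> 10));    /* high surrogate */
  uint16_t lo = (uint16_t) (0xDC00 | (v & 0x3FF));  /* low surrogate */
  size_t n = encode_3byte (hi, out);
  n += encode_3byte (lo, out + n);
  return n;
}
```

With U+1F600 as input, the first function yields F0 9F 98 80 and the second
yields ED A0 BD ED B8 80.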