On Sat, Dec 13, 2025, 02:16 Collin Funk <[email protected]> wrote:
>
> +@c https://austingroupbugs.net/view.php?id=1959
> +POSIX leaves the behavior of @samp{lcase} and @samp{ucase} unspecified
> +on multibyte characters. GNU @command{dd} only converts one byte at a
> +time,
I wonder if it may be ambiguous if "converts one byte at a time" means
"reads one byte and converts it" or "reads one byte and converts it to
one byte". This seems to leave open the possibility that the "i" will
be converted in something like:
$ LC_ALL=tr_TR.utf8 dd conv=ucase <<< hij
HiJ
> because multibyte characters may cross block boundaries and case
> +conversion may change the length of characters.
But, OTOH, the meaning might be obvious enough from the rationale.