On Thu, 27 Nov 2025, katahiromz wrote:

> Hello, I'm katahiromz. Thank you for your great software.
> I want to add UTF-16/UTF-32 support to your C preprocessor.
> 
> This patch (attached) might add automatic character encoding detection
> to `libcpp/files.cc` by examining the first 4 bytes of input files.
> I hope this patch helps.

It's not conventional for Unix-like utilities accepting text files to do 
this sort of thing; text files are expected to be in the locale's 
encoding, or at least in some ASCII-compatible encoding; UTF-16 and UTF-32 
are binary files, and in practice the world utilities such as GCC operate 
in has settled on UTF-8 as the standard Unicode encoding for input and 
output.  Furthermore, the existing logic is documented in cppopts.texi, 
which this patch doesn't change.

-- 
Joseph S. Myers
[email protected]

Reply via email to