Re: How to print unicode characters (no library)?

Adam Ruppe via Digitalmars-d-learn Sun, 26 Dec 2021 13:26:48 -0800

On Sunday, 26 December 2021 at 20:50:39 UTC, rempas wrote:

I want to do this without using any library by using the"write" system call directly with 64-bit Linux.

write just transfers a sequence of bytes. It doesn't know norcare what they represent - that's for the receiving end to figureout.

know (and tell me if I'm mistaken), UTF-16 and UTF-32 havefixed size lengths for their characters.

You are mistaken. There's several exceptions, utf-16 can come inpairs, and even utf-32 has multiple "characters" that combineonto one thing on screen.

I prefer to think of a string as a little virtual machine thatcan be run to produce output rather than actually being"characters". Even with plain ascii, consider the backspace"character" - it is more an instruction to go back than it is athing that is displayed on its own.

Now the UTF-8 string will report 11 characters and print themnormally.

This is because the *receiving program* treats them as utf-8 andruns it accordingly. Not all terminals will necessarily do this,and programs you pipe to can do it very differently.

Now what about the other two? I was expecting UTF-16 to report16 characters and UTF-32 to report 32 characters.

The [w|d|]string.length function returns the number of elementsin there, which is bytes for string, 16 bit elements for wstring(so bytes / 2), or 32 bit elements for dstring (so bytes / 4).

This is not necessarily related to the number of charactersdisplayed.

Isn't the "write" system call just writing a sequence ofcharacters without caring which they are?

yes, it just passes bytes through. It doesn't know they aresupposed to be characters...

Re: How to print unicode characters (no library)?

Reply via email to