Re: Reading binary streams with decoding to Unicode

Dukc via Digitalmars-d-learn Mon, 15 Oct 2018 11:01:06 -0700

On Monday, 15 October 2018 at 10:49:49 UTC, Vinay Sajip wrote:

Is there a standardised way of reading over buffered binary streams (at least strings, files, and sockets) where you can layer a decoder on top, so you get a character stream you can read one Unicode char at a time? Initially UTF-8, but later also other encodings. I see that std.stream was deprecated, but can't see what other options there are. Can anyone point me in the right direction?

This is done automatically for character arrays, which includes strings. wchar arrays will iterate by UTF-16, and dchar arrays by UTF-32. If you have a byte/ubyte array you know to be Unicode-encoded, convert it to char[] to iterate by code points.
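A minimal sketch of the byte-array case, assuming the bytes are UTF-8. The helper name decodeUtf8 is made up for illustration; std.utf.validate is used only so that malformed input is rejected up front rather than partway through the loop.

```d
import std.stdio : writeln;
import std.utf : validate;

// Hypothetical helper (not part of Phobos): reinterpret raw bytes as UTF-8
// and collect the decoded code points.
dchar[] decodeUtf8(const(ubyte)[] raw)
{
    // The cast reinterprets the same memory as char; nothing is copied.
    auto text = cast(const(char)[]) raw;

    // Throws a UTFException if the bytes are not well-formed UTF-8.
    validate(text);

    dchar[] result;
    // Asking foreach for dchar makes it decode one code point per iteration.
    foreach (dchar c; text)
        result ~= c;
    return result;
}

void main()
{
    // "h", U+00E9 (two bytes in UTF-8), "l", "l", "o"
    ubyte[] raw = [0x68, 0xC3, 0xA9, 0x6C, 0x6C, 0x6F];
    writeln(decodeUtf8(raw).length); // 6 bytes in, 5 code points out
}
```

Note that this only covers data already in memory and already in a UTF encoding; other encodings would need a separate transcoding step first.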

Vice versa, if you want to iterate a character array by code unit, convert it to ubyte[]/ushort[] (depending on code unit length) or use std.utf.byCodeUnit.
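To make the code unit versus code point difference concrete, here is a small sketch. The two helper names are hypothetical; std.string.representation is one way to get the ubyte[] view without writing a cast yourself, and is my addition rather than something suggested above.

```d
import std.range : walkLength;
import std.string : representation;
import std.utf : byCodeUnit;

// Hypothetical helper: range functions auto-decode strings,
// so this counts code points.
size_t countCodePoints(string s)
{
    return s.walkLength;
}

// Hypothetical helper: byCodeUnit turns decoding off,
// so this counts UTF-8 code units.
size_t countCodeUnits(string s)
{
    return s.byCodeUnit.walkLength;
}

void main()
{
    string s = "h\u00E9llo"; // U+00E9 takes two code units in UTF-8

    assert(countCodePoints(s) == 5);
    assert(countCodeUnits(s) == 6);

    // The same code units viewed as plain bytes.
    immutable(ubyte)[] raw = s.representation;
    assert(raw.length == 6);
    assert(raw[1] == 0xC3 && raw[2] == 0xA9);
}
```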
