Re: Why ElementType!(char[3]) == dchar instead of char?

FreeSlave via Digitalmars-d-learn Wed, 02 Sep 2015 01:36:13 -0700

On Wednesday, 2 September 2015 at 05:00:42 UTC, drug wrote:

02.09.2015 00:08, Jonathan M Davis via Digitalmars-d-learn wrote:
On Tuesday, September 01, 2015 20:05:18 drug via Digitalmars-d-learn wrote:
My case is I don't know what type user will be using, because I write a
library. What's the best way to process char[..] in this case?

char[] should never be anything other than UTF-8. Similarly, wchar[] is UTF-16, and dchar[] is UTF-32. So, if you're getting something other than UTF-8, it should not be char[]. It should be something more like ubyte[]. If you want to operate on it as char[], you should convert it to UTF-8. std.encoding may or may not help with that. But pretty much everything in D - certainly in the standard library - assumes that char, wchar, and dchar are UTF-encoded, and the language spec basically defines them that way. Technically, you _can_ put other encodings in them, but it's just asking for trouble.

- Jonathan M Davis
I see, thanks. So I should always treat char[] as UTF in D itself, but because I need to pass char[], wchar[] or dchar[] to a C library I should treat it as not UTF but ubytes sequence or ushort or uint sequence - just to pass it correctly, right?

You should just keep in mind that strings returned by Phobos are UTF-encoded. Does your C library have UTF support? Is it relevant at all? Maybe it just treats a char array as binary data. But if it does some non-trivial string and character manipulation or talks to the file system, then it surely expects strings in some specific encoding, and if that's not UTF, you should re-encode the data before passing it from D to this library.
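
To illustrate both cases, here is a minimal sketch that is not tied to any particular C library. strlen from core.stdc.string only stands in for a C function that expects zero-terminated text, and the helper names cLength and rawBytes are made up for this example. toStringz and representation come from std.string, toUTF16z from std.utf.

```d
import core.stdc.string : strlen;
import std.string : representation, toStringz;
import std.utf : toUTF16z;

// Hypothetical helper: hand a D string to a C function that expects
// zero-terminated UTF-8 text. strlen stands in for the real C API.
size_t cLength(string s)
{
    // D strings are not guaranteed to be zero-terminated, so
    // toStringz returns a pointer to a terminated version.
    const(char)* p = s.toStringz;
    return strlen(p);
}

// Hypothetical helper: view the same data as plain bytes, for a C
// function that treats it as binary and takes pointer plus length.
immutable(ubyte)[] rawBytes(string s)
{
    return s.representation;
}

void main()
{
    assert(cLength("hello") == 5);

    // "\u00E9" is one code point but two UTF-8 code units.
    assert(rawBytes("h\u00E9llo").length == 6);

    // For a C API that wants UTF-16, re-encode and terminate in one step.
    const(wchar)* w = "hello".toUTF16z;
    assert(w[5] == 0);
}
```

If the library expects some non-UTF encoding, none of these calls re-encode for you; that conversion has to happen before the data is passed on.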

Also, C does not have wchar and dchar, but it has wchar_t, whose size is not fixed and depends on the particular platform.
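
One way to see this from the D side is to check the size of the wchar_t alias that druntime provides in core.stdc.stddef. This is only a sketch: the helper name matchingDType is made up here, and a matching size says nothing about which encoding the C library actually uses.

```d
import core.stdc.stddef : wchar_t;
import std.stdio : writeln;

// Hypothetical helper: name the D character type that has the same
// size as C's wchar_t on the platform this is compiled for.
string matchingDType()
{
    static if (wchar_t.sizeof == wchar.sizeof)
        return "wchar";    // 2 bytes, typical on Windows
    else static if (wchar_t.sizeof == dchar.sizeof)
        return "dchar";    // 4 bytes, typical on Linux and other Posix systems
    else
        return "none";
}

void main()
{
    writeln("wchar_t is ", wchar_t.sizeof, " bytes, same size as ", matchingDType());
}
```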
