Don wrote:
Andrei Alexandrescu wrote:
D pursues compatibility with C and C++ in the following manner: if a code snippet compiles in both C and D or C++ and D, then it should have the same semantics.

A classic problem with C and C++ integer arithmetic is that any operation involving at least one unsigned integral operand automatically receives an unsigned type, regardless of how silly that actually is, semantically. About the only advantage of this rule is that it's simple. IMHO it has only disadvantages from then on.

The following operations suffer from the "abusive unsigned syndrome" (u is an unsigned integral, i is a signed integral):

(1) u + i, i + u
(2) u - i, i - u
(3) u - u
(4) u * i, i * u, u / i, i / u, u % i, i % u (compatibility with C requires that these all return unsigned, ouch)
(5) u < i, i < u, u <= i etc. (all ordering comparisons)
(6) -u
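
To make the syndrome concrete, here is a minimal D sketch (the variable names are mine; the same conversions happen in C):

import std.stdio;

void main()
{
    uint u = 1;
    int i = -1;

    // (5): i is converted to uint (0xFFFF_FFFF), so the "obvious"
    // comparison 1 < -1 evaluates to true.
    writeln(u < i);  // true

    // (1): u + i is computed and typed as uint, even though the
    // mathematical result of 1 + (-1) is plain 0.
    writeln(typeof(u + i).stringof);  // "uint"
}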

I think that most of these problems are caused by C enforcing a foolish consistency between literals and variables. The idea that literals like '0' and '1' are of type int is absurd, and has caused a torrent of problems. '0' is just '0'.

uint a = 1;
does NOT contain an 'implicit conversion from int to uint', any more than there are implicit conversions from naturals to integers in mathematics. So I really like the polysemous types idea.

Yah, polysemy will take care of the constants. It's also rather easy to implement for them.

For example, when is it reasonable to use -u?
It's useful with literals, as in
uint a = -1u;
which is equivalent to uint a = 0xFFFF_FFFF. Anywhere else, it's probably a bug.

Maybe not even for constants, as all uses of -u can easily be converted to ~u + 1. I'd gladly agree to disallow -u entirely.
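
The identity is easy to check mechanically; a minimal D sketch (the loop and names are mine):

void main()
{
    // Under wraparound arithmetic, negation and complement-plus-one
    // coincide for every unsigned value.
    foreach (uint u; [0u, 1u, 42u, uint.max])
        assert(-u == ~u + 1);
}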

My suspicion is that if you allowed all signed-unsigned operations when at least one operand was a literal, and made everything else illegal, you'd fix most of the problems. In particular, there'd be a big reduction in people abusing 'uint' as a primitive range-limited int.

Well, part of my attempt is to transform that abuse into legit use. In other words, I do want to allow people to consider uint a reasonable model of natural numbers. It can't be perfect, but I believe we can make it reasonable.

Notice that requiring one operand to be a literal does not solve all of the problems I mentioned. For example, it makes no progress in typing u1 - u2 appropriately.

Although it would be nice to have a type which was range-limited, 'uint' doesn't do it. Instead, it guarantees the number is between 0 and int.max*2+1 inclusive. Allowing mixed operations encourages programmers to focus on the benefit of 'the lower bound is zero!' while forgetting that there is an enormous downside ('I'm saying that this could be larger than int.max!').
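
Indeed, the bound checks out; a one-line D verification (mine):

static assert(uint.max == 2u * int.max + 1);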

I'm not sure I understand this part. To me, the larger problem is underflow, e.g. when subtracting two small uints yields a large uint.
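
Concretely, the underflow looks like this (a minimal D sketch; the names are illustrative):

import std.stdio;

void main()
{
    uint u1 = 3, u2 = 5;

    // u1 - u2 wraps around instead of producing -2, and its type is uint.
    writeln(u1 - u2);             // 4294967294
    writeln(cast(int)(u1 - u2));  // -2, after reinterpreting the bits
}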

Interestingly, none of these problems exist in assembly language programming, where every arithmetic instruction affects the overflow flag (for signed operations) as well as the carry flag (for unsigned).

They do exist. You need to use imul/idiv vs. mul/div depending on the signedness of your operands.
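
For instance, widening the same 32-bit pattern through a signed multiply versus an unsigned one gives different 64-bit results; that is exactly the imul/mul split, sketched here in D (names are mine):

import std.stdio;

void main()
{
    int i = -2;
    uint u = cast(uint)i;  // same bit pattern, 0xFFFF_FFFE

    writeln(cast(long)i * 3);   // -6           (imul-style widening)
    writeln(cast(ulong)u * 3);  // 12884901882  (mul-style widening)
}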


Andrei
