Re: Clay language

Steven Schveighoffer Thu, 30 Dec 2010 11:20:20 -0800

On Thu, 30 Dec 2010 12:52:32 -0500, Andrei Alexandrescu<[email protected]> wrote:

On 12/30/10 11:08 AM, Steven Schveighoffer wrote:
I'd have to see how it works. I also thought the new operator
overloading scheme was reasonable -- until I tried to use it.
You mean until you tried to use it /once/.
Note this is even more bloated because you generate one function per
pair of types used in concatenation, vs. one function per class defined.
That function is inlined and vanishes out of existence. I wish one daywe'd characterize this bloating issue more precisely. Right now anythinggeneric has the "bloated!!" alarm stuck to it indiscriminately.

Functions inline out of existence during runtime, but the function itselfremains resident in the compiled binary.

I don't know that it's an important aspect to keep it in there or not, Ijust know it's kept. There are a whole slew of improvements we can makein this regard, but I'm not sure they are possible, because I'm not acompiler writer.

One such nuisance in particular is the proliferation of types when you usesomething like isInputRange. That invariably is *only* used at compiletime, yet the type and its typeinfo are injected into the binary.

I mean bloated because you are generating template functions that just
forward to other functions. Those functions are compiled in and take up
space, even if they are inlined out.


I think we can safely leave this matter to compiler technology.

I hope that can be done. D already suffers from the 'hey what gives, howcome hello world is 1MB?!!' syndrome.

Let's also realize that the mixin is going to be required *per
interface* and *per class*, meaning even more bloat.
The bloating argument is a complete red herring in this case. I do agreethat generally it could be a concern and I also agree that the compilerneeds to be improved in that regard. But by and large I think we cancalmly and safely think that a simple short function is not a source ofworry.

Short template functions still have template mangled names. I have foundthat template names with lots of parameters can slow down compilation, butI think Walter is working on fixing that.

If we can get to a point where language constructs such as this can betruly inlined out of existence, then I think we will be on another levelfrom other languages. It's not an unimportant nuisance to be dealt withlater.


And at that point, I can agree that the template solution is not bloated ;)

So I'd say, while my example is not proof that this is a disaster, I
think it shows the change in operator overloading cannot yet be declared
a success. One good example does not prove anything just like one bad
example does not prove anything.
Many good examples do prove a ton though. Just off the top of my head:

- complex numbers

- checked integers

- checked floating point numbers

- ranged/constrained numbers

- big int

- big float

- matrices and vectors

- dimensional analysis (SI units)

- rational numbers

- fixed-point numbers
If I agree with something is that opCat is an oddity here as it doesn'tusually group with others. Probably it would have helped if opCat wouldhave been left named (just like opEquals or opCmp) but then uniformityhas its advantages too. I don't think it's a disaster one way oranother, but I do understand how opCat in particular is annoying to yourcase.

Probably the most common operator overload in D is opEquals, luckily thatis not a template (even though it sadly does not work with interfaces yet).

It seems that operator overloads are in categories. There are the numericoverloads, which I agree are generally overloaded in groups. When Idefined cursors to be more like C++ iterators in dcollections instead ofsmall ranges, I used the ++ and -- overloads, which you typically definetogether.

When designing the mixin that allows you to define various operatoroverloads, I think it would be hugely beneficial to take into accountthese groupings and make the mixins modular.

I haven't had that experience. This is just me talking. Maybe others
believe it is good.

I agree that the flexibility is good, I really think it should have that
kind of flexibility. Especially when we start talking about the whole
opAddAssign mess that was in D1. It also allows making wrapper types
easier.

The problem with flexibility is that it comes with complexity. Most
programmers looking to understand how to overload operators in D are
going to be daunted by having to use both templates and template
constraints, and possibly mixins.

Most programmers looking to understand how to overload operators in Dwill need to bundle them (see the common case argument above) and willgo with the TDPL examples, which are clear, short, simple, and useful.

The code itself is simple, it's the "how does x + y match up with thistemplate thingy" which is the problem I think. We've already had severalposts on d.learn ask how operator overloads work even after reading TDPL.

There once was a discussion on how to improve operators on the phobos
mailing list (don't have the history, because i think it was on
erdani.com). Essentially, the two things were:

1) let's make it possible to easily specify template constraints for
typed parameters (such as string) like this:

auto opBinary("+")(Foo other)

which would look far less complex and verbose than the current
incarnation. And simple to define when all you need is one or two
operators.

I don't see this slight syntactic special case a net improvement overwhat we have.

It's less intimidating. Max pointed out opBinary(string op : "+"), whichis close, but still has some seemingly superfluous syntax (why do I needstring op? and what is that : for?)


Compare that to C++ operators:

operator+(Foo rhs)

I'd call that very simple to understand in the context of operatoroverloading.

Also, the proposal is a specialization of templates in general, not justfor operator overloading. It translates to the same thing as if you wrote:


opBinary(string $)(Foo other) if($ == "+")

where $ is an inaccessible symbol. It basically optimizes out the partsyou don't care about if you don't care about them.

2) make template instantiations that provably evaluate to a single
instance virtual. Or have a way to designate they should be virtual.
e.g. the above operator syntax can only have one instantiation.
This may be worth exploring, but since template constraints arearbitrary expressions I fear it will become a mess of special casesdesigned to avoid the Turing tarpit.

That's why I conditioned it as "provably" evaluate to single instance. Imeant provable by the compiler, so even something that may look obvious toa user as only instantiating to one instance may not be provable by thecompiler. It probably requires you to use specific constructs (like theone mentioned above) to help the compiler out.

Using operator overloading in conjunction with class inheritance israre.
I don't use operator overloads and class inheritance, but I do use
operator overloads with interfaces. I think rare is not the right term,
it's somewhat infrequent, but chances are if you do a lot of interfaces,
you will encounter it at least once. It certainly doesn't dominate the
API being defined.
Maybe a more appropriate characterization is that you use catenationwith interfaces.

Concatenation, equality comparison, indexing, and assignment. Out ofthose, only concatenation gives me headaches because it requires templates.

If you propose we remove concatenation from opBinary and give it its ownform, then I thing that would solve the problem too.

Actually, the functionality almost exists in template this parameters.
At least, the reevaluation part is working. However, you still must
incur a performance penalty to cast to the derived type, plus the
template nature of it adds unnecessary bloat.

Saw that. I have a suspicion that we'll see a solid solution from yousoon!

Alas, no solution is possible without templates being allowed ininterfaces :( But yes, I plan to use this technique as soon as it'spossible.


-Steve

Re: Clay language

Reply via email to