Re: What is the case against a struct post-blit default constructor?

foobar Wed, 10 Oct 2012 12:16:13 -0700

Thank you for explaining.
See comments inline.

On Wednesday, 10 October 2012 at 18:12:13 UTC, Jonathan M Davis wrote:

On Wednesday, October 10, 2012 13:40:06 foobar wrote:
Can you please elaborate on where the .init property is being
relied on? This is an aspect of D I don't really understand.
What's the difference between a no-arg ctor and one with args in
relation to this requirement?
init is used anywhere and everywhere that an instance of a type needs to be default-initialized. _One_ of those places is a local variable. Not all places where an instance of an object needs to be created can be initialized by the programmer. The prime example of this would be arrays. If you declare
auto i = new int[](5);

or

int[12] s;
all of the elements in the array need to be initialized or they'll be garbage, and there's no way for the programmer to indicate what values they should be. The whole point of init is to avoid having variables ever be garbage without the programmer explicitly asking for it. And having a default constructor wouldn't help one whit with arrays, because the values of their elements must be known at compile time (otherwise you couldn't directly initialize member variables or static variables or anything else which requires a value at compile time with an array). With init, the compiler can take advantage of the fact that it knows the init value at compile time to efficiently initialize the array.
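
As a minimal sketch of the point about arrays: element storage is filled with the element type's init value, and that value is available at compile time. The `Point` struct and the helper names are hypothetical and chosen only for illustration; nothing here comes from the thread itself.

```d
// Hypothetical struct: its member initializers define Point.init.
struct Point
{
    int x = 3;
    int y = 4;
}

// Point.init is a compile-time constant, so it can be checked statically.
static assert(Point.init.x == 3 && Point.init.y == 4);

// Dynamic array of ints: every element starts out as int.init, which is 0.
int[] makeInts(size_t n)
{
    return new int[](n);
}

// Dynamic array of structs: every element starts out as Point.init.
Point[] makePoints(size_t n)
{
    return new Point[](n);
}

// Static array: also filled with the element type's init value.
int sumOfFreshStaticArray()
{
    int[12] s;
    int total = 0;
    foreach (v; s)
        total += v;
    return total;
}
```

No constructor runs for any element in these functions; the storage simply receives a copy of the init value.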

I understand the idea of default initialization. I was more interested in the machinery and implementation details :) So let's dive into those details:
Arrays - without changing existing syntax we can use these semantics:


auto a = new int[](5); // compiler calls T() for each instance
int[12] b; // ditto

This would be the same as in C++. We could also expand the syntax and allow:

auto b = new int[](5, 9); // init all instances to 9
auto b = new int[](5, int (int index) { return index; }); // initializes each member via a function call

This can be generalized to multiple dimensions.

But even constructing objects sanely relies on init. All user-defined objects are fully initialized to what their member variables are directly initialized to before their constructors are even called. In the case of a struct, that's the struct's init value. It's not for a class, because you can't have a class separate from its reference (so it's the reference which gets the init value), but the class still has a state equivalent to a struct's init value, and that's the state that it has before any of its constructors are called.
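
A small sketch of that guarantee, using made-up types (`Config` and `Widget` are hypothetical names): each constructor records what it observes on entry, before it has assigned anything itself.

```d
// Hypothetical struct: the constructor body starts from Config.init.
struct Config
{
    int retries = 3;        // part of Config.init
    string name;            // string.init is null
    int retriesSeenInCtor;

    this(string n)
    {
        // Nothing has been assigned yet, but `retries` already holds 3.
        retriesSeenInCtor = retries;
        name = n;
    }
}

// Hypothetical class: the object gets its initial state before this() runs.
class Widget
{
    int size = 10;
    int sizeSeenInCtor;

    this()
    {
        sizeSeenInCtor = size;
    }
}

// A class variable that is never assigned holds the reference's init value.
bool freshReferenceIsNull()
{
    Widget w;
    return w is null;
}
```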

So for classes .init is null, which complicates non-nullable classes. It seems the "solution" (more like a hack IMO) of @disable _breaks_ the .init guarantee in the language.
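
For reference, a rough sketch of how `@disable this();` interacts with .init on a struct, as far as I understand current compiler behaviour. `NoDefault` and `viaInit` are hypothetical names used only for this illustration.

```d
// Hypothetical struct that forbids default construction.
struct NoDefault
{
    int value;

    @disable this();

    this(int v)
    {
        value = v;
    }
}

// Declaring a variable without an initializer is rejected at compile time.
static assert(!__traits(compiles, { NoDefault s; }));

// The init value itself still exists and can be requested explicitly.
static assert(NoDefault.init.value == 0);

NoDefault viaInit()
{
    return NoDefault.init;
}
```

So the type still has an init value; what `@disable` removes is the implicit use of it when a variable is declared without an initializer.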

If it weren't for that, you'd get the insanity that C++ or Java have with regards to the state of objects prior to construction. C++ is particularly bad in that each derived class is created in turn, meaning that when a constructor is called, the object _is_ that class rather than the derived class that you're ultimately constructing (which means that things can go horribly wrong if you're stupid enough to call a virtual function from a constructor in C++). I believe that Java handles that somewhat better, but it gets bizarre ordering issues with regards to initializing member variables that cause problems if you try and alter member variables from base classes inside of a derived constructor. With D, the object is guaranteed to be in a sane state prior to construction.
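
To make the D side of that comparison concrete, here is a minimal sketch with hypothetical classes (`Base`, `Derived`, and `hook` are illustrative names). The base constructor makes a virtual call, and the derived override reads a derived member that has not been touched by any constructor yet.

```d
class Base
{
    this()
    {
        hook(); // virtual call made while the base constructor is running
    }

    void hook() {}
}

class Derived : Base
{
    int limit = 5;  // part of the object's initial state
    int seen = -1;

    // Runs during Base's constructor, yet `limit` is already 5.
    override void hook()
    {
        seen = limit;
    }
}

int limitSeenDuringBaseCtor()
{
    auto d = new Derived;
    return d.seen;
}
```

The call dispatches to the `Derived` override, and the override sees the directly initialized member values rather than garbage.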

C++ is insanely bad here mainly due to [virtual?] MI, which doesn't affect D, and Java _allows_ virtual methods in constructors, which I think is also "fixed" in the latest C++ standard. I don't know about the ordering problems you mention, but AFAIK the complication arises with MI, not default initialization. It's just a matter of properly defining the inheritance semantics.

And without init, even if every place that an object is instantiated could be directly initialized by the programmer (which it can't), then you would either end up with garbage every time that a variable isn't directly initialized, or you'd have to directly initialize them all. In order for D's construction model to work, this would include directly initializing _all_ member variables even if the constructor then set them to something else (which would actually cause problems with const and immutable). And that would get _very_ annoying, even if it would be preferable for the local variable to require explicit initialization.


You talk about:
class C {
 immutable T val; // what to do here?
 this() { ... }
}

This can be solved by either requiring a ctor call at # or, if none is specified, calling T(), or we can require the init to happen in the ctor a-la C++ semantics.

Another case where init is required is out parameters. All out parameters are set to their init value when the function is called in order to avoid bugs caused by reading the value of an out parameter before it's set within the function. That wouldn't work at all without init.
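
A minimal sketch of the out-parameter behaviour described above. The function names are hypothetical; the caller's variable holds 99 beforehand, but the callee never sees that value.

```d
// Returns whatever the out parameter holds on entry, then assigns it.
int valueOnEntry(out int result)
{
    int seen = result; // already reset to int.init, i.e. 0
    result = 7;
    return seen;
}

// Checks both halves: the reset on entry and the write back to the caller.
bool outParamStartsAtInit()
{
    int x = 99;
    int seen = valueOnEntry(x);
    return seen == 0 && x == 7;
}
```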

Personally, I'd just remove this feature from the language; tuples are a far better design for returning multiple values, and even with this feature intact, we could always use the default no-arg constructor.

E.g.
void foo(out T val);
becomes:
void foo(out T val = T());

One of the more annoying AA bugs makes it so that if the foo function in this
code

aa[5] = foo();
throws, then aa[5] gets set with an init value of the element type. While this clearly shouldn't happen, imagine how much worse it would be if we didn't have init, and that element got set to garbage?

I don't get this example. If foo throws, then the calling code will get control. How would you ever get to read that garbage in aa[5]? The surrounding try-catch block should take care of this explicitly anyway.


E.g.
try {
 aa[5] = foo(); // foo throws
 // ## do something with aa[5], this won't happen
} catch (Exception e) {
 // Please handle aa[5] here explicitly.
}

// @@ do something with aa[5], works due to the explicit fix in the catch.

There are probably other cases that I can't think of right now where init gets used - probably in the runtime if nowhere else. Every place that could possibly result in a variable being garbage _doesn't_ result in garbage, because we have init.
And regardless of what the language does, there are definitely places where the standard library takes advantage of init. It uses it a lot for type inference, but it also uses it directly in places such as std.algorithm.move. Without init, it would end up dealing with garbage values. It's also a lifesaver in generic code, because without it, generic code _can't_ initialize variables in many cases. Take something like

T t;

if(cond)
{
 ...
 t = getValue();
 ...
}
else
{
 ...
 t = getOtherValue();
 ...
}
How on earth could a generic function initialize t without T.init? void? That's just begging for bugs when one of the paths doesn't actually set t like it's supposed to. It doesn't know anything about the type and therefore doesn't know what a reasonable default value would be, so it can't possibly initialize t properly.
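
A small sketch of the generic case being described, with one path that deliberately forgets to assign. `choose` is a hypothetical helper written for this illustration, not a Phobos function.

```d
// Generic function: it knows nothing about T beyond the fact that T.init exists.
T choose(T)(bool cond, T a)
{
    T t; // starts as T.init, whatever T turns out to be

    if (cond)
    {
        t = a;
    }
    // The else path "forgets" to set t, so the caller gets T.init back.

    return t;
}
```

With `T t = void;` in place of `T t;`, the forgotten path would return whatever happened to be in that memory, which is the trade-off the two positions in this thread disagree about.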

Doesn't @disable break those algorithms in Phobos anyway? How would that work for non-nullable classes?
To answer the above question, I'd say there's nothing wrong with init to void. This is what happens anyway, since the .init isn't used and the optimizer will optimize it away.

I can understand preferring that local variables have to be directly initialized by the programmer, but it just doesn't scale. Having init is _far_ more flexible and far more powerful. Any and every situation that might need to initialize a variable can do it. Without init, that just isn't possible.

- Jonathan M Davis

Again, thanks for the explanation. I have to say that on a general level I have to agree with Don's post and I don't see how the .init idiom generally "works" or is useful. I can't see anything in the above examples that shows that .init is absolutely required and we can't live without it. The only thing that worries me here is the reliance of the runtime/phobos on .init.
