Opt-in non-null class references?

SimonN via Digitalmars-d Wed, 28 Feb 2018 05:46:31 -0800

Hi,

Andrei said in 2014 that not-null-references should be thepriority of 2014's language design, with consideration to makenot-null the default. In case the code breakage is too high, thiscan be an opt-in compiler flag.

Discussion here:https://forum.dlang.org/post/[email protected]

Everybody in the 2014 thread was hyped, but has anything everhappened in the language? In November 2017, the D forum discussedC#'s non-null warnings. Has anybody thought about this againsince?

In D, to prevent immense breakage, non-nullable class referencesneed to be opt-in. I would love to see them and don't mindadapting my 25,000-line D-using project during a weekend.

Are there any counter-arguments to why non-nullablereferences/pointers haven't made it into D yet? Feel free toattack my answers below.


* * *

Argument: If A denotes non-null reference to class A, it can'thave an init value.Answer: Both A?.init and A.init shall be null, then use code-flowanalysis.

This would match D's immutable: In a class constructor, you mayassign the value 5 to a field of type immutable(int) that hasinit value 0. The compiler is happy as long as it can prove thatwe never write a second time during this constructor, and that wenever read before the first assignment.

Likewise, it should be legal to assign from A to another Aexpression such as new A(), and the compiler is happy as long asthe reference is assigned eventually, and if the reference isnever read before assignment. (I haven't contributed to thecompiler, I can't testify it's that easy.)

To allow hacks, it should remain legal to cast A? (nullablereference) to A (non-nullable). This should pass compilation(because casting takes all responsibility from the compiler) andthen segfault at runtime, like any null dereference today.


* * *

Argument: I shall express non-null with contracts.

Answer: That's indeed the best solution without any languagechange. But it's bloaty and doesn't check anything atcompile-time.


    class A { }
    void f1(A a) in { assert(a); } do { f2(a); }
    void f2(A a) in { assert(a); } do { f3(a); }
    void f3(A a) in { assert(a); } do { ...; }
    void g(A a) { if (a) ...; else ...; }

Sturdy D code must look like this today. Some functions handlethe nulls, others request non-null refs from their callers. Thefunction signature should express this, and a contract is part ofthe signature.


But several maintenance problems arise from non-null via contract.

First issue: We now rely on unit-testing to ensure our types arecorrect. You would do that in dynamic languages where the typesystem can't give you meaningful diagonstic errors otherwise. I'drather not fall back to this in D. It's easy to forget suchtests, coverage analysis doesn't help here.

Second issue: Introducing new fields requires updating allmethods that uses the fields. This isn't necessarily only themethods in the class. If you have this code:


    class B {
        A a1;
        void f1() in { assert(a1); } do { ... }
        void f2() in { assert(a1); } do { ... }
    }

When you introduce more fields, you must update every method.This is bug-prone; we have final-switch (a full-blown languagefeature) just to solve similar issues:


    class B {
        A a1;
        A a2;
        void f1() in { assert(a1); assert(a2); } do { ... }
        void f2() in { assert(a1); /+ forgot +/ } do { ... }
    }

Third issue: Most references in a program aren't null. Especiallyclass references that are fields of another class are ofteninitialized in the constructor once, and never re-set. This isthe predominant use of references. In D, the default, implicitcase should do the Right Thing; it's fine when nonstandardfeatures (allowing null) are explicit.


Assuming that A means non-null A, I would love this instead:

    class A { }
    void f1(A a) { f2(a); }
    void f2(A a) { f3(a); }
    void f3(A a) { ...; }
    void g(A? a) { if (a) ...; else ...; }
Or:
    void g(A @nullable a) { if (a) ...; else ...; }

Code-flow analysis can already statically check that weinitialize immutable values only once. Likewise, it should checkthat we only pass A? to f1 after we have tested it for non-null,and that we only call methods on A? after checking for itsnon-null-ness (and the type of `a' inside the `if' block shouldprobably still be A?, not A.)


* * *

Argument: null refs aren't a problem, they're memory-safe.

Answer: Memory-safety is not the concern here. Readability ofcode is, and preventing at compiletime what safely explodes atruntime.


* * *

Argument: Roll your own non-null type as a wrapper around D'snullable class reference.Answer: That will look ugly, is an abstraction inversion, andchecks at runtime only.


    class A { }

    struct NotNull(T)
        if (is(T == class))
    {
        T payload;
        @disable this();
        this(T t) {
            assert(t !is null);
            payload = t;
        }
        alias payload this;
    }

    NotNull!A a = NotNull!A(new A());

The non-nullable type is type with simpler behavior, I can callall methods without segfault. The nullable type is the morecomplex type, I can either call methods on it or must check firstfor non-nullness. My NotNull implements a simple type in terms ofa more complex type. Such abstraction inversion is dubious design.

And this solution would only assert at runtime again, not atcompile time.

Microsoft's C++ Guideline Support Library has not_null<T>. Thatattacks the right problem, but becomes boilerplate when itappears everywhere in your codebase.


* * *

Argument: If A is going to denote non-null-A, then this willbreak huge amounts of code.

Answer: Like @safe, any such massive break must be opt-in.

The biggest downside of opt-in is that few projects will use it,and the feature will be buggy for a long time.

For example, associative arrays in opt-in @safe code togetherwith overriding opEquals with @safe-nothrow-... annotations, allthis can subtly fail if you mix it in complicated ways.Sometimes, you resort to ripping out the good annotations in yourprojects to please the compiler instead of dustmiting yourproject.


* * *

Argument: It's not worth it.

I firmly believe it's worth it, but I accept that others deemother things more important.

I merely happen to love OOP and use D classes almost everywhere,thus I have references everywhere, and methods everywhere thataccept references as parameters.


-- Simon

I'll be happy to discuss this in person at DConf 2018. :-)

Opt-in non-null class references?

Reply via email to