Re: Species-static members vs singletons

Peter Levart Sun, 22 May 2016 16:00:06 -0700

Hi Brian,

I agree that "species" placement is a better, less verbose option. Buthow to solve the language problem of having "species" and "instance"members of the same "type-variable" type be assignable to one-another?For example:


class Foo<any T> {
    species T st;
    T it;

    void m() {
        it = st; // this can not be allowed
        st = it; // this can be allowed

        // maybe this could be allowed?
        @SuppressWarnings("unchecked")
        it = (T) st;
    }


Singleton abstraction has the same problem.

So while technically possible, it would be weird to have 'T' sometimesnot be assignable to 'T'. Can we live with that?


Regards, Peter

On 05/19/2016 04:36 PM, Brian Goetz wrote:

We discussed two primary means to surface species-specific members inthe language: a "species" placement (name TBD) as distinct from staticand instance, or a "singleton" abstraction (a la Scala's "object"abstraction, as Peter L suggested). We've done some experimentscomparing the two approaches.
Separately, we discussed two strategies for handling this at the VMlevel: having three separate placements (ACC_STATIC, ACC_SPECIES, andinstance) or retconning ACC_STATIC to mean "species" and usingcompiler trickery to simulate traditional statics. In recentdiscussions with Oracle and IBM VM folks, they seemed happy enoughwith having a new placement (and possibly new bytecodes,{get,put,invoke}species, or overloading these onto *static withParamTypes in the owner field of the various XxxRef constants.)
There are several places where the language itself can take advantageof species members:
1. Reifying type variables. For an any-generic class Foo<T,U>, thecompiler can generate public static final reflection-thingie-valuedfields called "T" and "U", which means that "aFoo.T" (as an ordinaryfield ref!) would evaluate to the reflective mirror for the reified T-- if present, otherwise it would evaluate to the reflective mirrorfor 'erased'.
2. Representation of generic methods. The current translationstrategy has us translating any-generic methods to classes; a staticmethod
    static<any T> void foo(T t) { }

translates to a class (plus an erased bridge):

    bridge static foo(Object o) { ... invoke erased specialization ... }

    static class Xxx$foo<any T> {
        void foo(T t) { ... }
    }
This means that an instance of Xxx$foo is needed to invoke the method-- but serves solely to carry the type variables -- which isunfortunate. If instead we translate as:
    static class Xxx$foo<any T> {
*species-static *void foo(T t) { ... }
    }

then we can invoke this method via invokespecies:

    invokespecies ParamType[Xxx$foo, T_inf].foo(T_inf)
where T_inf is the erasure-normalized type inferred for T (reified ifvalue, `erased` reference.) No fake receiver required.
The translation for generic instance methods is still somewhat messier(will post separately), but still less messy than if we also had tomanage / cache a receiver.
We also drafted some examples of how such a facility would be used,writing them both with species-static and with singleton. Examplesand notes below; the summary is that in all cases, the species-staticversion is either better or about as good.
1.  The old favorite, caching an instantiated instance.

Species
        Singleton
class Collections {
    private static class Holder<any T> {
        private species List<T> empty = new EmptyList<T>();
    }

    static<any T> List<T> emptyList() { return Holder<T>.empty; }
}
        class Collections {
    private singleton Holder<any T> {
        private empty = new EmptyList<T>();
    }

    static<any T> List<T> emptyList() { return Holder<T>.empty; }
}
Note that in this case, species by itself isn't enough -- we stillneed a holder class, and its a bit ugly. Arguably we could mergeHolder into EmptyList (if that's under our control) but becauseCollections is an old-style "static bag" class (aka "sin bin"), wewould still need a holder class for state. (Collections could share asingle holder for multiple things; empty list, empty set, etc.)
Neither the left nor the right seems particularly better than theother here. (If we were putting this method on Collection, where itwould likely go in new code since now interfaces can have statics, thespecies approach would win, since we'd not need the holder class anymore.)
2.  Instantiation tracking.

Species
        Singleton
class Foo<any T> {
    private species int count;
    private species List<Foo<T>> foos;

    public Foo() {
        ++count;
        foos.add(this);
    }
}
        class Foo<any T> {
    private singleton FooStuff<T> {
        private int count;
        private List<Foo<T>> foos;
    }

    public Foo() {
        ++Foo<T>.count;
        Foo<T>.foos.add(this);
    }
}
Because the state is directly tied to the instantiation, the leftseems more attractive -- doesn't require an extra artifact, and theconstructor body seems more straightforward.
3. Implicit-like associations. Here, we're caching typeassociations. For example, suppose we have a Box<T>, and we want tocache the associated class for List<T>.
Species
        Singleton
class Box<any T> {
    private species Class<List<T>> listClass
        = Class.forSpecialization(List, T.crass);
}
        class Box<any T> {
    private singleton ListBuddy<any T> {
        Class<List<T>> clazz
            = Class.forSpecialization(List, T.crass);
    }
}
The extra singleton declaration feels like "noise" here, because againthe association is with the full set of type args for the class.
4. Static factories. Arguably, it makes sense to move factories tothe types they describe.
Species
        Singleton
interface List<any T> {
    private species List<T> empty = new EmptyList<>();
    species List<T> emptyList() { return empty; }
}
        interface List<any T> {
    private singleton Stuff<any T> {
        List<T> empty = new EmptyList<>();
    }
    species List<T> emptyList() { return Stuff<T>.empty; }
}


In this model, you'd get an empty list with

    List<T> aList = List<T>.empty()
rather than
List<T> aList = Collections.<T>empty();
In the latter, the type witnesses can be omitted; in the former theyprobably can be as well but that's something new.
5. Typevar shredding. Here, we have separate state for differentsubsets of variables. This should be the place where the singletonapproach shines.
Species
        Singleton
class HashMap<any K, any V> {
    private static class Keys<any K> {
        species Set<K> allKeys = ...
    }

    private static class Vals<any V> {
species Set<V> allVals = ...
    }

    void put(K k, V v) {
        Keys<K>.allKeys.add(k);
Vals<V>.allVals.add(v);
    }
}
        class HashMap<any K, any V> {
    private singleton Keys<any K> {
        Set<K> allKeys = ...
    }

    private singleton Vals<any V> {
Set<V> allVals = ...
    }

    void put(K k, V v) {
        Keys<K>.allKeys.add(k);
Vals<V>.allVals.add(v);
    }
}
But, it doesn't really shine that much; the left is not really muchworse than the right, just a little more fussy.
In cases where the singleton approach is more natural, thecorresponding "species in static class" idiom isn't so bad either.But in cases where the species approach is more natural, there'ssomething unappealing about creating classes (both in source andruntime footprint) in cases 2/3/4 when we don't need one. The onlyplace where the singleton approach seems to win big is when there aremultiple variables in the same scope bound by invariants -- here, thesingleton having a ctor is a big win -- but how often does this happen?
So our conclusion is that the species-placement is as good or betterfor the identified use cases -- and it also fits cleanly into theexisting model for member placement.

Re: Species-static members vs singletons

Reply via email to