Re: Species-static members vs singletons

Maurizio Cimadamore Mon, 23 May 2016 07:28:59 -0700


On 23/05/16 15:20, Brian Goetz wrote:

Right. And Peter’s question is: (a) did we think of this (yes) and(b) are we OK with this. Which I think is also yes?

I think it's yes; an unfortunate accident of erasure - I don't see anyother way around it at the moment.


Maurizio

On May 23, 2016, at 7:18 AM, Maurizio Cimadamore<[email protected]<mailto:[email protected]>> wrote:
Sorry - I now realize that the point I made in my earlier email wasunclear.
What I'm suggesting is to have a single rule for generating uncheckedwarnings that goes like this:
"If the qualifier of a species static access is not reifiable, anunchecked warning should occur".
In the example Peter sent, the only thing worth mentioning is thatthe qualifier is 'implicit' (i.e. can be omitted and be assumed to bethe current class Foo<T>); now since Foo<T> is not reifiable, everyunqualified access to 'st' from Foo<T> will get a warning -excluding, of course, accesses occurring in a context where T isrestricted (i.e. __WhereVal(T)).
Maurizio

On 23/05/16 14:56, Brian Goetz wrote:
Note that we have this same problem with unchecked warnings today inmany of the use cases. For example, in the “cached empty list”case, we always have to use an unchecked cast to cast the cachedlist to the desired type. When we use species-static to do thesame, and it is possible that the species could correspond to morethan one T, we still have to do the same unchecked warning (and asyou mention, the singleton form has the same problem.) I think itsan unescapable consequence of erasure, but one we’re already sort ofcomfortable with.
If you use a more constrained type selector (e.g., List<int>), youwon’t get a warning, as the compiler will know that st is exactly int.
On May 23, 2016, at 3:05 AM, Maurizio Cimadamore<[email protected]<mailto:[email protected]>> wrote:
Hi Peter,
are you sure we need special treatment for 'it = st' ? After all,the compiler will issue unchecked warnings every time you'll try toaccess a species static from a non-reifiable type i.e.
Foo<String>.st = ""; //warn
Foo<int>.st = 42; //no warn
In other words, can we put the burden of heap pollution-ness on theclient and be happy?
Maurizio

On 22/05/16 23:58, Peter Levart wrote:
Hi Brian,
I agree that "species" placement is a better, less verbose option.But how to solve the language problem of having "species" and"instance" members of the same "type-variable" type be assignableto one-another? For example:
class Foo<any T> {
    species T st;
    T it;

    void m() {
        it = st; // this can not be allowed
        st = it; // this can be allowed

        // maybe this could be allowed?
        @SuppressWarnings("unchecked")
        it = (T) st;
    }


Singleton abstraction has the same problem.
So while technically possible, it would be weird to have 'T'sometimes not be assignable to 'T'. Can we live with that?
Regards, Peter

On 05/19/2016 04:36 PM, Brian Goetz wrote:
We discussed two primary means to surface species-specificmembers in the language: a "species" placement (name TBD) asdistinct from static and instance, or a "singleton" abstraction(a la Scala's "object" abstraction, as Peter L suggested). We'vedone some experiments comparing the two approaches.
Separately, we discussed two strategies for handling this at theVM level: having three separate placements (ACC_STATIC,ACC_SPECIES, and instance) or retconning ACC_STATIC to mean"species" and using compiler trickery to simulate traditionalstatics. In recent discussions with Oracle and IBM VM folks,they seemed happy enough with having a new placement (andpossibly new bytecodes, {get,put,invoke}species, or overloadingthese onto *static with ParamTypes in the owner field of thevarious XxxRef constants.)
There are several places where the language itself can takeadvantage of species members:
1. Reifying type variables. For an any-generic class Foo<T,U>,the compiler can generate public static finalreflection-thingie-valued fields called "T" and "U", which meansthat "aFoo.T" (as an ordinary field ref!) would evaluate to thereflective mirror for the reified T -- if present, otherwise itwould evaluate to the reflective mirror for 'erased'.
2. Representation of generic methods. The current translationstrategy has us translating any-generic methods to classes; astatic method
    static<any T> void foo(T t) { }

translates to a class (plus an erased bridge):
bridge static foo(Object o) { ... invoke erasedspecialization ... }
    static class Xxx$foo<any T> {
        void foo(T t) { ... }
    }
This means that an instance of Xxx$foo is needed to invoke themethod -- but serves solely to carry the type variables -- whichis unfortunate. If instead we translate as:
    static class Xxx$foo<any T> {
*species-static *void foo(T t) { ... }
    }

then we can invoke this method via invokespecies:

    invokespecies ParamType[Xxx$foo, T_inf].foo(T_inf)
where T_inf is the erasure-normalized type inferred for T(reified if value, `erased` reference.) No fake receiver required.
The translation for generic instance methods is still somewhatmessier (will post separately), but still less messy than if wealso had to manage / cache a receiver.
We also drafted some examples of how such a facility would beused, writing them both with species-static and with singleton.Examples and notes below; the summary is that in all cases, thespecies-static version is either better or about as good.
1.  The old favorite, caching an instantiated instance.

Species
        Singleton
class Collections {
    private static class Holder<any T> {
        private species List<T> empty = new EmptyList<T>();
    }

    static<any T> List<T> emptyList() { return Holder<T>.empty; }
}
        class Collections {
    private singleton Holder<any T> {
        private empty = new EmptyList<T>();
    }

    static<any T> List<T> emptyList() { return Holder<T>.empty; }
}
Note that in this case, species by itself isn't enough -- westill need a holder class, and its a bit ugly. Arguably we couldmerge Holder into EmptyList (if that's under our control) butbecause Collections is an old-style "static bag" class (aka "sinbin"), we would still need a holder class for state. (Collectionscould share a single holder for multiple things; empty list,empty set, etc.)
Neither the left nor the right seems particularly better than theother here. (If we were putting this method on Collection, whereit would likely go in new code since now interfaces can havestatics, the species approach would win, since we'd not need theholder class any more.)
2.  Instantiation tracking.

Species
        Singleton
class Foo<any T> {
    private species int count;
    private species List<Foo<T>> foos;

    public Foo() {
        ++count;
        foos.add(this);
    }
}
        class Foo<any T> {
    private singleton FooStuff<T> {
        private int count;
        private List<Foo<T>> foos;
    }

    public Foo() {
++Foo<T>.count;
Foo<T>.foos.add(this);
    }
}
Because the state is directly tied to the instantiation, the leftseems more attractive -- doesn't require an extra artifact, andthe constructor body seems more straightforward.
3. Implicit-like associations. Here, we're caching typeassociations. For example, suppose we have a Box<T>, and we wantto cache the associated class for List<T>.
Species
        Singleton
class Box<any T> {
    private species Class<List<T>> listClass
        = Class.forSpecialization(List, T.crass);
}
        class Box<any T> {
    private singleton ListBuddy<any T> {
Class<List<T>> clazz
            = Class.forSpecialization(List, T.crass);
    }
}
The extra singleton declaration feels like "noise" here, becauseagain the association is with the full set of type args for theclass.
4. Static factories. Arguably, it makes sense to move factoriesto the types they describe.
Species
        Singleton
interface List<any T> {
    private species List<T> empty = new EmptyList<>();
    species List<T> emptyList() { return empty; }
}
        interface List<any T> {
    private singleton Stuff<any T> {
        List<T> empty = new EmptyList<>();
    }
    species List<T> emptyList() { return Stuff<T>.empty; }
}


In this model, you'd get an empty list with

    List<T> aList = List<T>.empty()
rather than
List<T> aList = Collections.<T>empty();
In the latter, the type witnesses can be omitted; in the formerthey probably can be as well but that's something new.
5. Typevar shredding. Here, we have separate state fordifferent subsets of variables. This should be the place wherethe singleton approach shines.
Species
        Singleton
class HashMap<any K, any V> {
    private static class Keys<any K> {
        species Set<K> allKeys = ...
    }

    private static class Vals<any V> {
species Set<V> allVals = ...
    }

    void put(K k, V v) {
Keys<K>.allKeys.add(k);
Vals<V>.allVals.add(v);
    }
}
        class HashMap<any K, any V> {
    private singleton Keys<any K> {
        Set<K> allKeys = ...
    }

    private singleton Vals<any V> {
Set<V> allVals = ...
    }

    void put(K k, V v) {
Keys<K>.allKeys.add(k);
Vals<V>.allVals.add(v);
    }
}
But, it doesn't really shine that much; the left is not reallymuch worse than the right, just a little more fussy.
In cases where the singleton approach is more natural, thecorresponding "species in static class" idiom isn't so badeither. But in cases where the species approach is more natural,there's something unappealing about creating classes (both insource and runtime footprint) in cases 2/3/4 when we don't needone. The only place where the singleton approach seems to win bigis when there are multiple variables in the same scope bound byinvariants -- here, the singleton having a ctor is a big win --but how often does this happen?
So our conclusion is that the species-placement is as good orbetter for the identified use cases -- and it also fits cleanlyinto the existing model for member placement.

Re: Species-static members vs singletons

Reply via email to