Re: [swift-dev] Reconsidering the global uniqueness of type metadata and protocol conformance instances

John McCall via swift-dev Fri, 28 Jul 2017 16:52:53 -0700

> On Jul 28, 2017, at 7:45 PM, Joe Groff <jgr...@apple.com> wrote:
>> On Jul 28, 2017, at 4:30 PM, John McCall <rjmcc...@apple.com 
>> <mailto:rjmcc...@apple.com>> wrote:
>> 
>>> On Jul 28, 2017, at 7:11 PM, Joe Groff <jgr...@apple.com 
>>> <mailto:jgr...@apple.com>> wrote:
>>>> On Jul 28, 2017, at 3:54 PM, John McCall <rjmcc...@apple.com 
>>>> <mailto:rjmcc...@apple.com>> wrote:
>>>>> On Jul 28, 2017, at 6:34 PM, Joe Groff <jgr...@apple.com 
>>>>> <mailto:jgr...@apple.com>> wrote:
>>>>>> On Jul 28, 2017, at 3:30 PM, John McCall <rjmcc...@apple.com 
>>>>>> <mailto:rjmcc...@apple.com>> wrote:
>>>>>>> On Jul 28, 2017, at 6:24 PM, Joe Groff <jgr...@apple.com 
>>>>>>> <mailto:jgr...@apple.com>> wrote:
>>>>>>>> On Jul 28, 2017, at 3:15 PM, John McCall <rjmcc...@apple.com 
>>>>>>>> <mailto:rjmcc...@apple.com>> wrote:
>>>>>>>> 
>>>>>>>> 
>>>>>>>>> On Jul 28, 2017, at 6:02 PM, Andrew Trick via swift-dev 
>>>>>>>>> <swift-dev@swift.org <mailto:swift-dev@swift.org>> wrote:
>>>>>>>>> 
>>>>>>>>> 
>>>>>>>>>> On Jul 28, 2017, at 2:20 PM, Joe Groff via swift-dev 
>>>>>>>>>> <swift-dev@swift.org <mailto:swift-dev@swift.org>> wrote:
>>>>>>>>>> 
>>>>>>>>>> The Swift runtime currently maintains globally unique pointer 
>>>>>>>>>> identities for type metadata and protocol conformances. This makes 
>>>>>>>>>> checking type equivalence a trivial pointer equality comparison, but 
>>>>>>>>>> most operations on generic values do not really care about exact 
>>>>>>>>>> type identity and only need to invoke value or protocol witness 
>>>>>>>>>> methods or consult other data in the type metadata structure. I 
>>>>>>>>>> think it's worth reevaluating whether having globally unique type 
>>>>>>>>>> metadata objects is the correct design choice. Maintaining global 
>>>>>>>>>> uniqueness of metadata instances carries a number of costs. Any code 
>>>>>>>>>> that wants type metadata for an instance of a generic type, even a 
>>>>>>>>>> fully concrete one, must make a potentially expensive runtime call 
>>>>>>>>>> to get the canonical metadata instance. This also greatly 
>>>>>>>>>> complicates our ability to emit specializations of type metadata, 
>>>>>>>>>> value witness tables, or protocol witness tables for concrete 
>>>>>>>>>> instances of generic types, since specializations would need to be 
>>>>>>>>>> registered with the runtime as canonical metadata objects, and it 
>>>>>>>>>> would be difficult to do this lazily and still reliably favor 
>>>>>>>>>> specializations over more generic witnesses. The lack of witness 
>>>>>>>>>> table specializations leaves an obnoxious performance cliff for 
>>>>>>>>>> instances of generic types that end up inside existential containers 
>>>>>>>>>> or cross into unspecialized code. The runtime also obligates 
>>>>>>>>>> binaries to provide the canonical metadata for all of their public 
>>>>>>>>>> types, along with all the dependent value witnesses, class methods, 
>>>>>>>>>> and protocol witness tables, meaning a type abstraction can never be 
>>>>>>>>>> completely "zero-cost" across modules.
>>>>>>>>>> 
>>>>>>>>>> On the other hand, if type metadata did not need to be unique, then 
>>>>>>>>>> the compiler would be free to emit specialized type metadata and 
>>>>>>>>>> protocol witness tables for fully concrete non-concrete value types 
>>>>>>>>>> without consulting the runtime. This would let us avoid runtime 
>>>>>>>>>> calls to fetch metadata in specialized code, and would make it much 
>>>>>>>>>> easier for us to implement witness specialization. It would also 
>>>>>>>>>> give us the ability to potentially extend the "inlinable" concept to 
>>>>>>>>>> public fragile types, making it a client's responsibility to emit 
>>>>>>>>>> metadata for the type when needed and keeping the type from 
>>>>>>>>>> affecting its home module's ABI. This could significantly reduce the 
>>>>>>>>>> size and ABI surface area of the standard library, since the 
>>>>>>>>>> standard library contains a lot of generic lightweight adapter types 
>>>>>>>>>> for collections and other abstractions that are intended to be 
>>>>>>>>>> optimized away in most use cases.
>>>>>>>>>> 
>>>>>>>>>> There are of course benefits to globally unique metadata objects 
>>>>>>>>>> that we would lose if we gave up uniqueness. Operations that do 
>>>>>>>>>> check type identity, such as comparison, hashing, and dynamic 
>>>>>>>>>> casting, would have to perform more expensive checks, and nonunique 
>>>>>>>>>> metadata objects would need to carry additional information to 
>>>>>>>>>> enable those checks. It is likely that class objects would have to 
>>>>>>>>>> remain globally unique, if for no other reason than that the 
>>>>>>>>>> Objective-C runtime requires it on Apple platforms. Having multiple 
>>>>>>>>>> equivalent copies of type metadata has the potential to increase the 
>>>>>>>>>> working set of an app in some situations, although it's likely that 
>>>>>>>>>> redundant compiler-emitted copies of value type metadata would at 
>>>>>>>>>> least be able to live in constant pages mapped from disk instead of 
>>>>>>>>>> getting dynamically instantiated by the runtime like everything is 
>>>>>>>>>> today. There could also be subtle source-breaking behavior for code 
>>>>>>>>>> that bitcasts metatype values to integers or pointers and expects 
>>>>>>>>>> bit-level equality to indicate type equality. It's unlikely to me 
>>>>>>>>>> that giving up uniqueness would buy us any simplification to the 
>>>>>>>>>> runtime, since the runtime would still need to be able to 
>>>>>>>>>> instantiate metadata for unspecialized code, and we would still want 
>>>>>>>>>> to unique runtime-instantiated metadata objects as an optimization.
>>>>>>>>>> 
>>>>>>>>>> Overall, my intuition is that the tradeoffs come out in favor for 
>>>>>>>>>> nonunique metadata objects, but what do you all think? Is there 
>>>>>>>>>> anything I'm missing?
>>>>>>>>>> 
>>>>>>>>>> -Joe
>>>>>>>>> 
>>>>>>>>> In a premature proposal two years ago, we agreed to ditch unique 
>>>>>>>>> protocol conformances but install the canonical address as the first 
>>>>>>>>> entry in each specialized table.
>>>>>>>> 
>>>>>>>> This would be a reference to (unique) global data about the 
>>>>>>>> conformance, not a reference to some canonical version of the protocol 
>>>>>>>> witness table.  We do not rely on having a canonical protocol witness 
>>>>>>>> table.  The only reason we unique them (when we do need to 
>>>>>>>> instantiate) is because we don't want to track their lifetimes.
>>>>>>>> 
>>>>>>>>> That would mitigate the disadvantages that you pointed to. But, we 
>>>>>>>>> would also lose the ability to emit specialized metadata/conformances 
>>>>>>>>> in constant pages. How do you feel about that tradeoff?
>>>>>>>> 
>>>>>>>> Note that, per above, it's only specialized constant type metadata 
>>>>>>>> that we would lose.
>>>>>>>> 
>>>>>>>> I continue to feel that having to do structural equality tests on type 
>>>>>>>> metadata would be a huge loss.
>>>>>>> 
>>>>>>> I don't think it necessarily needs to be deep structural equality. If 
>>>>>>> the type metadata object or value witness table had a pointer to a 
>>>>>>> mangled type name string, we could strcmp those strings to compare 
>>>>>>> equality, which doesn't seem terribly onerous to me, though if it were 
>>>>>>> we could perhaps use the string to lazily resolve the canonical type 
>>>>>>> metadata pointer, sort of like we do with type metadata for imported C 
>>>>>>> types today.
>>>>>> 
>>>>>> So generic code to instantiate type metadata would have to construct 
>>>>>> these mangled strings eagerly?
>>>>> 
>>>>> We already do exactly that for the ObjC runtime name of generic class 
>>>>> instantiations, for what it's worth, but it could conceivably be lazy as 
>>>>> well, at the cost of making the comparison yet more expensive. There 
>>>>> aren't that many runtime operations that need to do type comparison, 
>>>>> though—the ones I can think of are casting and the equality/hashing 
>>>>> operations on Any.Type—so how important is efficient type comparison?
>>>> 
>>>> A fair question.  It's extremely important for type uniquing — of course, 
>>>> you're talking about making that less important, but when it does happen, 
>>>> it will cost more.
>>>> 
>>>> The way I see it is that the importance of specialization is 95% about 
>>>> specializing tables of function pointers, i.e. value witness tables, 
>>>> protocol witness tables, and class v-tables.  There's no reason we can't 
>>>> use specialized protocol witness tables today.  Your proposal still leaves 
>>>> us uniquing class v-tables.  So this is just about making it easier (on us 
>>>> as implementors) to create specialized value witness tables, plus the 
>>>> trade-off of being able to refer to non-dependent type metadata slightly 
>>>> more cheaply vs. making type comparisons vastly more expensive.
>>> 
>>> Well, what do you think about the possibility of making some public types 
>>> in the standard library "always-emit-into-client"? AIUI a lot of the 
>>> standard library's space and ABI surface area is spent on type metadata and 
>>> conformances for things that almost always get inlined in practice, so I 
>>> think there's also a potential to shrink the stdlib's size and ABI 
>>> liability. (To be fair, we could also potentially accomplish that using the 
>>> foreign metadata table today if it was interesting.)
>> 
>> Well, first, I think our metadata could pretty easily go on a diet, even 
>> apart from any question of laziness.  Value type metadata don't need to 
>> store a parent, nominal type descriptors are not optimized for compactness, 
>> generic patterns are extremely bloated, etc.  (Do we really even need a 
>> "pattern" to instantiate a type?)  All of this is stuff we need to do for 
>> ABI stability.
> 
> Fair point, though there's also a lot of stuff that hangs off of the 
> metadata, particularly value witnesses, that could be lazified with it.


For sure.  Although I suspect we could get a lot of that back by structurally 
uniquing value witnesses!  Every generic type which just wraps a collection has 
basically exactly the same representation...

John.

_______________________________________________
swift-dev mailing list
swift-dev@swift.org
https://lists.swift.org/mailman/listinfo/swift-dev

Re: [swift-dev] Reconsidering the global uniqueness of type metadata and protocol conformance instances

Reply via email to