Re: [swift-evolution] [Draft] Rename Sequence.elementsEqual

Michael Ilseman via swift-evolution Mon, 16 Oct 2017 09:19:21 -0700


> On Oct 16, 2017, at 7:20 AM, Thorsten Seitz via swift-evolution 
> <swift-evolution@swift.org> wrote:
> 
> 
> 
> Am 16.10.2017 um 07:19 schrieb Xiaodi Wu <xiaodi...@gmail.com 
> <mailto:xiaodi...@gmail.com>>:
> 
>> On Sun, Oct 15, 2017 at 11:57 PM, Thorsten Seitz <tseit...@icloud.com 
>> <mailto:tseit...@icloud.com>> wrote:
>> 
>> 
>> Am 16.10.2017 um 00:41 schrieb Xiaodi Wu via swift-evolution 
>> <swift-evolution@swift.org <mailto:swift-evolution@swift.org>>:
>> 
>>> On Sun, Oct 15, 2017 at 2:32 PM, Kevin Nattinger <sw...@nattinger.net 
>>> <mailto:sw...@nattinger.net>> wrote:
>>>> […]
>>>> Sets, as a mathematical concept, have no intrinsic order. However, 
>>>> instances of `Set`, which can be iterated over, *do* have at least one 
>>>> order which can be said to be intrinsic in the following sense: as long as 
>>>> iteration is possible, no API design can prevent that order from being 
>>>> observed and associated with the instance. Put another way, if you can use 
>>>> an instance of a type in a for...in loop, intrinsic to that functionality 
>>>> is a publicly visible order.
>>> 
>>> You keep saying this, I keep saying it’s only a technical “order” that is 
>>> an implementation detail
>>> 
>>> You keep saying it's an implementation detail, which it emphatically is 
>>> *not*. It's a *public guarantee* by the protocol that you can use an 
>>> instance of `Set` in a `for...in` loop, thereby exposing a publicly visible 
>>> order. An implementation detail is something
>> 
>> Being able to use a Set in a for...in loop does *not* make it ordered! The 
>> purpose is is just being able to do something with each element. That a 
>> for...loop works sequentially is just a side effect. Just imagine we had 
>> parallelized for...in loops.
>> 
>> No, it is not at all a "side effect." A for...in loop is a way of 
>> controlling the flow of code which accesses elements in a sequence one after 
>> another, and the correct behavior of code inside the loop depends on these 
>> semantics. A "parallel for" loop would be a totally different thing; 
>> arbitrary for...in loops can't be automatically "upgraded" to a "parallel 
>> for" loop because they have different semantics, and types that support 
>> "parallel for" would likely have to conform to a protocol other than 
>> `Sequence`.
> 
> Exactly.
> 
>>> that could go away with an alternative implementation. By contrast, no 
>>> implementation that permits an instance of `Set` being iterated over in a 
>>> `for...in` loop can avoid exposing at least one publicly visible order, 
>>> because it's not a matter of implementation. Put another way, by allowing 
>>> iteration, a type necessarily exposes at least one publicly visible order 
>>> and thereby models a sequence in at least one way.
>> 
>> Wrong. I could easily implement Set or another type to iterate in a random 
>> order over its elements, so each iteration would expose a different 
>> (meaningless) order.
>> 
>> No, you cannot implement `Set` in this way because it conforms to 
>> `Collection`, which guarantees a multi-pass sequence. Set *must expose the 
>> same order on every iteration*.
> 
> Set conforming to Collection is even worse than just conforming to Sequence 
> as a quote from the documentation shows: "In addition to the operations that 
> collections inherit from the Sequence protocol, you gain access to methods 
> that depend on accessing an element at a specific position in a collection."
> Clearly the elements of a Set do not have specific positions.
>


That’s not at all clear to me, could you elaborate? My understanding is that 
elements of Set definitely *do* have a position, and that’s why you can use an 
index on a set to retrieve the element. The same index on the same set 
retrieves the same element.


> Furthermore my argument still stands for types derived from Sequence, so 
> sidestepping it by pointing out that the situation is even worse for the 
> current implementation of Set won't work. Can you explain why a type derived 
> from Sequence which will iterate over its elements in random order should 
> have methods like dropFirst()?
> 
> 
>>  
>> This demonstrates that being able to be used in a for...in loop is about 
>> doing somthing with each element and not about element order.
>> 
>> Again, Sequence *already doesn't* require the order to be meaningful or even 
>> repeatable.
> 
> I know. That's why I am arguing that certain methods of Sequence make no 
> sense. At least we have reached that common ground. 
> So why do you insist on Sequence having methods like dropFirst() if the 
> notion of "first" does not make any sense?
> 
>> Notice that, above, I said at least one order. A conforming type could
>> expose as many different orders as iterations, but that changes nothing. I 
>> can still map an instance of that type to get an array of elements, and that 
>> array will reveal some order which is the result of the inner workings of 
>> the type.
>> 
>> The latter is an additional property which should be expressed in an 
>> additional protocol like Kevin suggested.
>> 
>> What useful generic algorithms would this protocol support that are not 
>> already possible?
> 
> It would allow expressing generic algorithms depending on an order.
> 
> -Thorsten
> 
> 
>> 
>> -Thorsten
>> 
>> 
>> 
>>>  
>>> and can’t be relied upon for anything and so we shouldn’t provide methods 
>>> that rely on it. I think this part of the discussion has reached the “agree 
>>> to disagree” stage.
>>> 
>>>> […]
>>>> You’re a fan of the principal of least surprise. Tell me, which would be 
>>>> less surprising: Set.dropFirst() actually drops a random element, or Set 
>>>> doesn’t have a dropFirst()? And if you think dropFirst() removing an 
>>>> element at random is not surprising, please explain why.
>>>> 
>>>> I think Set.dropFirst removing the first element that I observe on 
>>>> iteration is the least surprising answer, because Swift tells me that the 
>>>> stdlib Set models a set but that it is also a sequence.
>>> 
>>> Your logic is backwards. You’re saying it’s “least surprising” because 
>>> that’s how it’s currently implemented, not that it should be implemented 
>>> that way because it’s least surprising.
>>> 
>>> No, I'm saying it's least surprising because any type that supports 
>>> iterated access thereby exposes an order--not as an implementation detail 
>>> but as a matter of public API--and in the absence of any other order, 
>>> "first" must refer to that order so exposed.
>>>>>>> […]
>>>> 
>>>> And that’s PRECISELY why lexicographicallyEqual does not make sense to 
>>>> apply to unordered sets. There is no lexicographical comparison possible, 
>>>> so why do you keep insisting they should have a method that falsely claims 
>>>> to lexicographically compare them?
>>>> 
>>>> I agree! It doesn't make sense if no comparison is possible! But Swift 
>>>> tells me that a `Set` is a `Sequence`!
>>> 
>>> You keep making the circular argument that a Set should do things because 
>>> it currently does them. If you want to argue against changing things, argue 
>>> that things shouldn’t be changed, not that the current implementation is 
>>> correct because it is the current implementation.
>>> 
>>> No, I'm arguing that `Set`, by supporting iterated access, is not wrong to 
>>> conform to a protocol called `Sequence` because it does have an intrinsic 
>>> and publicly observable order, which is not an accident of a particular 
>>> implementation but is inherent to any type that supports iterated access. 
>>> Now, whether it's the *best choice* to conform `Set` to `Sequence` and 
>>> offer order-dependent functions is a matter of taste, but it isn't *wrong*.
>>>> […]
>>>> You will always have to account for this possibility, because Swift's 
>>>> `Equatable` explicitly allows "special values" to be not equal to 
>>>> themselves. This is, at least in part, in order to accommodate the IEEE 
>>>> decree that NaN != NaN:
>>> 
>>>> 
>>>> ```
>>>> let x = [Double.nan]
>>>> x.elementsEqual(x) // false
>>>> ```
>>> 
>>> NaN is special, one-shot and unordered sequences are not. Unless you think 
>>> that all unordered and single-pass sequences should compare false for 
>>> `elementsEqual`, this is irrelevant for any sequence that doesn’t contain 
>>> NaN and well-defined (false) for any that does.
>>> 
>>> Certainly, not all single-pass sequences should compare false to 
>>> themselves, but some should: for instance, an infinite single-pass stream 
>>> of all 1's should compare true to itself, but an infinite single-pass 
>>> stream of alternating 1's and 0's should compare false to itself. If you 
>>> write generic code that calls `elementsEqual`, it is pervasively incorrect 
>>> to test for identity by assuming that elementsEqual will return true on 
>>> reflexive comparison. NaN is only one of many reasons why such code would 
>>> blow up.
>>>  
>>> 
>>>> Changing this behavior is way beyond the scope of this thread (and has 
>>>> been the topic of hours (actually, weeks and weeks) of fun on this list 
>>>> previously).
>>> 
>>> Yes, I’ve seen the discussion on NaN and Comparable. It’s not the same 
>>> discussion.
>>> 
>>>> […]
>>>>> It would be better to say that the iteration order is well-defined. That 
>>>>> will almost always mean documented, and usually predictable though 
>>>>> obviously e.g. RNGs and iterating in random order will not be predictable 
>>>>> by design.
>>>> 
>>>> Wouldn't it then suffice to document, say, that a set's iteration order is 
>>>> the insertion order?
>>> 
>>> Now this actually gave me pause. I guess it does match what I said, but I 
>>> still take issue with the fact that two Sets could compare `==` but not 
>>> `elementsEqual`. I think that defining iteration order as insertion order 
>>> adds a piece of publicly documented state that goes beyond what a Set 
>>> really is. What you describe is really an OrderedSet, just without the 
>>> random-access manipulation.
>>> 
>>> a) There is no semantic requirement on the part of `==` to be equivalent to 
>>> an elementwise comparison when it is defined on a collection; in fact, one 
>>> could imagine that some exotic sequence might legitimately define equality 
>>> in a way that has nothing to do with elementwise comparison. Put another 
>>> way, `==` returning `true` does not imply `elementsEqual` returning `true`, 
>>> and `elementsEqual` returning `true` does not imply `==` returning `true`. 
>>> This applies equally to ordered collections and is independent of the 
>>> question of how to model unordered collections.
>>> 
>>> b) You keep writing that some Foo is really some Bar, but those are really 
>>> just names. What would be the harm if Swift's `Set` indeed simply models an 
>>> ordered set without random-access manipulation?
>>> 
>>> I’ll have to mull this over to see if I can come up with a coherent and 
>>> (more) specific requirement for what makes an Iterable a Sequence, since 
>>> clearly “documented” isn’t enough.  Perhaps something along the lines that 
>>> any two Sequences that compare equal must iterate the same.
>>> 
>>>> […]
>>>> Apple documentation calls this one of the "order-dependent" methods. It is 
>>>> surely acceptable for a type that conforms to an order-dependent protocol 
>>>> to have methods that are order-dependent; they do, however, have to be 
>>>> clearly order-dependent to avoid confusion on unordered types.
>>> 
>>> I’m not clear on what you’re trying to get across here. It seems you’re 
>>> saying unordered types shouldn’t have order-dependent methods, which is 
>>> exactly what I’ve been arguing.
>>> 
>>> No, I'm saying, essentially, that there are no truly unordered types in 
>>> Swift; `Set` and `Dictionary` lead double lives modeling unordered 
>>> collections on the one hand and ordered collections on the other. The 
>>> order-dependent methods can continue to exist; they just need to be clearly 
>>> named so that users know when they're using an instance of `Set` in the 
>>> manner of an unordered collection and when they're using an instance of 
>>> `Set` in the manner of an ordered collection.
>>>  
>>> 
>>>>  [...]
>>>> Then there are all the methods that imply a specific order of iteration. 
>>>> If the “sequence” is unordered, who knows what you’ll get? It is 
>>>> incredibly easy for an engineer to write a method that implicitly relies 
>>>> on a passed sequence being intrinsically ordered and another engineer to 
>>>> pass it an unordered “sequence.”  The first engineer could neglect to 
>>>> document the dependency, or even not notice it; or the second engineer 
>>>> could just fail to read the documentation thoroughly enough.  There is 
>>>> currently no way for the compiler to enforce passing only an object that 
>>>> is (or at least claims to be) intrinsically ordered.
>>>> 
>>>> It is also incredibly easy for such an engineer to use `for...in` instead 
>>>> to accomplish the same task, generic over ordered and unordered sequences 
>>>> whatever you name such distinguished protocols. I think your beef really 
>>>> still boils down to Set being compatible with `for...in` at all, as Jon 
>>>> acknowledges.
>>> 
>>> Not providing ordered functions for unordered collections makes the 
>>> developers think about what they actually need. If any object will do, they 
>>> can use for…in, .makeIterator().next(), or an `anyObject()` we provide as a 
>>> convenience. If they actually need the first from some specific order, it’s 
>>> a reminder they need to sort the objects first to get the right one.
>>> 
>>> The whole point of protocol hierarchies is to enable useful generic 
>>> algorithms. Here, the purpose of having a protocol that unites both ordered 
>>> and unordered collections is to permit the writing of generic algorithms 
>>> that operate on both; a user would want the first item from an ordered 
>>> collection or an arbitrary item (but the same one on multiple passes) from 
>>> an unordered collection. The name for that is currently `first`. Brent 
>>> demonstrated a trivial one-line example of such a use.
>>> 
>>> That’s particularly useful for functions that actually need an ordered 
>>> sequence; using OrderedSequence instead of Iterable (just as placeholders) 
>>> would be a firm reminder not to pass in an unordered collection.
>>> 
>>>>> […]
>>>> 
>>>> 
>>>> As I said, you're welcome to tackle the protocol hierarchy, but I really 
>>>> doubt it's within the realm of realistic endpoints for Swift 5. I'm just 
>>>> trying to propose a narrowly targeted pragmatic solution to one specific 
>>>> limited harm that might be deliverable by the next point release. As a 
>>>> great man once said, Swift is a pragmatic language.
>>> 
>>> If you want a pragmatic solution, fix the bug in functionality, don’t try 
>>> and rename the method to something obscure to cover it up.
>>> 
>>> What I'm arguing is that there *is no bug in functionality*, only a naming 
>>> problem. It is true that the current protocol hierarchy would not be my 
>>> preferred design, but that's a matter of taste in terms of, again, where to 
>>> draw the line between too much modeling or not enough. But that's not 
>>> tantamount to a *bug*.
>>>  
>>> If you want to limit the harm, override `equalObjects` on unordered 
>>> sequences to use `==` (very strongly preferred), or always `false` (less 
>>> desirable, but at least consistent)
>>> 
>>>>> […]
>>> 
>>>> 
>>>> The Swift stdlib deliberately eschews modeling everything in protocol 
>>>> hierarchies with the highest level of granularity. There's some fudging, 
>>>> deliberately, to find a happy medium between obtuse and approachable, 
>>>> between too many/too specialized and not enough. For example, I pushed for 
>>>> protocols such as `Field` and `Ring` at the top of the numeric hierarchy, 
>>>> which might allow complex number types to slot into the hierarchy more 
>>>> sensibly, for example. But we have a compromise protocol `Numeric` which 
>>>> doesn't quite have the same guarantees but is much more approachable. 
>>>> Notice that we also don't break down numeric protocols into `Addable`, 
>>>> `Subtractable`, etc.; we also have that fudge factor built into 
>>>> `Equatable`, as I mentioned.
>>> 
>>> Eh, one or two corner cases on a protocol is probably fine. What’s not fine 
>>> is over half (Sequence) or almost all (Collection) the methods not being 
>>> applicable.  There is a very clear gap there. We don’t need to fix 
>>> everything, but this is something that can and should be addressed.
>>> 
>>> This would be based on the premise that an instance of `Set` has no 
>>> intrinsic order; I disagree for the reasons above.
>>> _______________________________________________
>>> swift-evolution mailing list
>>> swift-evolution@swift.org <mailto:swift-evolution@swift.org>
>>> https://lists.swift.org/mailman/listinfo/swift-evolution 
>>> <https://lists.swift.org/mailman/listinfo/swift-evolution>
>> 
> _______________________________________________
> swift-evolution mailing list
> swift-evolution@swift.org
> https://lists.swift.org/mailman/listinfo/swift-evolution

_______________________________________________
swift-evolution mailing list
swift-evolution@swift.org
https://lists.swift.org/mailman/listinfo/swift-evolution

Re: [swift-evolution] [Draft] Rename Sequence.elementsEqual

Reply via email to