Github user mgaido91 commented on the issue:

    https://github.com/apache/spark/pull/21449
  
    yes @daniel-shields, you are right with your analysis. The problem was that 
we were sometimes using `==`, sometimes `semanticEquals`. And `equals` has the 
problem you mentioned.
    
    I think this is the only way for addressing the problem described here is 
to reference which dataset the column is coming from. I think adding a metadata 
for it is the cleanest way. We may also add a new attribute to the `Attribute` 
class instead of using metadata, but honestly this way seemed cleaner to me. 
What do you think? Do you have other suggestions?
    
    cc @cloud-fan @hvanhovell @gatorsmile  


---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to