Github user gatorsmile commented on a diff in the pull request:

    https://github.com/apache/spark/pull/21331#discussion_r189715760
  
    --- Diff: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/Canonicalize.scala
 ---
    @@ -85,6 +87,10 @@ object Canonicalize {
         case Not(GreaterThanOrEqual(l, r)) => LessThan(l, r)
         case Not(LessThanOrEqual(l, r)) => GreaterThan(l, r)
     
    +    // order the list in the In operator
    +    case In(value, list) =>
    +      In(value, list.sortBy(_.semanticHash()))
    --- End diff --
    
    Why using `semanticHash` instead of `hashCode`? Do you have a test case to 
show the difference?


---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to