Github user gatorsmile commented on a diff in the pull request:
https://github.com/apache/spark/pull/21331#discussion_r189715760
--- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/Canonicalize.scala ---
@@ -85,6 +87,10 @@ object Canonicalize {
     case Not(GreaterThanOrEqual(l, r)) => LessThan(l, r)
     case Not(LessThanOrEqual(l, r)) => GreaterThan(l, r)
+    // order the list in the In operator
+    case In(value, list) =>
+      In(value, list.sortBy(_.semanticHash()))
--- End diff --
Why use `semanticHash` instead of `hashCode` here? Do you have a test case that
shows the difference?
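
For illustration, a test along these lines might look like the sketch below. It is only a rough sketch meant for `spark-shell` (or any setup with `spark-catalyst` on the classpath), not the test from this PR; the names `a`, `aRenamed`, `one`, `list1`, and `list2` are made up for the example. It relies on `semanticHash()` being the `hashCode` of the canonicalized expression, so cosmetic differences such as an attribute's name should not affect it, while the plain `hashCode` normally does change.

```scala
import org.apache.spark.sql.catalyst.expressions.{AttributeReference, Expression, Literal}
import org.apache.spark.sql.types.IntegerType

// Two references to the same attribute (same exprId) that differ only in name.
// These names are hypothetical and exist only for this example.
val a = AttributeReference("a", IntegerType)()
val aRenamed = a.withName("A")
val one = Literal(1)

// hashCode takes the attribute name into account, so the two values
// printed here are expected to differ (barring a hash collision).
println(s"hashCode: ${a.hashCode()} vs ${aRenamed.hashCode()}")

// semanticHash is computed on the canonicalized form, which ignores the name.
assert(a.semanticHash() == aRenamed.semanticHash())
assert(a.semanticEquals(aRenamed))

// On an expression that is already canonicalized, the two hashes coincide.
assert(a.canonicalized.hashCode() == a.semanticHash())

// Sorting two cosmetically different lists by semanticHash should give
// element-wise semantically equal results (again barring hash collisions).
val list1: Seq[Expression] = Seq(a, one)
val list2: Seq[Expression] = Seq(one, aRenamed)
val sorted1 = list1.sortBy(_.semanticHash())
val sorted2 = list2.sortBy(_.semanticHash())
assert(sorted1.zip(sorted2).forall { case (x, y) => x.semanticEquals(y) })
```

If the elements of `list` are already canonicalized by the time this rule runs, sorting by `hashCode` would presumably give the same order, which is why a concrete test case would help here.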