sunchao commented on a change in pull request #33930:
URL: https://github.com/apache/spark/pull/33930#discussion_r736031215
##########
File path:
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/expressions.scala
##########
@@ -813,6 +860,39 @@ object NullPropagation extends Rule[LogicalPlan] {
}
+/**
+ * Unwrap the input of IsNull/IsNotNull if the input is NullIntolerant
+ * E.g. IsNull(Not(null)) == IsNull(null)
+ */
+object NullDownPropagation extends Rule[LogicalPlan] {
+ // Not all NullIntolerant can be propagated
+ // Return false if the expression may return null without non-null inputs.
+ // E.g. Cast is NullIntolerant; however, cast('Infinity' as integer) returns null.
+ // Cannot apply to `ExtractValue` as the query planner uses the trait to resolve the columns.
+ // E.g. the planner may resolve column `a` to `a#123`, then IsNull(a#123) cannot be optimized
+ // Applying to `EqualTo` is too disruptive for [SPARK-32290] test cases
+ // e with multiple children requires the deterministic check because optimizing IsNull(a > b) to
+ // Or(IsNull(a), IsNull(b)), for example, may cause skipping the evaluation of b
+ private def supportedNullIntolerant(e: NullIntolerant): Boolean = (e match {
+ case _: Not => true
+ case _: GreaterThan | _: GreaterThanOrEqual | _: LessThan | _: LessThanOrEqual
Review comment:
nit: why is `EqualTo` not handled here?
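For readers following the thread, the rewrite described in the quoted comments can be sketched in plain Scala. This is only a toy model, not Catalyst code: the `NullDownSketch` object, its `Expr` hierarchy, `evalBool`, `pushDown`, and `rewriteAgrees` are hypothetical names made up for illustration, and SQL NULL is modelled as `None`. It covers only `Not` and `GreaterThan`, leaves `EqualTo` out as the quoted diff does, and ignores the determinism concern mentioned in the comments.

```scala
object NullDownSketch {
  // Toy expression tree; these are NOT Spark's Catalyst classes.
  sealed trait Expr
  sealed trait IntExpr extends Expr
  sealed trait BoolExpr extends Expr

  final case class IntLit(value: Option[Int]) extends IntExpr      // None models SQL NULL
  final case class BoolLit(value: Option[Boolean]) extends BoolExpr
  final case class GreaterThan(left: IntExpr, right: IntExpr) extends BoolExpr
  final case class Not(child: BoolExpr) extends BoolExpr
  final case class Or(left: BoolExpr, right: BoolExpr) extends BoolExpr
  final case class IsNull(child: Expr) extends BoolExpr

  def evalInt(e: IntExpr): Option[Int] = e match {
    case IntLit(v) => v
  }

  def evalBool(e: BoolExpr): Option[Boolean] = e match {
    case BoolLit(v) => v
    // Null-intolerant in this model: any null input gives a null result.
    case GreaterThan(l, r) => for (a <- evalInt(l); b <- evalInt(r)) yield a > b
    case Not(c) => evalBool(c).map(!_)
    // Three-valued OR: true wins, two falses give false, otherwise null.
    case Or(l, r) => (evalBool(l), evalBool(r)) match {
      case (Some(true), _) | (_, Some(true)) => Some(true)
      case (Some(false), Some(false)) => Some(false)
      case _ => None
    }
    // IsNull itself never returns null.
    case IsNull(c) => Some(c match {
      case i: IntExpr => evalInt(i).isEmpty
      case b: BoolExpr => evalBool(b).isEmpty
    })
  }

  // Push IsNull below the expressions modelled here.
  def pushDown(e: BoolExpr): BoolExpr = e match {
    case IsNull(Not(c)) => pushDown(IsNull(c))
    case IsNull(GreaterThan(l, r)) => Or(IsNull(l), IsNull(r))
    case other => other
  }

  // Compare original and rewritten expressions over a small set of inputs.
  def rewriteAgrees: Boolean = {
    val ints: Seq[Option[Int]] = Seq(None, Some(1), Some(2))
    val bools: Seq[Option[Boolean]] = Seq(None, Some(true), Some(false))
    val comparisons = for (a <- ints; b <- ints) yield IsNull(GreaterThan(IntLit(a), IntLit(b)))
    val negations = bools.map(v => IsNull(Not(BoolLit(v))))
    (comparisons ++ negations).forall(e => evalBool(e) == evalBool(pushDown(e)))
  }
}
```

In this model `NullDownSketch.rewriteAgrees` checks that the rewritten expression evaluates to the same value as the original for every combination of null and non-null literals it tries. Whether `EqualTo` could be added the same way in the real rule is a separate question that the sketch does not answer.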
##########
File path:
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/expressions.scala
##########
@@ -446,6 +446,53 @@ object BooleanSimplification extends Rule[LogicalPlan] with PredicateHelper {
}
+/**
+ * Move/Push `Not` operator if it's beneficial.
Review comment:
nit: maybe mention this is only for the case where leaf nodes are
boolean or null. The name appears to be more general.
##########
File path:
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/expressions.scala
##########
@@ -813,6 +860,39 @@ object NullPropagation extends Rule[LogicalPlan] {
}
+/**
+ * Unwrap the input of IsNull/IsNotNull if the input is NullIntolerant
+ * E.g. IsNull(Not(null)) == IsNull(null)
+ */
+object NullDownPropagation extends Rule[LogicalPlan] {
+ // Not all NullIntolerant can be propagated
+ // Return false if the expression may return null without non-null inputs.
Review comment:
Could you rephrase this sentence? I got confused while reading it.
Perhaps "Return true iff the expression returns a non-null result for all
non-null inputs"?
##########
File path:
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/expressions.scala
##########
@@ -813,6 +860,39 @@ object NullPropagation extends Rule[LogicalPlan] {
}
+/**
+ * Unwrap the input of IsNull/IsNotNull if the input is NullIntolerant
+ * E.g. IsNull(Not(null)) == IsNull(null)
+ */
+object NullDownPropagation extends Rule[LogicalPlan] {
+ // Not all NullIntolerant can be propagated
Review comment:
nit: could you write some comments explaining why not all NullIntolerant
expressions can be propagated? This would help future readers better
understand this code.
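As one possible illustration of the reason, based on the Cast example in the quoted diff: an expression can be null-intolerant (null in, null out) and still return null for a non-null input, so `IsNull(f(x))` and `IsNull(x)` can disagree. The sketch below is plain Scala, not Spark; `castToInt` and `isNull` are hypothetical helpers that only mimic the behaviour described in the diff comment, with SQL NULL modelled as `None`.

```scala
import scala.util.Try

object CastCounterexample {
  // Hypothetical stand-in for a string-to-integer cast: null input gives null,
  // and a non-null string that cannot be parsed also gives null.
  def castToInt(s: Option[String]): Option[Int] =
    s.flatMap(str => Try(str.trim.toInt).toOption)

  def isNull[A](v: Option[A]): Boolean = v.isEmpty

  val input: Option[String] = Some("Infinity")

  // The input is not null, but the cast result is, so unwrapping the cast
  // under IsNull would change the answer for this input.
  val beforeUnwrap: Boolean = isNull(castToInt(input))
  val afterUnwrap: Boolean = isNull(input)
}
```

Here `beforeUnwrap` is `true` and `afterUnwrap` is `false`, which is why an expression like this has to be excluded from the rule even though it is null-intolerant.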
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]