Github user cloud-fan commented on a diff in the pull request:
https://github.com/apache/spark/pull/20513#discussion_r166184445
--- Diff:
sql/core/src/main/scala/org/apache/spark/sql/execution/columnar/InMemoryTableScanExec.scala
---
@@ -61,6 +61,9 @@ case class InMemoryTableScanExec(
}) && !WholeStageCodegenExec.isTooManyFields(conf, relation.schema)
}
+ // TODO: revisit this. Shall we always turn off whole stage codegen if
the output data are rows?
+ override def supportCodegen: Boolean = supportsBatch
--- End diff --
In 2.4 we should look into this. My gut feeling is we don't need to enable
whole stage codegen for scan nodes that output data as rows.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]