Github user BryanCutler commented on a diff in the pull request:
https://github.com/apache/spark/pull/21650#discussion_r202863084
--- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/python/ExtractPythonUDFs.scala ---
@@ -167,7 +191,8 @@ object ExtractPythonUDFs extends Rule[SparkPlan] with PredicateHelper {
           case (vectorizedUdfs, plainUdfs) if vectorizedUdfs.isEmpty =>
             BatchEvalPythonExec(plainUdfs, child.output ++ resultAttrs, child)
           case _ =>
-            throw new IllegalArgumentException("Can not mix vectorized and non-vectorized UDFs")
+            throw new AnalysisException(
+              "Mixed Python and Scalar Pandas UDFs are not expected here")
--- End diff --
Change this to "Expected either Scalar Pandas UDFs or Batched UDFs but got both"
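For illustration only, here is a minimal standalone sketch of how the dispatch might read with the suggested wording. `UdfKind`, `Udf`, and `UdfDispatchSketch.chooseExec` are hypothetical stand-ins, not Spark types; the sketch returns a label instead of building a real exec node, assumes the Arrow-based node is the counterpart branch not shown in the hunk, and throws `IllegalStateException` rather than Spark's `AnalysisException` so that it compiles without Spark on the classpath.

```scala
// Hypothetical stand-ins for Spark's internal UDF representation (assumptions).
sealed trait UdfKind
case object BatchedUdf extends UdfKind
case object ScalarPandasUdf extends UdfKind

final case class Udf(name: String, kind: UdfKind)

object UdfDispatchSketch {
  // Wording proposed in the review comment.
  val MixedUdfMessage: String =
    "Expected either Scalar Pandas UDFs or Batched UDFs but got both"

  private def label(node: String, udfs: Seq[Udf]): String =
    node + "(" + udfs.map(_.name).mkString(", ") + ")"

  // Splits the UDFs by kind and picks one evaluation path.
  // Returns a label naming the node that would be chosen.
  def chooseExec(udfs: Seq[Udf]): String =
    udfs.partition(_.kind == ScalarPandasUdf) match {
      case (vectorizedUdfs, plainUdfs) if plainUdfs.isEmpty =>
        label("ArrowEvalPythonExec", vectorizedUdfs)
      case (vectorizedUdfs, plainUdfs) if vectorizedUdfs.isEmpty =>
        label("BatchEvalPythonExec", plainUdfs)
      case _ =>
        // Both kinds present: neither single-kind path applies.
        throw new IllegalStateException(MixedUdfMessage)
    }
}
```

With this wording the message names both accepted inputs, so a reader hitting the error can tell which combination was rejected.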
---