Github user cloud-fan commented on a diff in the pull request:
https://github.com/apache/spark/pull/18659#discussion_r138272886
--- Diff: core/src/main/scala/org/apache/spark/api/python/PythonRDD.scala
---
@@ -83,10 +83,23 @@ private[spark] case class PythonFunction(
*/
private[spark] case class ChainedPythonFunctions(funcs:
Seq[PythonFunction])
+/**
+ * Enumerate the type of command that will be sent to the Python worker
+ */
+private[spark] object PythonEvalType {
+ val NON_UDF = 0
+ val SQL_BATCHED_UDF = 1
+ val SQL_ARROW_UDF = 2
--- End diff --
the new udf parameter is pandas `Series`, I think it's more accurate to
call it `SQL_PANDAS_UDF`.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]