Github user cloud-fan commented on a diff in the pull request:
https://github.com/apache/spark/pull/18659#discussion_r138273168
--- Diff: python/pyspark/sql/functions.py ---
@@ -2112,7 +2113,7 @@ def wrapper(*args):
@since(1.3)
-def udf(f=None, returnType=StringType()):
+def udf(f=None, returnType=StringType(), vectorized=False):
--- End diff --
I think `@pandas_udf(DoubleType())` is better than `@udf(DoubleType(),
vectorized=True)` because it is more concise.
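
To make the comparison concrete, here is a minimal sketch in plain Python that runs without Spark. Everything in it is an assumption for illustration: `DoubleType` is a placeholder class, and `udf` and `pandas_udf` are simplified stand-ins that only record metadata, not the PySpark implementations (the `udf` here takes the return type as its first argument, unlike the `f=None, returnType=...` signature in the diff).

```python
import functools


class DoubleType:
    """Placeholder for pyspark.sql.types.DoubleType so the sketch runs without Spark."""


def udf(return_type, vectorized=False):
    """Simplified stand-in for the flag-based form discussed in the diff."""
    def decorator(f):
        @functools.wraps(f)
        def wrapper(*args):
            return f(*args)
        # Record the metadata a real implementation would hand to the engine.
        wrapper.return_type = return_type
        wrapper.vectorized = vectorized
        return wrapper
    return decorator


def pandas_udf(return_type):
    """Hypothetical dedicated decorator: equivalent to udf(..., vectorized=True)."""
    return udf(return_type, vectorized=True)


# Flag-based spelling.
@udf(DoubleType(), vectorized=True)
def plus_one_flag(v):
    return v + 1.0


# Dedicated-decorator spelling.
@pandas_udf(DoubleType())
def plus_one_dedicated(v):
    return v + 1.0
```

In this sketch both spellings produce a function with the same recorded metadata, so the difference is only in how the call site reads.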