GitHub user icexelloss commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20908#discussion_r177791872

    --- Diff: python/pyspark/sql/functions.py ---
    @@ -2208,7 +2208,8 @@ def pandas_udf(f=None, returnType=None, functionType=None):
         1. SCALAR

            A scalar UDF defines a transformation: One or more `pandas.Series` -> A `pandas.Series`.
    -       The returnType should be a primitive data type, e.g., :class:`DoubleType`.
    +       The returnType should be a primitive data type, e.g., :class:`DoubleType` or
    +       arrays of a primitive data type (e.g. :class:`ArrayType`).
    --- End diff --

    Should :class:`ArrayType` be :class:`ArrayType(DoubleType)`?
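
For context, here is a rough sketch of what a scalar pandas UDF with an array return type might look like, with the element type spelled out as the question suggests. The names `pair_with_square` and `make_udf` are made up for illustration and are not part of the PR. Whether `ArrayType` is accepted as a return type depends on the Spark version and on this PR being merged, so treat that as an assumption.

```python
import pandas as pd


def pair_with_square(v: pd.Series) -> pd.Series:
    """Map each double x to the list [x, x * x]; one list per input row."""
    return v.apply(lambda x: [float(x), float(x) * float(x)])


def make_udf():
    """Wrap the function as a SCALAR pandas UDF (needs pyspark and pyarrow)."""
    # Imported lazily so the plain pandas function can be tried without Spark.
    from pyspark.sql.functions import pandas_udf, PandasUDFType
    from pyspark.sql.types import ArrayType, DoubleType

    # Assumption: array return types are supported for scalar pandas UDFs.
    # The element type is written out explicitly: ArrayType(DoubleType()).
    return pandas_udf(pair_with_square, ArrayType(DoubleType()), PandasUDFType.SCALAR)
```

The plain function can be checked on a `pandas.Series` without a Spark session; `make_udf()` would then be applied to a double column, e.g. `df.select(make_udf()(df.x))`, where `df` and `x` are placeholders.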