cloud-fan commented on a change in pull request #27466:
[SPARK-30722][PYTHON][DOCS] Update documentation for Pandas UDF with Python
type hints
URL: https://github.com/apache/spark/pull/27466#discussion_r377026073
##########
File path: python/pyspark/sql/pandas/functions.py
##########
@@ -348,10 +273,21 @@ def pandas_udf(f=None, returnType=None,
functionType=None):
.. note:: The user-defined functions do not take keyword arguments on the
calling side.
.. note:: The data type of returned `pandas.Series` from the user-defined
functions should be
- matched with defined returnType (see :meth:`types.to_arrow_type` and
+ matched with defined `returnType` (see :meth:`types.to_arrow_type` and
:meth:`types.from_arrow_type`). When there is mismatch between them,
Spark might do
conversion on returned data. The conversion is not guaranteed to be
correct and results
should be checked for accuracy by users.
+
+ .. note:: Currently,
+ :class:`pyspark.sql.types.MapType`,
+ :class:`pyspark.sql.types.ArrayType` of
:class:`pyspark.sql.types.TimestampType` and
+ nested :class:`pyspark.sql.types.StructType`
+ are currently not supported as output types.
Review comment:
what about input types?
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
With regards,
Apache Git Services
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]