Github user BryanCutler commented on a diff in the pull request:
https://github.com/apache/spark/pull/22329#discussion_r215345817
--- Diff: python/pyspark/sql/functions.py ---
@@ -2804,6 +2804,22 @@ def pandas_udf(f=None, returnType=None,
functionType=None):
| 1|1.5|
| 2|6.0|
+---+---+
+ >>> @pandas_udf(
+ ... "id long, additional_key double, v double",
--- End diff --
Sorry, I know you just changed it, but I think just naming the column
"ceil(v1 / 2)" with a type `long` would be a little more clear. Although
"additional_key" is ok too, if you guys want to keep that.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]