Github user cloud-fan commented on a diff in the pull request:
https://github.com/apache/spark/pull/19872#discussion_r162236262
--- Diff: python/pyspark/worker.py ---
@@ -110,6 +110,17 @@ def wrapped(*series):
return wrapped
+def wrap_pandas_group_agg_udf(f, return_type):
+ arrow_return_type = to_arrow_type(return_type)
+
+ def wrapped(*series):
+ import pandas as pd
+ result = f(*series)
+ return pd.Series(result)
--- End diff --
isn't the return type a scalar?
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]