GitHub user ueshin commented on a diff in the pull request:
https://github.com/apache/spark/pull/20211#discussion_r161716594
--- Diff: python/pyspark/sql/group.py ---
@@ -233,6 +233,27 @@ def apply(self, udf):
| 2| 1.1094003924504583|
+---+-------------------+
+ Notes on grouping column:
--- End diff --
I'd like to confirm what the final result schema will look like.
If users want the grouping keys in the result, does the udf have to include
them in its own output, i.e. the keys will not be prepended automatically?
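
To make the question concrete, here is a plain-Python sketch of the behaviour I am assuming. It is not PySpark and not the code in this PR: `grouped_apply`, `center_without_key` and `center_with_key` are hypothetical names, and lists of dicts stand in for DataFrames. The assumption being illustrated is that the result contains exactly the columns the udf returns, so the grouping key shows up only if the udf puts it there.

```python
from collections import defaultdict


def grouped_apply(rows, key, func):
    """Toy stand-in for groupby(key).apply(func); an assumption, not Spark's code."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[key]].append(row)
    result = []
    for group_rows in groups.values():
        # The output is exactly what func returns: the key is NOT prepended.
        result.extend(func(group_rows))
    return result


def center_without_key(group_rows):
    # Subtract the group mean from v; the "id" key is dropped from the output.
    mean = sum(r["v"] for r in group_rows) / len(group_rows)
    return [{"v": r["v"] - mean} for r in group_rows]


def center_with_key(group_rows):
    # Same computation, but the udf carries the "id" key through explicitly.
    mean = sum(r["v"] for r in group_rows) / len(group_rows)
    return [{"id": r["id"], "v": r["v"] - mean} for r in group_rows]


rows = [{"id": 1, "v": 1.0}, {"id": 1, "v": 3.0}, {"id": 2, "v": 5.0}]
without_key = grouped_apply(rows, "id", center_without_key)
with_key = grouped_apply(rows, "id", center_with_key)
```

Under that assumption, `without_key` has only a `v` column, while `with_key` has both `id` and `v`. If the keys were prepended automatically instead, the second variant would end up with a duplicated `id` column.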